LETTERS TO THE EDITOR 


As an experiment in mathematical communication, the MONTHLY will accept and print Letters 
to the Editor on mathematical subjects of interest to members of the Association. 

The emphasis is on mathematics, and, in particular, not on politics of any sort, be it 
mathematical, academic, national, or global. 

The length of a letter should typically be not more than half a printed page. 

Letters could be comments on articles (including errors in, additions to, and digressions 
inspired by them), suggested rephrasings of problems, disagreements with proposed teaching 
methods, objections to book reviews (based on mathematical grounds), mathematical observations 
(either new or so old that most of us have forgotten them), possibly overlooked references of 
possibly wide interest, queries about a mathematical statement (is it known, who did it, where, 
and when), attractive and challenging “quickie” problems—or anything else that both the readers 
and the editors of the MONTHLY consider to be of mathematical significance. 

Send letters to P. R. Halmos, Department of Mathematics, Indiana University, Bloomington, 
Indiana 47405. 


(See page 67 of this issue.) 


ANOTHER LOOK AT THE CHEBYSHEV POLYNOMIALS 


E. L. ORTIZ 
Department of Mathematics, Imperial College of Science and Technology, Queen’s Gate, 
London, England SW7 2BZ 


T. J. RIVLIN 
Mathematical Sciences Department, IBM Thomas J. Watson Research Center, 
Yorktown Heights, NY 10598 


The Chebyshev polynomial of degree $n \ge 0$ is defined by 
(1) $T_n(x) = \cos n\theta$, $x = \cos\theta$, $0 \le \theta \le \pi$. 

This defines the polynomial on the interval $[-1, 1]$ (and hence for all complex numbers). Clearly 
$T_0(x) = 1$, $T_1(x) = x$, and elementary trigonometric identities show that $T_{n+1}(x) = 2xT_n(x) - T_{n-1}(x)$ 
for $n = 1, 2, \ldots$. These polynomials have many interesting properties and have been 


E. L. Ortiz. I received my doctorate from Buenos Aires University where I was a student of Mischa Cotlar, 
working in functional analysis. I did postdoctoral research in applied analysis and approximation theory under 
Cornelius Lanczos at the Institute for Advanced Studies, Dublin, Ireland. I have been professor of mathematics at 
Buenos Aires University and head of the Numerical Analysis Section of Imperial College, London, England, where I 
have taught since 1968. 

T. J. Rivlin. I received my Ph.D. from Harvard University as a student of J. L. Walsh. After teaching at The 
Johns Hopkins University and spending a year as a Research Assistant at the Courant Institute for Mathematical 
Sciences, N.Y.U., I worked as a mathematical analyst in the aircraft industry on Long Island before joining the IBM 
Research Division in 1959. I have been a Visiting Professor of Computer Science at Stanford, Adjunct Professor of 
Mathematics at the Graduate Center of the City University of New York, and Science Research Council Senior 
Research Fellow at the Department of Mathematics, Imperial College, London, where I collaborated with Dr. Ortiz 
on this piece of work. My major research interest is approximation theory and I am an associate editor of The 
Journal of Approximation Theory. I have written two books: The Chebyshev Polynomials (Wiley, 1974) and An 
Introduction to the Approximation of Functions (Dover, 1981). 
studied extensively (cf. Rivlin [2]). In view of (1) it is clear that the graph of $y = T_n(x)$ for 
$-1 \le x \le 1$ lies entirely in the square $-1 \le y \le 1$. Fig. 1 shows the graphs of 
$T_n(x)$ for $n = 1, 2, \ldots, 30$. Some “white” curves streaking through the square A 
are clearly visible in Fig. 1. Our primary purpose here is to explain this phenomenon. We wish to 
$\theta = \frac{2j\pi}{n+m}$, $j = 0, 1, \ldots, \left[\frac{n+m}{2}\right]$, 
or 
$\theta = \frac{2k\pi}{n-m}$, $k = 1, \ldots, \left[\frac{n-m-1}{2}\right]$, 
and thus 
$a_j = \cos\frac{2j\pi}{n+m}$, $j = 0, 1, \ldots, \left[\frac{n+m}{2}\right]$, 
$b_k = \cos\frac{2k\pi}{n-m}$, $k = 1, \ldots, \left[\frac{n-m-1}{2}\right]$, 

are all $n$ zeros (counting multiplicities) of $T_n(x) - T_m(x) = 0$ in view of (2) and (1). Moreover, we 
have 


(3) $mT_n'(a_j) + nT_m'(a_j) = 0$, $-1 < a_j < 1$, 
and 
(4) $mT_n'(b_k) - nT_m'(b_k) = 0$, $-1 < b_k < 1$. 
To establish (3) and (4) we note that 
$\sin\frac{2j\pi n}{n+m} + \sin\frac{2j\pi m}{n+m} = \sin\frac{2k\pi n}{n-m} - \sin\frac{2k\pi m}{n-m} = 0$, 
and 
$T_n'(x) = \frac{n \sin n\theta}{\sin\theta}$, $x = \cos\theta$. 


Now suppose that $(x, T_n(x))$ is an intersection point of $T_n$ and $T_m$ in the interior of A. Then 
$T_n'(x)T_m'(x) \ne 0$. For if $T_n'(x)T_m'(x) = 0$, then (3) and (4) imply that $T_n'(x) = T_m'(x) = 0$ and 
hence that $T_n(x) = T_m(x) = \pm 1$, contradicting the assumption that $(x, T_n(x))$ is an interior point 
of A. Thus, in view of (3) and (4), 

(5) $\frac{T_n'(x)}{T_m'(x)} = -\frac{n}{m}$ if $x = a_j$, and $\frac{T_n'(x)}{T_m'(x)} = \frac{n}{m}$ if $x = b_k$. 


(5) provides a basis for our explanation of the existence of the “white” curves. Roughly speaking, 
for $1 \le m < n$ and $m, n \le 30$, intersection points of type $a_j$ are much more frequent than those of 
type $b_k$, and at each intersection point of $T_n$ and $T_m$ of type $a_j$ in the interior of A the slopes of $T_n$ 
and $T_m$ are of opposite sign and in magnitude in the ratio n to m. We believe that the resulting 
concatenation of blank areas about these intersection points is what is seen as a “white” curve in 
Fig. 1. 
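
The slope relation can be checked numerically. The following is a minimal sketch, not part of the original article: it assumes the intersection abscissas are $a_j = \cos(2j\pi/(n+m))$ and $b_k = \cos(2k\pi/(n-m))$, evaluates $T_n$ through $T_n(\cos\theta) = \cos n\theta$, and uses the derivative formula $T_n'(x) = n \sin n\theta / \sin\theta$. The function names are illustrative only.

```python
import math

def cheb(n, x):
    """T_n(x) on [-1, 1] via T_n(cos t) = cos(n t)."""
    return math.cos(n * math.acos(x))

def cheb_slope(n, x):
    """T_n'(x) = n sin(n t) / sin(t) with x = cos t, valid for -1 < x < 1."""
    t = math.acos(x)
    return n * math.sin(n * t) / math.sin(t)

def interior_intersections(n, m):
    """Abscissas a_j and b_k strictly inside (-1, 1), assuming 0 < m < n."""
    a = [math.cos(2 * j * math.pi / (n + m)) for j in range(1, (n + m + 1) // 2)]
    b = [math.cos(2 * k * math.pi / (n - m)) for k in range(1, (n - m - 1) // 2 + 1)]
    return a, b

if __name__ == "__main__":
    n, m = 7, 4
    a, b = interior_intersections(n, m)
    for x in a:
        # slopes have opposite sign, magnitudes in the ratio n : m
        print("a", round(x, 6), cheb_slope(n, x) / cheb_slope(m, x))
    for x in b:
        # slopes have the same sign, magnitudes in the ratio n : m
        print("b", round(x, 6), cheb_slope(n, x) / cheb_slope(m, x))
```

For n = 7 and m = 4 there are five interior points of type $a_j$ and only one of type $b_k$, which illustrates the remark that points of type $a_j$ are the more frequent.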

The connection between the “white” curves and points of intersection of pairs of Chebyshev 
polynomials receives support from the following considerations. 


THEOREM. If $0 < m < n$ and $T_n(x) = T_m(x) = y$, then 
(6) $(1 - T_{n-m}(x))(T_2(y) - T_{n-m}(x)) = 0$. 
Proof. The proof follows from the following polynomial identity: 
(7) $T_n^2 - 2T_{n-m}T_nT_m + T_m^2 = 1 - T_{n-m}^2$. 

(The special case m = 1 of (7) can be found in Schur [4].) To establish (7) note that for $p \ge q$, 
$2T_pT_q = T_{p+q} + T_{p-q}$; hence 
(8) $(T_n - T_m)^2 = (1 - T_{n+m})(1 - T_{n-m})$ 
and 
(9) $2(1 - T_nT_m) = (1 - T_{n+m}) + (1 - T_{n-m})$. 

(7) now follows upon multiplying (9) by $1 - T_{n-m}$ and subtracting the result from (8). Next put 
$T_n(x) = T_m(x) = y$ in (7) and the result is (6). (An identity analogous to (7) holds for the 
Chebyshev polynomials of the second kind.) 
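
As a sanity check on the identity and its consequence, here is a short numerical sketch (an illustration added here, not taken from the article). It evaluates $T_n^2 - 2T_{n-m}T_nT_m + T_m^2 - (1 - T_{n-m}^2)$ on a grid, and the product $(1 - T_{n-m}(x))(T_2(y) - T_{n-m}(x))$ at a point where $T_n(x) = T_m(x) = y$; both should vanish up to rounding. The helper names are hypothetical.

```python
import math

def T(n, x):
    """T_n(x) for -1 <= x <= 1, from T_n(cos t) = cos(n t)."""
    return math.cos(n * math.acos(x))

def identity_residual(n, m, x):
    """Left side minus right side of the polynomial identity; ~0 up to rounding."""
    tn, tm, td = T(n, x), T(m, x), T(n - m, x)
    return tn * tn - 2.0 * td * tn * tm + tm * tm - (1.0 - td * td)

def theorem_residual(n, m, x):
    """(1 - T_{n-m}(x)) * (T_2(y) - T_{n-m}(x)) with y = T_n(x)."""
    y = T(n, x)
    td = T(n - m, x)
    return (1.0 - td) * ((2.0 * y * y - 1.0) - td)   # T_2(y) = 2 y^2 - 1

if __name__ == "__main__":
    worst = max(abs(identity_residual(9, 4, i / 100)) for i in range(-99, 100))
    print("largest identity residual on the grid:", worst)
    x = math.cos(2 * math.pi / 13)   # an intersection point of T_9 and T_4
    print("theorem residual at the intersection:", theorem_residual(9, 4, x))
```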

Note that $T_{n-m}(x) = 1$ precisely for $x = b_k$, $k = 0, \ldots, [(n - m - 1)/2]$. Thus the points 
$a_j$, $j = 0, 1, \ldots, [(n + m)/2]$ all lie on the curve $T_2(y) = T_{n-m}(x)$. This curve is symmetric in the 
x-axis. It is shown in Fig. 2 for $n - m = q$, $q = 1, 2, 3, 4$. When $q = 2p$ we have $y = \pm T_p(x)$, 
while for odd q 

$y = \pm\sqrt{\frac{1 + T_q(x)}{2}}$. 

$T_2(y) = T_1(x)$ is a parabola, $T_2(y) = T_3(x)$ the folium of Descartes, $x = T_2(t)$, $y = T_3(t)$, 
$-1 \le t \le 1$. 


FIG. 2 


Observe that the curves for q = 1,2,3,4 in Fig. 2 are identical to the brightest “white” 
curves in Fig. 1. Lower values of $n - m = q$ correspond to higher numbers of intersection points 
of $T_n$ and $T_m$. Hence for $1 \le m < n \le 30$ we cannot expect to see the curves corresponding to 
q > 4 very clearly in Fig. 1. In the next section we get another view of the “white” curves by 
applying a suitable homeomorphism to the square A. 


2. The Stylized Chebyshev Polynomials. The mapping $S: (x, y) \to (\arccos x, \arccos y) = (x', y')$ 
is a homeomorphism of the square A onto the square B: $0 \le x' \le \pi$, $0 \le y' \le \pi$. Let us 
determine the image of the graph of $T_n(x)$ under S. 

$S: (x, T_n(x)) \to (x', \arccos T_n(\cos x'))$. 
Thus $y' = V_n(x') = \arccos(\cos nx')$ is the image of $y = T_n(x)$. Suppose 

$\frac{k\pi}{n} \le x' \le \frac{(k+1)\pi}{n}$, $k = 0, 1, \ldots, n-1$. 

i) If k is even, $\cos nx' = \cos(nx' - k\pi)$, and since $0 \le nx' - k\pi \le \pi$, 
$V_n(x') = nx' - k\pi$. 
ii) If k is odd, $\cos nx' = \cos((k+1)\pi - nx')$, and since $0 \le (k+1)\pi - nx' \le \pi$, 
$V_n(x') = (k+1)\pi - nx'$. 

In short, then, $y' = V_n(x')$ is a polygonal line issuing from the origin with slope n and with slope 
changing sign (but maintaining magnitude n) consecutively, at the top and bottom of B, that is at 
each abscissa $j\pi/n$, $j = 1, \ldots, n - 1$. (Thus, this image of the Chebyshev polynomial is a 
perfect spline of degree 1 with knots at $j\pi/n$, $j = 1, \ldots, n - 1$. Cf. Schoenberg [3].) 

Each curve $y' = V_n(x')$ passes through (0,0) while each $y = T_n(x)$ is satisfied by (1,1). In order 
to make the orientation of the polygonal lines agree with those of the Chebyshev polynomials, and 
to place them on the square A as well, we subject the square B to a linear mapping onto A: 
$-1 \le x \le 1$, $-1 \le y \le 1$, $L: (x', y') \to (x, y)$ defined by 

$x = 1 - \frac{2}{\pi}x'$, $y = 1 - \frac{2}{\pi}y'$, 

and thereby obtain as image of $y' = V_n(x')$ 

$y = v_n(x) = 1 - \frac{2}{\pi}V_n\left(\frac{\pi}{2}(1 - x)\right)$. 


We call the piecewise linear curve in A, $y = v_n(x)$, the stylized Chebyshev polynomial of degree n 
(a name suggested by Kaczmarz and Steinhaus [1, p. 43]). Fig. 3 shows $y = v_n(x)$ for n = 1,..., 5. 
Fig. 4 shows the same curves for n = 1,..., 30. The “white” curves now seem to be piecewise 
linear. 
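
The construction of the stylized polynomial is easy to sketch in code. The following is an illustrative sketch under stated assumptions rather than anything given in the article: it takes $V_n(x') = \arccos(\cos nx')$ in its piecewise linear form on the subintervals $[k\pi/n, (k+1)\pi/n]$, and then forms $v_n(x) = 1 - (2/\pi)V_n((\pi/2)(1 - x))$. The function names `V` and `stylized` are hypothetical.

```python
import math

def V(n, xp):
    """V_n(x') = arccos(cos(n x')) for 0 <= x' <= pi, in piecewise linear form."""
    k = min(int(n * xp / math.pi), n - 1)   # subinterval [k pi/n, (k+1) pi/n]
    if k % 2 == 0:
        return n * xp - k * math.pi          # rising segment, slope n
    return (k + 1) * math.pi - n * xp        # falling segment, slope -n

def stylized(n, x):
    """v_n(x) = 1 - (2/pi) V_n((pi/2)(1 - x)) for -1 <= x <= 1."""
    return 1.0 - (2.0 / math.pi) * V(n, 0.5 * math.pi * (1.0 - x))

if __name__ == "__main__":
    # v_n shares the corner value v_n(1) = 1 with T_n, and v_1(x) = x.
    print(stylized(5, 1.0), stylized(1, 0.3))
    # The images of the abscissas cos(2 j pi / (n + m)) are 1 - 4 j / (n + m);
    # v_7 and v_4 should agree there.
    for j in range(1, 6):
        c = 1 - 4 * j / 11
        print(round(c, 4), stylized(7, c), stylized(4, c))
```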
If $v_n(x) = v_m(x)$ for $0 < m < n$, then x is either 

$c_j = 1 - \frac{4j}{m+n}$, $j = 0, \ldots, \left[\frac{m+n}{2}\right]$, 

or 
−n) which we believe causes the blank areas that are seen as white lines in Fig. 4, and whose 
homeomorphic images are the white curves in the Chebyshev case. 

The curve $T_2(y) = T_{n-m}(x)$ contains the intersection points $(a_j, T_n(a_j))$ of $y = T_n(x) = T_m(x)$ 
interior to A, according to the Theorem. Its image under the homeomorphism LS of A onto A is 
(10) $v_2(y) = v_{n-m}(x)$ 
which contains the intersection points $(c_j, v_n(c_j))$ of $y = v_n(x) = v_m(x)$ interior to A. (10) is a 
piecewise linear curve in A which is symmetric with respect to the x-axis. For $y \ge 0$ its equation is 

(11) $y = \frac{1 + v_{n-m}(x)}{2}$. 

The curves (11) are shown in Fig. 5 for $q = n - m = 1, 2, 3, 4$ and their full, symmetric, version 
(10) is shown in Fig. 6 for $q = n - m = 1, 2, 3, 4$ also. Note that the curves in Fig. 6 fit the 
brightest “white” curves in Fig. 4 exactly. 
3. Final Remarks. 

(i) Generality of the Phenomenon. The phenomenon of the white curves persists for all the 
Jacobi polynomials, when they are suitably weighted. This is a consequence of an asymptotic 
formula due to Darboux (see Szegő [4, p. 194]). For example, we have for the Legendre 
polynomials, as $n \to \infty$, 

$\left(\frac{\pi n \sin\theta}{2}\right)^{1/2} P_n(\cos\theta) = \cos\left(\left(n + \frac{1}{2}\right)\theta - \frac{\pi}{4}\right) + O(n^{-1})$, $0 < \theta < \pi$. 

The graphs of 

$y = \left(\frac{\pi n}{2}\right)^{1/2}(1 - x^2)^{1/4}P_n(x)$ 

for n = 1,..., 30 are shown in Fig. 7. 

(ii) Conclusions. We have attempted to explain the presence of the “white” curves in Fig. 1. We 
have suggested a mechanism which connects intersection points of pairs of Chebyshev polynomi- 
als with the “white” curves based on concatenated blank domains. Strong support is lent to this 
connection by the fact that the “white” curves coincide with the curves of Fig. 2. Another view of 
the phenomenon under consideration is offered by the graphs of the stylized Chebyshev poly- 
Do you remember Constance Reid, the author of “Hilbert”? The subject above is her sister. 
Her name is on page 51. 
be used to design and administer individual remediation programs. It cannot be over-emphasized 
that a mathematics appreciation course cannot fulfill its goal if it degenerates into the teaching of 
arithmetic computations or pre-algebra skills, or if it is limited to a topic such as “consumer 
mathematics.” 


6. A large proportion of students enter two-year colleges with little realistic expectation 
concerning majors. Many of these students have had poor experiences with mathematics and, if 
there is a general education mathematics requirement which may be satisfied by either a 
mathematics appreciation course or a pre-calculus/calculus course, they will often elect the 
mathematics appreciation course. Well into a successful term, the student may begin to think 
realistically about mathematics requirements of various university majors. Since most majors 
outside the humanities will necessitate at least some mathematics at a technical level rarely 
achieved in the typical two-year college mathematics appreciation course, an important service of 
this course can be to channel these students back into regular sequence mathematics courses. 
Without violating the spirit of a mathematics appreciation course, it is possible to include a 
topically organized unit requiring the review and use of elementary algebra and graphing 
techniques; this may give the student a successful experience in doing mathematics that serves as 
encouragement to return to regular sequence mathematics courses. (A linear programming unit, 
for example, requires the students to review or acquire facility with graphing and algebra 
techniques. Many of the topics suitable for a mathematics appreciation course can be handled in 
this way.) Students with this experience will frequently place higher in the sequence courses than 
they would have upon original enrollment, and will go on as solid, though late-blooming, students. 


VII. Films. Since students in mathematics appreciation courses frequently have little experi- 
ence in sustaining interest in regular mathematics lectures, it is usually appropriate in these 
courses to provide a variety of class activities. Films are a useful but under-utilized medium for 
mathematics instruction generally. They are especially useful for the mathematics appreciation 
course. 

We list in the Center Section a selection of films about mathematical subjects that are suitable 
for lay audiences. (Distributor addresses are listed at the end.) Further information on these and 
other films is available in the booklet Annotated Bibliography of Films and Videotapes for College 
Mathematics by David Schneider (M.A.A., 1980). 


VIII. Classroom Aids. Certain topics treated in mathematics appreciation courses are particu- 
larly amenable to demonstration with physical or geometric devices. Useful exhibits can often be 
seen at NCTM meetings. A list of major suppliers of mathematics classroom devices is given in 
the Center Section. 


IX. References. Since many of the topics that arise in mathematics appreciation courses occur 
nowhere else in the mathematics curriculum, it is quite important that instructors be aware of the 
expository literature of mathematics that treats its relations to science and society. Student term 
papers in courses on mathematics appreciation typically tax the instructor’s knowledge of the 
literature more than any other course in the mathematics curriculum. 

To aid instructors of mathematics appreciation courses, we list in the Center Section major 
references that would be suitable for background reading, and as sources for special projects. This 
list does not include textbooks, partly because we do not wish to endorse some books over others, 
and partly because texts go in and out of print much more rapidly than the reference classics. 
Julia Robinson, current president of the American Mathematical Society. Photo by George 
AN ALGORITHMIC DERIVATION OF THE JORDAN CANONICAL FORM 


R. FLETCHER 
Department of Mathematics, The University, Dundee, Scotland DD1 4HN 


D. C. SORENSEN 
Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439 


1. Introduction. When teaching an introductory course on linear algebra, it might be desirable 
to introduce Jordan canonical form. However, a standard derivation usually requires more 
preparation than is justified in such a course. In this note we present an algorithmic derivation of 
the Jordan canonical form. It requires no preparation other than the Schur decomposition [5] and 
the solution of linear systems. Because of this the topic can be introduced within the framework of 
an introductory course with minimal effort. 

Other elementary proofs of Jordan form have been given. Strang [7] relates a proof of Filippov 
[1], and a recent derivation has been given by Galperin and Waksman [2]. The argument presented here 
is much more in the spirit of matrix factorization than either of these. It is of particular interest 
that the main result is obtained by refining the Schur decomposition to the desired form. The 
proofs are all based upon induction arguments and thus the development is close to being an 
algorithmic procedure for the transformation of the Schur decomposition into Jordan canonical 
form. The derivation given here is completely self-contained and quite simple in our estimation. 

We shall use lower case Greek letters to denote scalars, lower case Latin letters to denote 
column vectors, and upper case Latin letters to denote matrices. All matrices and vectors are 
assumed to have complex entries. We shall denote the collection of m × n complex matrices by 
the symbol $\mathbb{C}^{m \times n}$. The transpose of a matrix (vector) is denoted by $^T$, and the conjugate transpose 
by $^*$. The set of eigenvalues of a square matrix A is denoted by $\lambda(A)$. 


2. Jordan Canonical Form. Our aim is to give a simple proof of the following theorem. 
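THEOREM (2.1). Given any matrix $A \in \mathbb{C}^{n \times n}$, there is a nonsingular matrix $X \in \mathbb{C}^{n \times n}$ such that 

$X^{-1}AX = \mathrm{diag}(J_1, J_2, \ldots, J_m)$ 

where each $J_i = \lambda_i I + E_i$ is a square matrix of order $k_i$, with $E_i = \begin{pmatrix} 0 & I_{k_i - 1} \\ 0 & 0 \end{pmatrix}$, and $\lambda_i \in \mathbb{C}$ for 
$i = 1, 2, \ldots, m$. 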


Our proof of this well-known result starts with a closely related matrix factorization due to 
Schur and systematically transforms this factorization into the Jordan canonical form. This is 
really a natural approach for anyone who is familiar with the techniques of numerical linear 
algebra. The Schur decomposition is fundamental to understanding the famous Q-R iterative 
technique for computing the eigenvalues and eigenvectors of a matrix, and it is evident that the 
Schur decomposition may be used in almost any situation which might seem to require Jordan 
form. 
The proof proceeds in three stages. The Schur decomposition tells us that any matrix 
$B \in \mathbb{C}^{n \times n}$ may be written in the form $B = QRQ^*$ with R upper triangular and Q unitary. The 
next step is to construct a nonsingular matrix X such that 

$X^{-1}RX = \mathrm{diag}(R_1, R_2, \ldots, R_m)$ 

with each $R_j$ being upper triangular and having its main diagonal in the form $\lambda_j I$. The final step is 
to show how to transform one of the triangular matrices $R_j$ to canonical form. These main steps 
are described in detail in Lemmas (2.2), (2.8), and (2.12) given below. 

Our proof of the Schur decomposition follows the discussion in [6, p. 279] very closely but the 
idea of the proof is essentially the same as the original proof given by Schur in [5]. We include it 
here for the sake of completeness. 


LEMMA (2.2) (Schur). Let $B \in \mathbb{C}^{k \times k}$ be any k × k complex matrix. Then there is a unitary 
matrix $Q \in \mathbb{C}^{k \times k}$ and an upper triangular matrix $R \in \mathbb{C}^{k \times k}$ such that 

$Q^*BQ = R$. 
The diagonal entries of R are the eigenvalues $\lambda(B)$ of B. 


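Proof. The proof is by induction on k, the order of the matrix. The result is clearly true for 
matrices of order 1. Assume the result for matrices of order k, and suppose that $B \in \mathbb{C}^{(k+1) \times (k+1)}$. 
Let $\lambda$ be an eigenvalue of B, and let q be a corresponding eigenvector such that $q^*q = 1$. Let 
$U \in \mathbb{C}^{(k+1) \times k}$ be chosen so that the matrix $(q \;\; U)$ is unitary. Then 

$\begin{pmatrix} q^* \\ U^* \end{pmatrix} B \, (q \;\; U) = \begin{pmatrix} q^*Bq & q^*BU \\ U^*Bq & U^*BU \end{pmatrix} = \begin{pmatrix} \lambda & z^* \\ 0 & B_1 \end{pmatrix}$. 

Now by the induction hypothesis, there is a unitary $Q_1 \in \mathbb{C}^{k \times k}$ such that $Q_1^*B_1Q_1 = R_1$ with $R_1$ 
upper triangular. Put 

$Q = (q \;\; U) \begin{pmatrix} 1 & 0 \\ 0 & Q_1 \end{pmatrix}$. 
Then Q is unitary, and 

$Q^*BQ = R$, with $R = \begin{pmatrix} \lambda & z^*Q_1 \\ 0 & R_1 \end{pmatrix}$. 

This completes the proof. ∎ 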
It is worth noting for the sequel that there is no loss of generality in assuming these eigenvalues 
occur in a specified order on the diagonal of R. 
Our proof of Theorem (2.1) will require knowledge of the existence and uniqueness of solutions 
to the matrix equation 


$A_1B - BA_2 = S$. 

It is straightforward using the Schur decomposition $Q_j^*A_jQ_j = R_j$ for j = 1,2 to transform the 
above equation to an equivalent equation with upper triangular matrices $R_1$, $R_2$ as coefficient 
matrices in place of $A_1$, $A_2$. Thus the following result is completely general. 


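LEMMA (2.3). Let $R_1$ and $R_2$ be upper triangular matrices in $\mathbb{C}^{k_1 \times k_1}$ and $\mathbb{C}^{k_2 \times k_2}$ respectively, and 
let $S \in \mathbb{C}^{k_1 \times k_2}$. Then the matrix equation 

(2.4) $R_1B - BR_2 = S$ 
has a unique solution $B \in \mathbb{C}^{k_1 \times k_2}$ if and only if 

(2.5) $\lambda(R_1) \cap \lambda(R_2) = \emptyset$. 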


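Proof. The proof is a straightforward induction on $k_1$, the order of $R_1$. Let $k_2$ be any fixed 
positive integer. The lemma is clearly true for $k_1 = 1$. Assume it is true for upper triangular 
matrices $R_1$ of order less than $k_1$ and consider the following partitioned form of equation (2.4): 

(2.6) $\begin{pmatrix} \lambda_1 & r_1^T \\ 0 & \hat{R}_1 \end{pmatrix} \begin{pmatrix} \beta & b^T \\ w & \hat{B} \end{pmatrix} - \begin{pmatrix} \beta & b^T \\ w & \hat{B} \end{pmatrix} \begin{pmatrix} \lambda_2 & r_2^T \\ 0 & \hat{R}_2 \end{pmatrix} = \begin{pmatrix} \sigma & s_1^T \\ s_2 & \hat{S} \end{pmatrix}$. 

Multiplying the left side of (2.6) out gives the equation 

(2.7) $\begin{pmatrix} \lambda_1\beta + r_1^Tw & \lambda_1b^T + r_1^T\hat{B} \\ \hat{R}_1w & \hat{R}_1\hat{B} \end{pmatrix} - \begin{pmatrix} \beta\lambda_2 & \beta r_2^T + b^T\hat{R}_2 \\ w\lambda_2 & wr_2^T + \hat{B}\hat{R}_2 \end{pmatrix} = \begin{pmatrix} \sigma & s_1^T \\ s_2 & \hat{S} \end{pmatrix}$. 

The induction hypothesis applied to (2.7) will allow the equations to be solved in the following 
order 

(a) $(\hat{R}_1 - \lambda_2 I)w = s_2$ 
(b) $(\lambda_1 - \lambda_2)\beta = \sigma - r_1^Tw$ 
(c) $\hat{R}_1\hat{B} - \hat{B}\hat{R}_2 = \hat{S} + wr_2^T$ 
(d) $b^T(\lambda_1 I - \hat{R}_2) = s_1^T - r_1^T\hat{B} + \beta r_2^T$ 
if and only if the matrices $R_1$ and $R_2$ satisfy (2.5). This completes the induction. ∎ 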
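
The solvability condition can be seen at work in a small program. The sketch below is an illustration added here and is not the authors' block recursion: it solves $R_1B - BR_2 = S$ entry by entry (rows from the bottom up, columns from left to right), which is the scalar form of the same back-substitution idea. Each entry of B is obtained by dividing by a difference of a diagonal entry of $R_1$ and a diagonal entry of $R_2$, so the solve succeeds exactly when the two matrices share no eigenvalue. Exact rational arithmetic is used so that the residual can be compared with zero; the function names are hypothetical.

```python
from fractions import Fraction

def solve_triangular_sylvester(R1, R2, S):
    """Solve R1 B - B R2 = S for upper triangular R1 (k1 x k1), R2 (k2 x k2)."""
    k1, k2 = len(R1), len(R2)
    B = [[0] * k2 for _ in range(k1)]
    for i in range(k1 - 1, -1, -1):        # rows of B, bottom to top
        for j in range(k2):                # columns of B, left to right
            d = R1[i][i] - R2[j][j]
            if d == 0:
                raise ValueError("R1 and R2 share an eigenvalue")
            rhs = S[i][j]
            rhs -= sum(R1[i][p] * B[p][j] for p in range(i + 1, k1))
            rhs += sum(B[i][q] * R2[q][j] for q in range(j))
            B[i][j] = rhs / d
    return B

def sylvester_residual(R1, R2, B, S):
    """Entries of R1 B - B R2 - S; all zero for an exact solution."""
    k1, k2 = len(R1), len(R2)
    return [[sum(R1[i][p] * B[p][j] for p in range(k1))
             - sum(B[i][q] * R2[q][j] for q in range(k2))
             - S[i][j]
             for j in range(k2)] for i in range(k1)]

if __name__ == "__main__":
    F = Fraction
    R1 = [[F(1), F(2)], [F(0), F(3)]]
    R2 = [[F(5), F(1), F(0)], [F(0), F(7), F(2)], [F(0), F(0), F(9)]]
    S = [[F(1), F(2), F(3)], [F(4), F(5), F(6)]]
    B = solve_triangular_sylvester(R1, R2, S)
    print(sylvester_residual(R1, R2, B, S))
```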


The next lemma refines the Schur decomposition to a form that is very close to the one we 
seek. 


LEMMA (2.8). Let $R \in \mathbb{C}^{n \times n}$ be upper triangular. Then there is a nonsingular $X \in \mathbb{C}^{n \times n}$ such 
that 

(2.9) $X^{-1}RX = \mathrm{diag}(R_1, R_2, \ldots, R_m)$, 
where 

$R_j = \lambda_j I + U_j$ for $j = 1, 2, \ldots, m$ 
with each $U_j$ strictly upper triangular, and each $\lambda_j$ distinct. 


Proof. The proof is by induction on n. The result is clearly true when n = 1. Assume the result 
for upper triangular matrices of order less than n. Let $R \in \mathbb{C}^{n \times n}$ be upper triangular. The Schur 
decomposition of a general matrix can be obtained with eigenvalues in any given order. Thus, we 
may assume without loss of generality that 

$R = \begin{pmatrix} R_1 & S \\ 0 & R_2 \end{pmatrix}$ 

where $R_1$ and $R_2$ have no eigenvalues in common, and $R_1 = \lambda_1 I + U_1$ with $U_1$ strictly upper 
triangular. Now, there is a matrix B of appropriate dimensions which satisfies 

$\begin{pmatrix} I & B \\ 0 & I \end{pmatrix} \begin{pmatrix} R_1 & S \\ 0 & R_2 \end{pmatrix} \begin{pmatrix} I & -B \\ 0 & I \end{pmatrix} = \begin{pmatrix} R_1 & 0 \\ 0 & R_2 \end{pmatrix}$ 
if and only if 

(2.10) $S = R_1B - BR_2$. 

The matrix equation (2.10) does have a unique solution B since $\lambda(R_1) \cap \lambda(R_2) = \emptyset$, and the 
induction hypothesis implies that $R_2$ may be reduced to block diagonal form as in (2.9). This 
completes the proof. ∎ 
LEMMA (2.11). Let $E \in \mathbb{C}^{k \times k}$ be of the form 

$E = \begin{pmatrix} 0 & I_{k-1} \\ 0 & 0 \end{pmatrix}$. 

Then 

$E^TE = \begin{pmatrix} 0 & 0 \\ 0 & I_{k-1} \end{pmatrix}$, $E^k = 0$, $Ee_{i+1} = e_i$, and $(I - E^TE)r = \sigma e_1$, 

where $e_i$ is the ith coordinate vector and $\sigma = e_1^Tr$. 


The proof of Lemma (2.11) is straightforward. The next lemma will show how to reduce the strict 
upper triangle of the block form in Lemma (2.8) to the form required by Theorem (2.1). In the 
proof of this lemma we use the notation $e_i$ to denote the ith unit vector without reference to its 
dimension. In all cases this dimension is meant to be consistent with dimensions implied in 
context by the matrix partitioning. 


LEMMA (2.12). Let $U \in \mathbb{C}^{n \times n}$ be a strictly upper triangular matrix. Then there is a nonsingular 
matrix X such that 

$X^{-1}UX = J$, 

where $J = \mathrm{diag}(E_1, E_2, \ldots, E_m)$ with each $E_j = \begin{pmatrix} 0 & I_{k_j} \\ 0 & 0 \end{pmatrix}$, and with $\mathrm{order}(E_{j+1}) \le \mathrm{order}(E_j) = k_j + 1$ 
for $j = 1, 2, \ldots, m - 1$. 


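Proof. The proof is by induction on n. The result is clearly true for n = 1. Assume the result is 
true for strictly upper triangular matrices U of order < n, and let $U \in \mathbb{C}^{n \times n}$ be strictly upper 
triangular. Partition $U = \begin{pmatrix} 0 & u^T \\ 0 & U_1 \end{pmatrix}$. By the induction hypothesis there is a nonsingular $X_1$ such 
that 

(2.13) $X_1^{-1}U_1X_1 = \begin{pmatrix} E_1 & 0 \\ 0 & J_2 \end{pmatrix}$, where $J_2 = \mathrm{diag}(E_2, E_3, \ldots, E_m)$ 

and where $\mathrm{order}(E_1) \ge \mathrm{order}(E_j)$ for $j \ge 2$. Then 

$\begin{pmatrix} 1 & 0 \\ 0 & X_1^{-1} \end{pmatrix} U \begin{pmatrix} 1 & 0 \\ 0 & X_1 \end{pmatrix} = \begin{pmatrix} 0 & u^TX_1 \\ 0 & X_1^{-1}U_1X_1 \end{pmatrix}$. 

Partition $u^TX_1$ as $(u_1^T, u_2^T) = u^TX_1$ consistent with the partitioning in (2.13). With the aid of 
Lemma (2.11) we obtain 

(2.14) $\begin{pmatrix} 1 & -u_1^TE_1^T & 0 \\ 0 & I & 0 \\ 0 & 0 & I \end{pmatrix} \begin{pmatrix} 0 & u_1^T & u_2^T \\ 0 & E_1 & 0 \\ 0 & 0 & J_2 \end{pmatrix} \begin{pmatrix} 1 & u_1^TE_1^T & 0 \\ 0 & I & 0 \\ 0 & 0 & I \end{pmatrix} = \begin{pmatrix} 0 & \sigma e_1^T & s^T \\ 0 & E_1 & 0 \\ 0 & 0 & J_2 \end{pmatrix}$, 

where $\sigma = u_1^Te_1$ and $s^T = u_2^T$. If $\sigma \ne 0$ then the matrix on the right in (2.14) is similar to 

(2.15) $\begin{pmatrix} E & e_1s^T \\ 0 & J_2 \end{pmatrix}$ with $E = \begin{pmatrix} 0 & e_1^T \\ 0 & E_1 \end{pmatrix}$. 

To see this multiply the matrix in (2.14) from the left by $\mathrm{diag}(\sigma^{-1}, I, \sigma^{-1}I)$ and from the right by 
$\mathrm{diag}(\sigma, I, \sigma I)$ to obtain (2.15). Note that the order k of E is strictly greater than the order of any 
diagonal block of $J_2$, so that $J_2^{k-1} = 0$. 

Define $s_i^T = s^TJ_2^{i-1}$ for $i = 1, 2, \ldots, k$. Then 

$\begin{pmatrix} I & e_{i+1}s_i^T \\ 0 & I \end{pmatrix} \begin{pmatrix} E & e_is_i^T \\ 0 & J_2 \end{pmatrix} \begin{pmatrix} I & -e_{i+1}s_i^T \\ 0 & I \end{pmatrix} = \begin{pmatrix} E & e_{i+1}s_{i+1}^T \\ 0 & J_2 \end{pmatrix}$, $1 \le i \le k - 1$. 

Now, $s_k = 0$ since $J_2^{k-1} = 0$, and it follows that U is similar to the matrix $\begin{pmatrix} E & 0 \\ 0 & J_2 \end{pmatrix}$. On the other 
hand, if $\sigma = 0$ in (2.14) then a simple permutation of the rows and columns of the matrix on the 
right of (2.14) shows that U is similar to 

$\begin{pmatrix} E_1 & 0 & 0 \\ 0 & 0 & s^T \\ 0 & 0 & J_2 \end{pmatrix}$. 

The trailing block $\begin{pmatrix} 0 & s^T \\ 0 & J_2 \end{pmatrix}$ is strictly upper triangular of order less than n, so the induction 
hypothesis applies to it. Thus, U is similar to the matrix $\begin{pmatrix} E_1 & 0 \\ 0 & \hat{J}_2 \end{pmatrix}$, where $\hat{J}_2$ has the desired block 
diagonal form. Now a simple (block) permutation of the rows and columns will bring this matrix to the proper form. 
This completes the induction. ∎ 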

Theorem (2.1) follows readily from these results. The Schur decomposition shows that a given 
square matrix is similar to an upper triangular matrix. Lemma (2.8) implies that this triangular 
matrix is similar to a block upper triangular matrix of the form (2.9). Lemma (2.12) implies that 
each triangular block is similar to a matrix of the form in Theorem (2.1) because for each diagonal 
block $R_j$ there is a nonsingular $X_j$ such that 

$X_j^{-1}(\lambda_j I + U_j)X_j = \lambda_j I + \mathrm{diag}(E_1, E_2, \ldots, E_m)$. 


3. Conclusions. The derivation given here is very algorithmic in nature. However, we do not 
wish to leave the impression that this is a sound numerical computation in any sense. Actual 
computation of the Jordan canonical form is a formidable task and the reader interested in this 
computation is referred to [3] for an extensive study of the numerical problem and to [4] for a 
computer program. Nevertheless, it is interesting to note that the only nonconstructive step in this 
proof occurs in obtaining the Schur decomposition. For this reason we feel this discussion would 
fit well in a course concerned with applications or algorithms. 


Work supported in part by the Applied Mathematical Sciences Research Program (KC-04-02) of the Office of 
Energy Research of the U.S. Department of Energy under Contract W-31-109-Eng-38. Part of this work was done 
while the authors were at the University of Kentucky. 
OF CALCULATIONS PAST AND PRESENT: 
THE ARCHIMEDEAN ALGORITHM 
There is nothing new in the world except 
the history you do not know. 
Harry S. Truman 


1. Introduction. The famous method of inscribed and circumscribed polygons for approximating 
the constant $\pi$, which allowed Archimedes to get the estimates 

(1.1) $3\frac{1}{7} > \pi > 3\frac{10}{71}$, 

was a forerunner of a certain type of algorithm used on today’s computers. Stimulated by an 
article of G. M. Phillips [49], which recently appeared in this MONTHLY, we trace the evolution of 
Archimedes’ method, from its geometrical beginning as a means to approximate $\pi$ to its modern 
version as an analytical technique for evaluating inverse circular and inverse hyperbolic functions. 
We show that works of Descartes and Gregory provided crucial stepping stones for the transition 
from geometry to analysis. The resulting analytical procedure, which was formalized in the 
nineteenth century by Pfaff and Borchardt, is itself in a class of algorithms dependent on 
arithmetic and geometric means and on invariant transformations of elliptic integrals. This class 
of algorithms, investigated in 1971 by B. C. Carlson [8], includes Gauss’ well-known method for 
calculating complete elliptic integrals. From the modern standpoint, as shown by (11.1) and (11.4) 
in the sequel, the Archimedean algorithm can be viewed as a consequence of the duplication 
theorem applied on a degenerate elliptic integral. In the course of our description of the evolution 
of Archimedes’ method, we establish various historical connections among old and new algo- 
rithms. The rich diversity of these connections provides much interesting material for the 
classroom. For pedagogic reasons, we outline key geometrical arguments in order to display the 
passage from geometry to analysis, and in the last section, we give a list of pertinent references 
and computational exercises suited for the classroom. 


FIG. 1. 


George Miel was born in Paris and immigrated to the United States as a teenager. He worked for the Apollo 
Space Program and for industry abroad. After receiving his Ph.D. at the University of Wyoming in 1976, he held a 
visiting position for the next two years at the University of Calgary. He then came to the University of Nevada in 
Las Vegas where he is now Associate Professor. Occasionally he would rather be out scaling wild mountain faces 
than be in his office writing papers on numerical analysis. 
2. An Arithmetical Marvel. Archimedes’ book On the Measurement of a Circle, written in the 
3rd century B.C., consists of three propositions only [30], [31]. Proposition 3, containing the 
numerical estimates (1.1), is most interesting to us. We outline and interpret Archimedes’ 
derivation of these estimates. (See Fig. 1.) 

Let $AA_{n-1}$ be tangent to a half-circle with center O and diameter AB. Let $OA_n$ bisect the angle 
$AOA_{n-1}$. We then have 

$\frac{OA_{n-1}}{OA} = \frac{A_{n-1}A_n}{AA_n}$. 
Consequently, 
$\frac{OA + OA_{n-1}}{OA} = \frac{AA_n + A_{n-1}A_n}{AA_n} = \frac{AA_{n-1}}{AA_n}$, 

(2.1) $\frac{OA}{AA_n} = \frac{OA}{AA_{n-1}} + \frac{OA_{n-1}}{AA_{n-1}}$. 
Also, 
(2.2) $OA_n^2 = OA^2 + AA_n^2$. 

Letting $\xi_n = OA/AA_n$ and $\eta_n = OA_n/AA_n$, (2.1) and (2.2) yield the recurrence relations 

(2.3) $\xi_n = \xi_{n-1} + \eta_{n-1}$, $\eta_n = \sqrt{\xi_n^2 + 1}$. 

In modern language, $\xi_n$ and $\eta_n$ are respectively a cotangent and a cosecant and the relations (2.3) 
represent standard trigonometric identities. 
Take $B_n$ on the half-circle so that angle $BAB_n$ equals angle $AOA_n$. We have 


If angles $AOA_n$ and $BAB_n$ are equal to $\pi/(2^nM)$, for some fixed integer $M \ge 3$, then $2(AA_n)$ and 
$BB_n$ represent sides of regular $2^nM$-gons respectively circumscribed and inscribed about a full 
circle of diameter AB. Taking a unit diameter AB = 1, the perimeters of these polygons are given 
by 

(2.4) $P_{2^nM} = \frac{2^nM}{\xi_n}$, $p_{2^nM} = \frac{2^nM}{\eta_n}$. 

The two sequences of perimeters given by (2.4) converge monotonically to $\pi$. 
Archimedes’ calculation can be described as follows: 

(i) Let $\xi_0 = \cot \pi/M$ and $\eta_0 = \csc \pi/M$. 
(ii) Use (2.3) to find recursively $\xi_n$ and $\eta_n$ for $n = 1, 2, \ldots, N$. 
(iii) Evaluate the perimeters (2.4) with n = N. 

The final estimates are given by 
$P_{2^NM} > \pi > p_{2^NM}$. 

Archimedes started with circumscribed and inscribed hexagons and he doubled the number of 
sides four times, namely, he took M = 6, $\xi_0 = \sqrt{3}$, $\eta_0 = 2$, and N = 4. 
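
Steps (i) to (iii) translate directly into a few lines of code. The following is a minimal floating-point sketch of the recurrence, added here for illustration; it is not Archimedes' own arithmetic, which worked with rounded fractions. The starting values default to the hexagon case (cotangent $\sqrt{3}$ and cosecant 2 of $\pi/6$), and the function and parameter names are hypothetical.

```python
import math

def archimedes_bounds(xi0=math.sqrt(3.0), eta0=2.0, sides0=6, doublings=4):
    """Return (p, P): perimeters of the inscribed and circumscribed polygons
    about a circle of unit diameter after the given number of doublings."""
    xi, eta = xi0, eta0                     # cotangent and cosecant of pi / sides0
    for _ in range(doublings):
        xi = xi + eta                       # half-angle step for the cotangent
        eta = math.sqrt(xi * xi + 1.0)      # cosecant from the cotangent
    sides = sides0 * 2 ** doublings
    return sides / eta, sides / xi

if __name__ == "__main__":
    p, P = archimedes_bounds()              # 96-gons, as in Archimedes' calculation
    print(p, P)
```

With the default arguments the two perimeters bracket $\pi$ and lie between $3\frac{10}{71}$ and $3\frac{1}{7}$; increasing `doublings` tightens the bracket.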

The values of $\xi_n$ and $\eta_n$ correct to 5 decimal places are listed in Table 1. Due to the backward 
condition of arithmetic during his time [30], Archimedes was unable to get figures to such 
accuracy. Working with the limited means at his disposal, he controlled the effects of roundoff 
errors by using his procedure twice, once with consistent underevaluation of the arithmetic 
operations to get values $\underline{\xi}_n$ and $\underline{\eta}_n$, and the second time with consistent overevaluation to get 
values $\overline{\xi}_n$ and $\overline{\eta}_n$. The resulting numbers, shown in Table 2 in fractional form and in equivalent 
decimal notation, satisfy the inequalities 

(2.5) $\underline{\xi}_n \le \xi_n \le \overline{\xi}_n$, $\underline{\eta}_n \le \eta_n \le \overline{\eta}_n$. 
The estimates for $\pi$ follow from two rounded divisions: 

(2.6) $\overline{P}_{96} = 3\frac{1}{7} > \frac{96}{\underline{\xi}_4}$, $\underline{p}_{96} = 3\frac{10}{71} < \frac{96}{\overline{\eta}_4}$. 

This systematic use of directed rounding enabled Archimedes to obtain an interval $[\underline{p}_{96}, \overline{P}_{96}]$, 
guaranteed to contain the exact interval $[p_{96}, P_{96}]$, thus accounting for both the theoretical and 
the roundoff errors. This strategy was an early example of what is called today interval analysis. 
This kind of analysis, currently an area of intense research, is described briefly in the next section. 


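TABLE 1

 n    $\zeta_n$      $\eta_n$
 0    1.73205      2.00000
 1    3.73205      3.86370
 2    7.59575      7.66130
 3    15.25705     15.28979
 4    30.54684     30.56320


TABLE 2

 n    $\underline{\zeta}_n$                  $\overline{\zeta}_n$                 $\underline{\eta}_n$                  $\overline{\eta}_n$
 0    265/153 = 1.73203            1351/780 = 1.73205          2 (exact)                    2 (exact)
 1    571/153 = 3.73203            2911/780 = 3.73205          (591 1/8)/153 = 3.86356      (3013 3/4)/780 = 3.86378
 2    (1162 1/8)/153 = 7.59559     1823/240 = 7.59583          (1172 1/8)/153 = 7.66095     (1838 9/11)/240 = 7.66174
 3    (2334 1/4)/153 = 15.25654    1007/66 = 15.25758          (2339 1/4)/153 = 15.28922    (1009 1/6)/66 = 15.29040
 4    (4673 1/2)/153 = 30.54575    (2016 1/6)/66 = 30.54798    (not needed)                 (2017 1/4)/66 = 30.56439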


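In the course of the calculation, Archimedes reduced two fractions to lower terms,

$\overline{\zeta}_2 = \dfrac{5924\frac{3}{4}}{780} = \dfrac{1823}{240}, \qquad \overline{\zeta}_3 = \dfrac{3661\frac{9}{11}}{240} = \dfrac{1007}{66}.$

Without explanation as to how they were obtained, he used the following lower and upper bounds for the square roots involved in the evaluation of $\underline{\eta}_n$ and $\overline{\eta}_n$:

$\dfrac{265}{153} < \sqrt{3} < \dfrac{1351}{780},$
$591\tfrac{1}{8} < \sqrt{349450}, \qquad 3013\tfrac{3}{4} > \sqrt{9082321},$
$1172\tfrac{1}{8} < \sqrt{1373943\tfrac{33}{64}}, \qquad 1838\tfrac{9}{11} > \sqrt{3380929},$
$2339\tfrac{1}{4} < \sqrt{5472132\tfrac{1}{16}}, \qquad 1009\tfrac{1}{6} > \sqrt{1018405},$
$2017\tfrac{1}{4} > \sqrt{4069284\tfrac{1}{36}}.$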


There has been much speculation as to the method he employed in the determination of these bounds [30, Chapter 4], [31, pp. 51-52]. The final estimates (2.6) for $\pi$ involve the inequalities

$3\tfrac{1}{7} > \dfrac{96 \times 153}{4673\frac{1}{2}}, \qquad 3\tfrac{10}{71} < \dfrac{96 \times 66}{2017\frac{1}{4}}.$

Considering that the figures are accurate to about four digits, and that they were obtained without positional arithmetic and without analytic trigonometry, one cannot help but wonder at the power displayed by Archimedes’ calculation.
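
The two rounded divisions behind the final estimates can be checked in exact rational arithmetic. The sketch below assumes the standard readings $4673\tfrac12$ and $2017\tfrac14$ of Archimedes' last denominators; the helper `mixed` is a hypothetical convenience, not part of any source.

```python
from fractions import Fraction

def mixed(whole, numerator=0, denominator=1):
    """Mixed number whole + numerator/denominator as an exact Fraction."""
    return Fraction(whole) + Fraction(numerator, denominator)

# Upper estimate: circumscribed 96-gon with the under-evaluated zeta_4.
upper_estimate = Fraction(96 * 153) / mixed(4673, 1, 2)
# Lower estimate: inscribed 96-gon with the over-evaluated eta_4.
lower_estimate = Fraction(96 * 66) / mixed(2017, 1, 4)

archimedes_upper = mixed(3, 1, 7)     # 3 1/7
archimedes_lower = mixed(3, 10, 71)   # 3 10/71
```

Comparing `lower_estimate` and `upper_estimate` with `archimedes_lower` and `archimedes_upper` shows how little was lost in the final rounding to the familiar fractions.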




3. A Precursor of Interval Analysis. Computational errors include uncertainties in data due to experimental observation, theoretical errors in approximation techniques, and roundoff errors due to the finite precision of computer arithmetic. Interval analysis provides a means of taking
into account all these errors by producing closed intervals of numbers in which exact results are 
known to lie. The mathematical foundations of interval analysis, which in essence replace 
point-to-point mappings by set-to-set mappings and approximate equalities by set inclusions, were 
formulated in 1966 by R.E. Moore [44]. 

Arithmetic operations on closed intervals are defined by

$[a, b] * [c, d] = \{\, x * y : a \le x \le b,\ c \le y \le d \,\},$

where $*$ denotes any one of the four operations, with the restriction that $0 \notin [c, d]$ in the case of division. Although interval addition and multiplication are both associative and commutative, the distributive law does not hold in general. During actual computations, exact interval arithmetic is approximated by requiring that the left endpoint be rounded down to the closest machine-representable number and that the right endpoint be similarly rounded up. The representation in a computer of an interval $I = [a, b]$ is $\tilde{I} = [\tilde{a}, \tilde{b}]$, obtained by directed rounding, and if $\circledast$ denotes the machine operation corresponding to $*$, then the set inclusions $\tilde{I} \supseteq I$ and $\tilde{I}_1 \circledast \tilde{I}_2 \supseteq I_1 * I_2$ hold. A computer program executed in rounded-interval arithmetic yields a set of intervals, represented by pairs of machine numbers, each containing the exact result of the corresponding infinite-precision operation.
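
As a rough illustration of rounded-interval arithmetic, here is a minimal sketch in Python (3.9 or later, for `math.nextafter`). It does not switch the hardware rounding mode; instead it assumes that stepping each endpoint one floating-point number outward after a round-to-nearest operation is enough to keep the exact result enclosed. The class `Interval` and the helper `exact_sum_is_enclosed` are illustrative names only.

```python
import math
from dataclasses import dataclass
from fractions import Fraction

@dataclass(frozen=True)
class Interval:
    lo: float
    hi: float

    @staticmethod
    def outward(lo, hi):
        # Round-to-nearest is off by at most half an ulp, so moving each
        # endpoint one float outward restores a guaranteed enclosure.
        return Interval(math.nextafter(lo, -math.inf), math.nextafter(hi, math.inf))

    def __add__(self, other):
        return Interval.outward(self.lo + other.lo, self.hi + other.hi)

    def __sub__(self, other):
        return Interval.outward(self.lo - other.hi, self.hi - other.lo)

    def __mul__(self, other):
        p = [self.lo * other.lo, self.lo * other.hi,
             self.hi * other.lo, self.hi * other.hi]
        return Interval.outward(min(p), max(p))

    def __truediv__(self, other):
        if other.lo <= 0.0 <= other.hi:
            raise ZeroDivisionError("divisor interval contains 0")
        q = [self.lo / other.lo, self.lo / other.hi,
             self.hi / other.lo, self.hi / other.hi]
        return Interval.outward(min(q), max(q))

    def width(self):
        return self.hi - self.lo

def exact_sum_is_enclosed(a, b):
    """Check the inclusion property for one addition, using exact rationals."""
    s = Interval(a, a) + Interval(b, b)
    exact = Fraction(a) + Fraction(b)
    return Fraction(s.lo) <= exact <= Fraction(s.hi)
```

The failure of the distributive law shows up as soon as one compares `a * (b + c)` with `a * b + a * c` for `a = Interval(1.0, 2.0)`, `b = Interval(1.0, 1.0)`, `c = Interval(-1.0, -1.0)`: the first is a tiny interval around 0, the second is roughly [-1, 1].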

In interval computations, uncertain input data enter as intervals, error estimates of approximation methods are likewise formulated as intervals, and the directed rounding accounts for roundoff errors. The resulting process provides a mathematically rigorous and complete error analysis for computational results. Disadvantages of interval analysis, presently the subject of vigorous research, involve the paucity of supporting hardware and software, the slowness of rounded-interval operations, and the tendency for intervals to grow very large, thus giving overly pessimistic results. An overall view of current research in these areas can be found in Moore [45] and in Nickel [46].

Archimedes’ calculation of $\pi$ was an early predecessor of interval analysis. The exact intervals $I_n$, with endpoints defined by (2.4), form a nest of intervals converging to the single point $\pi$, thus accounting for the theoretical error inherent in the method. Furthermore, Archimedes accounted for arithmetic errors by systematic directed rounding in order to obtain rounded intervals $\tilde{I}_n \supseteq I_n$. One can roughly simulate his calculations by taking M = 6 and computing $\tilde{I}_n$ with an arithmetic based on a decimal floating-point representation with a mantissa of four digits. As for Archimedes’ result (1.1), the final rounded interval, $\tilde{I}_4 = [3.141, 3.144]$, provides bounds for $\pi$ accurate to two decimals. Finally, we note that in a sort of forerunner of Archimedes’ calculation of $\pi$, Aristarchus (circa 250 B.C.) calculated the equivalent of sines and tangents of certain small angles, also by systematic use of lower and upper bounds; see Heath [31, Chapter XII].


4. An Equivalent Formulation. One can show by trigonometry that the second recurrence relation in (2.3) is equivalent to

(4.1)   $\eta_n^2 = 2\zeta_n\eta_{n-1}.$

Using the first relation in (2.3), (4.1), and (2.4) we find that

(4.2)   $P_{2^n M} = \dfrac{2\,p_{2^{n-1}M}\,P_{2^{n-1}M}}{p_{2^{n-1}M} + P_{2^{n-1}M}}, \qquad p_{2^n M} = \sqrt{P_{2^n M}\;p_{2^{n-1}M}}.$

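These relations show that the elements of the sequence

$P_M,\ p_M,\ P_{2M},\ p_{2M},\ P_{4M},\ p_{4M},\ \ldots$

can be generated by taking alternately the harmonic and geometric means of the two preceding elements. The pair of formulas (4.2) is known as the Archimedean algorithm [33, p. 22]. In modern notation, we have

(4.3)   $P_M = M\tan\dfrac{\pi}{M}, \qquad p_M = M\sin\dfrac{\pi}{M},$

and the essential points of the algorithm depend on the identities

$\tan\dfrac{\theta}{2} = \dfrac{\tan\theta\,\sin\theta}{\tan\theta + \sin\theta}, \qquad 2\sin^2\dfrac{\theta}{2} = \tan\dfrac{\theta}{2}\,\sin\theta,$

the familiar limits

$\lim_{\theta\to 0}\dfrac{\sin\theta}{\theta} = \lim_{\theta\to 0}\dfrac{\tan\theta}{\theta} = 1,$

and when $0 < \theta < \pi/2$, the inequalities $\sin\theta < \theta < \tan\theta$ and the monotonicity of $(\sin\theta)/\theta$ and $(\tan\theta)/\theta$.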
Divorcing the formulas (4.2) from their geometrical meaning, define

(4.4)   $a_{n+1} = \dfrac{2a_n b_n}{a_n + b_n}, \qquad b_{n+1} = \sqrt{a_{n+1} b_n}, \qquad n \ge 0.$

It is shown in Section 7 that for any positive initial numbers $a_0$ and $b_0$, the sequences $a_n$ and $b_n$ converge to a common limit. Denoting this limit by $A(a_0, b_0)$, the procedure (4.4) defines a nest of intervals converging to the single point $A(a_0, b_0)$.
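
A minimal floating-point sketch of (4.4) follows. The function name `archimedean` and the choice to return the whole history of pairs are ours; the starting values in the usage line are the hexagon perimeters $a_0 = 2\sqrt{3}$, $b_0 = 3$, for which the limit should be $\pi$.

```python
import math

def archimedean(a0, b0, steps):
    """Iterate (4.4) and return the list of pairs (a_n, b_n), n = 0..steps."""
    a, b = a0, b0
    history = [(a, b)]
    for _ in range(steps):
        a = 2.0 * a * b / (a + b)   # harmonic mean of a_n and b_n
        b = math.sqrt(a * b)        # geometric mean of a_{n+1} and b_n
        history.append((a, b))
    return history

pairs = archimedean(2.0 * math.sqrt(3.0), 3.0, 20)
```

Each pair brackets the limit, with `a_n` decreasing and `b_n` increasing, so the list can be read directly as the nest of intervals described above.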

Archimedes’ calculation used recurrence relations different from (4.4) in order to minimize the count of arithmetic operations. However, his method can be described conceptually by

(4.5)   $A(a_0, b_0) = \pi, \qquad a_0 = M\tan\dfrac{\pi}{M}, \qquad b_0 = M\sin\dfrac{\pi}{M}.$

For completeness, we include here a brief numerical analysis of (4.5).


Numerical stability. Due to roundoff errors, actual computations do not generate the numbers $a_n$ and $b_n$, but perturbed numbers $\tilde{a}_n$ and $\tilde{b}_n$ which we represent by

(4.6)   $\tilde{a}_n = a_n(1 + \delta), \qquad \tilde{b}_n = b_n(1 + \delta).$

The relative error $\delta$ is a measure of the number of significant digits in $\tilde{a}_n$ and $\tilde{b}_n$. Suppose that the arithmetic operations in

(4.7)   $\tilde{a}_{n+1} = \dfrac{2\tilde{a}_n\tilde{b}_n}{\tilde{a}_n + \tilde{b}_n}, \qquad \tilde{b}_{n+1} = \sqrt{\tilde{a}_{n+1}\tilde{b}_n}$

cause negligible roundoff errors. Substitution of (4.6) in (4.7) yields

$\dfrac{\tilde{a}_{n+1} - a_{n+1}}{a_{n+1}} = \delta, \qquad \dfrac{\tilde{b}_{n+1} - b_{n+1}}{b_{n+1}} = \delta,$

thus showing that the relative error is not magnified by the algorithm.
Rate of convergence. Let

$F(h) = \dfrac{M}{h}\tan\dfrac{h\pi}{M} \quad\text{and}\quad G(h) = \dfrac{M}{h}\sin\dfrac{h\pi}{M}.$

We then have $a_n = F(2^{-n})$, $b_n = G(2^{-n})$, and

(4.8)   $F(h) = \pi + c_1 h^2 + c_2 h^4 + \cdots + c_m h^{2m} + O(h^{2m+2}),$
(4.9)   $G(h) = \pi + d_1 h^2 + d_2 h^4 + \cdots + d_m h^{2m} + O(h^{2m+2}),$

where $c_k$ and $d_k$ are constants independent of h. Expansions (4.8) and (4.9) with m = 1 yield

$\pi - a_{n+1} = \tfrac{1}{4}(\pi - a_n) + O(16^{-n}),$

$\pi - b_{n+1} = \tfrac{1}{4}(\pi - b_n) + O(16^{-n}).$

The error at the end of each cycle of the iteration is roughly decreased by a factor of 4.


Richardson extrapolation. The above rate of convergence makes the Archimedean algorithm much too slow for computation beyond a few decimals. However, the expansions (4.8) and (4.9) in even powers of h indicate that the convergence can be accelerated by a method discovered in 1927 by Richardson; see, e.g., [32, pp. 239-41]. The procedure is given by

(4.10)   $T_{0,n} = a_n, \qquad 0 \le n \le N,$

(4.11)   $T_{k,n} = T_{k-1,n+1} + \dfrac{T_{k-1,n+1} - T_{k-1,n}}{4^k - 1}, \qquad 0 \le n \le N - k.$

To show why this procedure speeds up convergence, we proceed as follows. If

$F_0(h) = F(h), \qquad F_k(h) = \dfrac{4^k F_{k-1}(h) - F_{k-1}(2h)}{4^k - 1},$

then (4.8) implies that

$F_k(h) = \pi + C_k h^{2k+2} + O(h^{2k+4}),$

where $C_k$ is a constant independent of h. Since $T_{k,n} = F_k(2^{-k-n})$, we get

$T_{k,n} - \pi = 4^{-(k+1)(k+n)} C_k + O(4^{-(k+2)(k+n)}).$

Thus, the errors in successive elements $T_{k,0}, T_{k,1}, \ldots, T_{k,N-k}$ of the kth stage of the extrapolation are each time roughly reduced by a factor of $4^{k+1}$. The same conclusion holds if (4.10) is replaced by $T_{0,n} = b_n$. It is interesting to note that in this case the elements of the first stage are given by

$T_{1,n} = b_{n+1} + \tfrac{1}{3}(b_{n+1} - b_n),$

which is precisely a formula derived geometrically by Huygens in 1654; see [33, p. 30], [47].
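
The extrapolation scheme (4.10)-(4.11) is short enough to sketch directly. The code below is an illustration in double precision, with function names of our choosing; it builds the triangular array stage by stage from the unextrapolated values $a_0, \ldots, a_N$ of the Archimedean algorithm.

```python
import math

def archimedean_a(a0, b0, N):
    """Return [a_0, ..., a_N] generated by the Archimedean algorithm (4.4)."""
    a, b = a0, b0
    values = [a]
    for _ in range(N):
        a = 2.0 * a * b / (a + b)
        b = math.sqrt(a * b)
        values.append(a)
    return values

def richardson(values):
    """Return stages with stages[k][n] = T_{k,n} as in (4.10)-(4.11)."""
    stages = [list(values)]                      # stage 0: T_{0,n} = a_n
    for k in range(1, len(values)):
        prev = stages[-1]
        factor = 4 ** k - 1
        stages.append([prev[n + 1] + (prev[n + 1] - prev[n]) / factor
                       for n in range(len(prev) - 1)])
    return stages

a_values = archimedean_a(2.0 * math.sqrt(3.0), 3.0, 5)
table = richardson(a_values)
best = table[5][0]          # T_{5,0}, built from only five iterations
```

Comparing `best` with `a_values[-1]` shows the gain: the last unextrapolated value is correct to only about three decimals, while the extrapolated corner of the array is far more accurate.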


Modern computation. Archimedes’ calculation corresponds to (4.5) with M = 6. When executed on a CDC CYBER 73, a program of (4.4) with $a_0 = 2\sqrt{3}$ and $b_0 = 3$ almost instantaneously printed a result accurate to 12 decimals after 20 iterations. When coupled with the extrapolation (4.11), only 5 iterations were needed to get the same accuracy. In double precision, accuracy to 25 decimals required 42 nonextrapolated iterations compared to only 8 iterations in the extrapolated case.


5. The Method of Equal Perimeters. It is a tribute to the master of antiquity that for almost two millennia, before the discovery of the differential calculus led to a new approach to the problem, the Archimedean method remained the basis for the calculation of $\pi$. Of pertinence to this study is a posthumous work of Descartes (1596-1650), in which the polygonal method is regarded from a novel point of view. In a typically Gallic twist, Descartes’ approach consisted of doubling the number of sides of regular polygons while keeping the perimeter constant. We derive the method geometrically as follows. (See Fig. 2.)


Let AB be a side of a $2^{n-1}M$-gon inscribed in a circle of radius OA. Let OC bisect the angle AOB. Draw DE parallel to AB and bisecting CF at G. Since the angle DOE is half the angle AOB, $DE = \tfrac{1}{2}AB$ is a side of a $2^n M$-gon with the same perimeter as the $2^{n-1}M$-gon. We have

(5.1)   $OG = \tfrac{1}{2}(OF + OC).$

Consideration of the right triangle OCD yields

(5.2)   $OD = \sqrt{OC \cdot OG}.$

If we denote the radius of the incircle of the $2^n M$-gon by $r_{2^n M}$, and the radius of its circumcircle by $R_{2^n M}$, then (5.1) and (5.2) yield

(5.3)   $r_{2^n M} = \tfrac{1}{2}\,(r_{2^{n-1}M} + R_{2^{n-1}M}), \qquad R_{2^n M} = \sqrt{r_{2^n M}\,R_{2^{n-1}M}}.$

Thus the elements of the sequence

$r_M,\ R_M,\ r_{2M},\ R_{2M},\ r_{4M},\ R_{4M},\ \ldots$

can be generated by taking alternately the arithmetic and geometric means of the two preceding elements. As n increases, these radii tend to the radius of a circle whose circumference equals the common perimeter of the polygons. Hence, if the initial M-gon has perimeter 2, then $r_{2^n M}$ and $R_{2^n M}$ converge monotonically to $1/\pi$. Observe that in this case, we then have

$r_M = \dfrac{1}{M}\cot\dfrac{\pi}{M}, \qquad R_M = \dfrac{1}{M}\csc\dfrac{\pi}{M},$

that comparison with (4.3) yields

(5.4)   $r_{2^n M} = 1/P_{2^n M}, \qquad R_{2^n M} = 1/p_{2^n M},$

and that (5.4) substituted in (5.3) yields the Archimedean algorithm (4.2). We show in Section 7 that the method of equal perimeters is a special case of an algorithm studied by Pfaff and Borchardt in the nineteenth century.
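
The method of equal perimeters can be tried numerically with a short sketch of (5.3). The function name `equal_perimeters` is ours. The usage line starts from a square of perimeter 2, for which the inradius is 1/4 and the circumradius is $\sqrt{2}/4$, so both radii should approach $1/\pi$.

```python
import math

def equal_perimeters(r, R, steps):
    """Iterate (5.3): new inradius = arithmetic mean, new circumradius = geometric mean."""
    for _ in range(steps):
        r = 0.5 * (r + R)       # inradius of the polygon with twice as many sides
        R = math.sqrt(r * R)    # circumradius of that same polygon
    return r, R

r, R = equal_perimeters(0.25, math.sqrt(2.0) / 4.0, 30)
pi_estimate = 1.0 / r
```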


6. The Quadrature of Conic Sectors. In his first important publication, the Vera Circuli et 
Hyperbolae Quadratura, Gregory (1638-1675) applied the polygonal procedure of Archimedes to 
find areas of conic sectors. By considering areas rather than perimeters, Gregory was able to deal 
simultaneously with elliptic and hyperbolic sectors. His approach disclosed the analytical content 
of the polygonal method and it showed implicitly that the Archimedean algorithm provides a 
single process for evaluating both circular and hyperbolic arctangents. We interpret Gregory’s 
geometrical argument in modern language. (See Fig. 3.) 

Let O be the center of a sector KL of a central conic, let KT and LT be the tangent lines at K and L respectively, and let OT intersect the conic at M. The tangent line at M intersects KT at U and LT at V. The heart of Gregory’s strategy is to compare polygonal areas

$q_1 = OKL, \qquad Q_2 = OKTL,$
$q_2 = OKML, \qquad Q_4 = OKUMVL.$

FIG. 3.

Basic properties of the central conics enable him to get the relations

$q_2 = \sqrt{q_1 Q_2}, \qquad Q_4 = \dfrac{2 q_2 Q_2}{q_2 + Q_2}.$

Repeating the same construction n times, he gets the relations

(6.1)   $q_{2^n} = \sqrt{q_{2^{n-1}}\,Q_{2^n}}, \qquad Q_{2^{n+1}} = \dfrac{2\,q_{2^n} Q_{2^n}}{q_{2^n} + Q_{2^n}}, \qquad n \ge 1,$

where $q_{2^n}$ and $Q_{2^{n+1}}$ are polygonal areas composed of $2^n$ and $2^{n+1}$ triangles respectively. These polygonal areas converge monotonically to the area of the sector, respectively from below and above in the case of an ellipse, and with directions reversed in the case of a hyperbola.

A minor modification of Gregory’s geometry indicates that these polygonal areas can be generated algebraically by the Archimedean algorithm (4.4). Assuming that the angle KOL is acute, extend the lines KT and OL until they meet at J. Let

$a_0 = OJK, \qquad b_0 = q_1 = OKL,$

$a_n = Q_{2^n}, \qquad b_n = q_{2^n}, \qquad n \ge 1.$

It can be shown that $a_1 = 2a_0 b_0/(a_0 + b_0)$. We can now change the order of execution of the two relations (6.1), thus getting (4.4).


Consider an ellipse defined parametrically by $x = a\cos t$, $y = b\sin t$. Suppose that the sector KL is given by

$K: (a, 0), \qquad L: (x_1, y_1) = (a\cos t_1,\ b\sin t_1), \qquad 0 < t_1 < \pi/2.$

One readily deduces that

$a_n = 2^{n-1} ab\tan\dfrac{t_1}{2^n}, \qquad b_n = 2^{n-1} ab\sin\dfrac{t_1}{2^n},$

(6.2)   $A\!\left(\dfrac{a^2 y_1}{2x_1},\ \dfrac{a y_1}{2}\right) = \dfrac{ab}{2}\tan^{-1}\dfrac{a y_1}{b x_1}.$

For a hyperbolic sector, the same expressions hold with the circular functions replaced by their hyperbolic counterparts. Observe that a special case of Gregory’s procedure is in a sense numerically equivalent to Archimedes’ calculation of $\pi$. Indeed, if a sector consists of one twelfth of a circle of radius $2\sqrt{3}$ then $x_1 = 3$, $y_1 = \sqrt{3}$ and (6.2) then becomes $A(2\sqrt{3}, 3) = \pi$.

We refer to [16] and [62] for a general review of Gregory’s contributions. From the point of 
view of our study here, there are two points which are particularly pertinent. The Vera Quadratura 
supplied a stepping stone toward the formalization of the Archimedean algorithm as an analytical 
process for evaluating inverse circular and inverse hyperbolic functions. As shown in the following 
section, such a formalization was obtained in the nineteenth century by Pfaff and Borchardt by 
studying sequences closely related to those generated by the Archimedean algorithm. The second 
point is Gregory’s discovery in 1670 of the series

(6.3)   $\tan^{-1}x = x - \tfrac{1}{3}x^3 + \tfrac{1}{5}x^5 - \tfrac{1}{7}x^7 + \cdots, \qquad |x| \le 1,$

sometimes mistakenly attributed to Leibniz, who found it independently in 1673. This series, combined with certain trigonometric identities, was instrumental in the 1974 computation of $\pi$ to a million decimals. We deal with the high accuracy computation of $\pi$ in Section 10.


7. Borchardt’s Algorithm. In an unpreserved letter to his teacher, Gauss in 1800 suggested the study of sequences $x_n$ and $y_n$ generated by the relations

(7.1)   $x_{n+1} = \tfrac{1}{2}(x_n + y_n), \qquad y_{n+1} = \sqrt{x_{n+1} y_n}, \qquad n \ge 0.$

In his reply [25, pp. 234, 284], Pfaff showed that for any positive numbers $x_0$ and $y_0$ these sequences converge monotonically to a common limit given by

(7.2)   $B(x_0, y_0) = \begin{cases} (y_0^2 - x_0^2)^{1/2}\big/\cos^{-1}\dfrac{x_0}{y_0}, & 0 \le x_0 < y_0, \\[1ex] (x_0^2 - y_0^2)^{1/2}\big/\cosh^{-1}\dfrac{x_0}{y_0}, & 0 < y_0 < x_0. \end{cases}$

Pfaff’s letter was unpublished in 1881 when Borchardt rediscovered this result, which now bears his name [5], [8], [61, Chapter 1].
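
The limit formula (7.2) is easy to test numerically. The sketch below iterates (7.1) in double precision and compares the result with the closed form; the names `borchardt` and `borchardt_closed_form` and the fixed step count are our choices, not part of the sources cited.

```python
import math

def borchardt(x0, y0, steps=40):
    """Iterate (7.1) and return the (numerically) common limit B(x0, y0)."""
    x, y = x0, y0
    for _ in range(steps):
        x = 0.5 * (x + y)       # arithmetic mean
        y = math.sqrt(x * y)    # geometric mean of the new x and the old y
    return y

def borchardt_closed_form(x0, y0):
    """Right-hand side of (7.2); requires x0 != y0."""
    if x0 < y0:
        return math.sqrt(y0 * y0 - x0 * x0) / math.acos(x0 / y0)
    return math.sqrt(x0 * x0 - y0 * y0) / math.acosh(x0 / y0)
```

For example, `borchardt(1.0, 2.0)` exercises the circular case and `borchardt(3.0, 1.0)` the hyperbolic one.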


A quick proof of (7.2) proceeds as follows. Argue by induction that $x_n$ and $y_n$ converge monotonically to the same limit. The ratio $r_n = x_n/y_n$ satisfies $r_{n+1}^2 = \tfrac{1}{2}(1 + r_n)$. If $x_0 < y_0$, let $\theta = \cos^{-1} r_0$. The quantities $s_n = 2^n\cos^{-1} r_n = \theta$ and $c_n = 4^n(x_n^2 - y_n^2) = x_0^2 - y_0^2$ are independent of n. We have

$\lim y_n = \lim \dfrac{2^{-n}|c_n|^{1/2}}{(1 - r_n^2)^{1/2}} = \lim \dfrac{|c_n|^{1/2}}{s_n}\cdot\dfrac{2^{-n}s_n}{\sin(2^{-n}s_n)} = \dfrac{(y_0^2 - x_0^2)^{1/2}}{\theta}.$

If $y_0 < x_0$, let $\theta = \cosh^{-1} r_0$ and proceed similarly.

We now come to the important observation that Borchardt’s algorithm (7.1) is essentially equivalent to the Archimedean algorithm (4.4). Indeed, if $a_n, b_n$ are generated by (4.4) then $x_n = 1/a_n$, $y_n = 1/b_n$ are generated by (7.1) with

(7.3)   $A(a_0, b_0) = 1/B(1/a_0, 1/b_0).$

Conversely, if $x_n, y_n$ follow from (7.1), then $a_n = 1/x_n$, $b_n = 1/y_n$ follow from (4.4) with

$B(x_0, y_0) = 1/A(1/x_0, 1/y_0).$


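Relations (7.2) and (7.3) imply that the common limit of the two sequences generated by the Archimedean algorithm is given by

(7.4)   $A(a_0, b_0) = \begin{cases} a_0 b_0 (a_0^2 - b_0^2)^{-1/2}\cos^{-1}\dfrac{b_0}{a_0}, & a_0 > b_0 > 0, \\[1ex] a_0 b_0 (b_0^2 - a_0^2)^{-1/2}\cosh^{-1}\dfrac{b_0}{a_0}, & b_0 > a_0 > 0. \end{cases}$

The two expressions in (7.4) correspond to (22) and (25) in Phillips [49]. The homogeneity properties

(7.5)   $A(\lambda a_0, \lambda b_0) = \lambda A(a_0, b_0), \qquad B(\lambda x_0, \lambda y_0) = \lambda B(x_0, y_0), \qquad \lambda > 0,$

are valid. Observe that Descartes’ method of equal perimeters was a predecessor of Borchardt’s algorithm with

(7.6)   $B(r_M, R_M) = \dfrac{1}{\pi}, \qquad r_M = \dfrac{1}{M}\cot\dfrac{\pi}{M}, \qquad R_M = \dfrac{1}{M}\csc\dfrac{\pi}{M},$

and that the expressions

$A\!\left(\dfrac{a^2 y}{2x},\ \dfrac{ay}{2}\right) = \dfrac{ab}{2}\tan^{-1}\dfrac{ay}{bx}, \qquad 0 < x < a,$

$A\!\left(\dfrac{a^2 y}{2x},\ \dfrac{ay}{2}\right) = \dfrac{ab}{2}\tanh^{-1}\dfrac{ay}{bx}, \qquad x > a,$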


describe the algebraic content of Gregory’s quadrature of elliptic and hyperbolic sectors. 
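Relations (7.2), (7.5), and standard identities imply that

$\cos^{-1}x = \sqrt{1 - x^2}\,/\,B(x, 1), \quad 0 \le x < 1; \qquad \cosh^{-1}x = \sqrt{x^2 - 1}\,/\,B(x, 1), \quad x > 1;$
$\sin^{-1}x = x/B(\sqrt{1 - x^2}, 1), \quad -1 < x < 1; \qquad \sinh^{-1}x = x/B(\sqrt{1 + x^2}, 1), \quad -\infty < x < \infty;$
$\tan^{-1}x = x/B(1, \sqrt{1 + x^2}), \quad -\infty < x < \infty; \qquad \tanh^{-1}x = x/B(1, \sqrt{1 - x^2}), \quad -1 < x < 1;$

$\ln x = (x - 1)\big/B\!\left(\dfrac{x + 1}{2},\ \sqrt{x}\right), \quad x > 0.$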


In view of (7.3), these functions can also be evaluated by means of the Archimedean algorithm. 
The latter, however, is not as computationally efficient as Borchardt’s algorithm, since each 
harmonic mean in (4.4) requires four arithmetic operations compared to two operations for each 
arithmetic mean in (7.1). In 1972, Carlson [9] coupled Borchardt’s algorithm with Richardson 
extrapolation, in order to obtain a viable technique for the programmed computation of the above 
functions. 
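
To illustrate how Borchardt's algorithm evaluates elementary functions, the sketch below implements a few of the identities listed above in plain double precision, without the Richardson acceleration used by Carlson. All function names are ours, and the fixed number of iterations is simply a generous choice for this precision.

```python
import math

def B(x0, y0, steps=40):
    """Common limit of Borchardt's iteration (7.1)."""
    x, y = x0, y0
    for _ in range(steps):
        x = 0.5 * (x + y)
        y = math.sqrt(x * y)
    return y

def ln(x):
    return (x - 1.0) / B(0.5 * (x + 1.0), math.sqrt(x))

def arctan(x):
    return x / B(1.0, math.sqrt(1.0 + x * x))

def arccos(x):
    return math.sqrt(1.0 - x * x) / B(x, 1.0)

def arctanh(x):
    return x / B(1.0, math.sqrt(1.0 - x * x))
```

Since each call costs 40 square roots, this is a demonstration of the identities rather than a competitive way to compute these functions.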


8. One-Sided Sequences. We can uncouple the recurrence relations (7.1) defining Borchardt’s algorithm in order to generate each of the two sequences independently of the other. Use the invariant $c = 4^n(x_n^2 - y_n^2) = x_0^2 - y_0^2$ to solve for each of the quantities $y_n$, $x_n$ in terms of the other. Substitution of the resulting expressions in (7.1) yields

(8.1)   $x_{n+1} = \dfrac{x_n}{2} + \sqrt{\left(\dfrac{x_n}{2}\right)^2 - 4^{-n-1}c},$

(8.2)   $y_{n+1}^2 = \dfrac{y_n^2}{2}\left(1 + \sqrt{1 + 4^{-n}c\,y_n^{-2}}\right).$

Corresponding relations for the Archimedean algorithm (4.4) are obtained by letting $x_n = 1/a_n$, $y_n = 1/b_n$:

(8.3)   $a_{n+1} = \dfrac{2^{2n+1}}{c\,a_n}\left(1 - \sqrt{1 - 4^{-n}c\,a_n^2}\right),$

(8.4)   $b_{n+1}^2 = \dfrac{2^{2n+1}}{c}\left(\sqrt{1 + 4^{-n}c\,b_n^2} - 1\right).$

We relate the above recurrence relations to certain old and new algorithms.

The calculation of Archimedes. The two relations (2.3) can easily be uncoupled to get

(8.5)   $\zeta_{n+1} = \zeta_n + \sqrt{\zeta_n^2 + 1},$

(8.6)   $\eta_{n+1}^2 = 2\eta_n^2 + 2\eta_n\sqrt{\eta_n^2 - 1}.$

Letting $c = -1/M^2$, (8.5) is equivalent to (8.1) with $x_n = \zeta_n/2^n M$ and to (8.3) with $a_n = 1/x_n$, whereas (8.6) is equivalent to (8.2) with $y_n = \eta_n/2^n M$ and to (8.4) with $b_n = 1/y_n$.
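
One way to convince oneself that the uncoupled recurrences reproduce the coupled ones is to run them side by side. The sketch below does this for (8.1) and (8.2) against Borchardt's iteration (7.1); it assumes the forms $x_{n+1} = x_n/2 + \sqrt{(x_n/2)^2 - 4^{-n-1}c}$ and $y_{n+1}^2 = \tfrac12 y_n^2\,(1 + \sqrt{1 + 4^{-n}c\,y_n^{-2}})$ with $c = x_0^2 - y_0^2$, and the function names are ours.

```python
import math

def borchardt_pairs(x0, y0, steps):
    """Coupled iteration (7.1); returns [(x_0, y_0), ..., (x_steps, y_steps)]."""
    x, y = x0, y0
    pairs = [(x, y)]
    for _ in range(steps):
        x = 0.5 * (x + y)
        y = math.sqrt(x * y)
        pairs.append((x, y))
    return pairs

def one_sided_x(x0, c, steps):
    """One-sided recurrence (8.1) for x_n, using only the invariant c."""
    x, xs = x0, [x0]
    for n in range(steps):
        x = 0.5 * x + math.sqrt((0.5 * x) ** 2 - c / 4.0 ** (n + 1))
        xs.append(x)
    return xs

def one_sided_y(y0, c, steps):
    """One-sided recurrence (8.2) for y_n, using only the invariant c."""
    y, ys = y0, [y0]
    for n in range(steps):
        y = math.sqrt(0.5 * y * y * (1.0 + math.sqrt(1.0 + c / (4.0 ** n * y * y))))
        ys.append(y)
    return ys
```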


The relation of Bhaskara (12th century A.D.). This Indian mathematician [33, p. 22], [4, p. 26] obtained the value 3927/1250 for $\pi$ by using polygons with 12, 24, 48, 96, 192, and 384 sides. His calculation is based on the formula

(8.7)   $s_{n+1}^2 = 2 - \sqrt{4 - s_n^2},$

where $s_n$ is the side of a $2^n M$-gon inscribed in a unit circle. The recurrence relation (8.7) is equivalent to (8.4) with $b_n = 2^n M s_n$, and $c = -1/4M^2$.


Viète’s infinite product (1593). The first explicit expression for $\pi$ by an infinite sequence of operations [33, p. 26], [4, p. 92] can be described by:

$g_0 = \sqrt{2}, \qquad g_1 = \sqrt{\tfrac{1}{2} + \tfrac{1}{2}\sqrt{\tfrac{1}{2}}}, \qquad g_n = \sqrt{\tfrac{1}{2} + \tfrac{1}{2}g_{n-1}}, \quad n \ge 2,$

$\lim\,(g_0 g_1 \cdots g_n) = \dfrac{4}{\pi}.$

It turns out that $g_0 g_1 \cdots g_n = y_n$, where $y_n$ is the one-sided sequence corresponding to $B(1, \sqrt{2}) = 1/\sin^{-1}\sqrt{1/2}$.
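
A small numerical sketch of Viète's product follows. It assumes the factors are $g_0 = \sqrt{2}$, $g_1 = \sqrt{1/2 + (1/2)\sqrt{1/2}}$ and $g_n = \sqrt{1/2 + g_{n-1}/2}$ thereafter, so that the partial products tend to $4/\pi$; the function name is ours.

```python
import math

def viete_partial_products(count):
    """Return [g_0, g_0*g_1, ..., g_0*...*g_count]."""
    product = math.sqrt(2.0)          # g_0
    products = [product]
    factor = math.sqrt(0.5)           # seed so that the first update gives g_1
    for _ in range(count):
        factor = math.sqrt(0.5 + 0.5 * factor)
        product *= factor
        products.append(product)
    return products

partial = viete_partial_products(25)
pi_estimate = 4.0 / partial[-1]
```

The first few entries of `partial` can be compared with the values $y_n$ produced by Borchardt's iteration started from (1, $\sqrt{2}$).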


The relation of Descartes (17th century A.D.). Instead of using the method described in Section 5, Descartes actually generated the one-sided sequence $x_n$ corresponding to (7.6) with M = 4. He used an ingenious geometrical construction [33, p. 32], [4, p. 118] algebraically equivalent to

(8.8)   $x_{n+1}(x_{n+1} - x_n) = 4^{-n-3}.$

Solving for positive $x_{n+1}$ in (8.8), we get

$x_{n+1} = \tfrac{1}{2}\left(x_n + \sqrt{x_n^2 + 4^{-n-2}}\right),$

which is the same as (8.1) with $c = -1/16$.


Iterated square root expansions (1961). Thacher [58] showed that if $R_1 = \sqrt{2x + 2}$, $R_{n+1} = \sqrt{R_n + 2}$, then $t_n = 2^n\sqrt{|R_n - 2|}$ converges to $\cos^{-1}x$ if $|x| < 1$ and to $\cosh^{-1}x$ if $|x| > 1$. For positive x, we have that $t_n = b_{n+1}$, where $b_n$ corresponds to $A(\sqrt{|1 - x^2|}/x,\ \sqrt{|1 - x^2|})$.
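
The iterated square root expansion can be tried directly. The sketch below assumes the indexing $R_1 = \sqrt{2x + 2}$, $R_{n+1} = \sqrt{R_n + 2}$, $t_n = 2^n\sqrt{|R_n - 2|}$, which is our reading of the recurrence; the function name is ours. Because $R_n - 2$ suffers cancellation as n grows, n should stay moderate in double precision.

```python
import math

def thacher(x, n):
    """Return t_n = 2^n * sqrt(|R_n - 2|) for n >= 1 and x > -1."""
    R = math.sqrt(2.0 * x + 2.0)      # R_1
    for _ in range(n - 1):
        R = math.sqrt(R + 2.0)        # R_{k+1} = sqrt(R_k + 2)
    return 2.0 ** n * math.sqrt(abs(R - 2.0))

circular = thacher(0.5, 10)     # approximates arccos(0.5)
hyperbolic = thacher(2.0, 10)   # approximates arccosh(2.0)
```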


Evaluation of complex logarithms (1981). Carlson’s treatment [9] of Borchardt’s algorithm can be extended to the complex plane. Given a complex number z with $|z| > 1$, consider the iteration

(8.9)   $\xi_0 = \dfrac{z + 1}{z - 1}, \qquad \xi_{n+1} = \xi_n + \sqrt{\xi_n^2 - 1}, \qquad x_n = 2^{-n-1}\xi_n,$

done in complex arithmetic. It turns out that $x_n$ corresponds to

$B\!\left(\dfrac{z + 1}{2(z - 1)},\ \dfrac{\sqrt{z}}{z - 1}\right) = 1/\ln z.$

Combined with Richardson extrapolation and standard identities, the iteration (8.9) yields viable software for evaluating complex logarithms and other functions [40].
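
The complex iteration can be sketched with Python's `cmath`. This is only an unaccelerated illustration, not the software of [40]. It assumes the form $\xi_0 = (z + 1)/(z - 1)$, $\xi_{n+1} = \xi_n + \sqrt{\xi_n^2 - 1}$, $x_n = 2^{-n-1}\xi_n \to 1/\ln z$, and it further assumes that the principal branch of the square root is the right one, which appears to hold for $|z| > 1$ away from the negative real axis.

```python
import cmath

def complex_log(z, steps=20):
    """Approximate the principal logarithm of z (|z| > 1) by the one-sided iteration."""
    xi = (z + 1) / (z - 1)
    for _ in range(steps):
        xi = xi + cmath.sqrt(xi * xi - 1)   # principal square root (assumed branch)
    x = xi / 2 ** (steps + 1)               # x_n, which tends to 1 / ln z
    return 1 / x

approx = complex_log(2 + 3j)
```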


9. Carlson’s Unified Theory. In 1971, B.C. Carlson [8] presented a unified convergence analysis for all algorithms of the form

(9.1)   $x_{n+1} = f_i(x_n, y_n), \qquad y_{n+1} = f_j(x_n, y_n), \qquad n \ge 0,$

where

$f_1(x, y) = \tfrac{1}{2}(x + y), \qquad f_2(x, y) = (xy)^{1/2},$

$f_3(x, y) = \left(x\,\dfrac{x + y}{2}\right)^{1/2}, \qquad f_4(x, y) = \left(y\,\dfrac{x + y}{2}\right)^{1/2},$

and where i, j are fixed indices among the numbers 1, 2, 3, 4. The expressions (9.1) define sixteen algorithms involving arithmetic and geometric means, twelve of which are nontrivial (those with $i \ne j$), including Borchardt’s algorithm when i = 1, j = 4. It turns out that when $i \ne j$ the sequences $x_n$ and $y_n$ converge to a common limit of the form

$L_{ij}(x_0, y_0) = \left[R(a;\ b, b';\ x_0^2, y_0^2)\right]^{-1/2a},$

where

$R(a;\ b, b';\ x^2, y^2) = \dfrac{1}{B(a, a')}\displaystyle\int_0^\infty t^{a'-1}(t + x^2)^{-b}(t + y^2)^{-b'}\,dt,$

$B(a, a')$ denotes the beta function, $a'$ is defined by $a + a' = b + b'$, and a, b, b' depend on i and j. We refer to [8, pp. 501-3] for the proof. The homogeneity property analogous to (7.5),

(9.2)   $L_{ij}(\lambda x_0, \lambda y_0) = \lambda L_{ij}(x_0, y_0), \qquad \lambda > 0,$

is clearly valid.
For Borchardt’s algorithm, the common limit is given by

(9.3)   $B(x_0, y_0) = L_{14}(x_0, y_0) = \left[R\!\left(\tfrac{1}{2};\ \tfrac{1}{2}, 1;\ x_0^2, y_0^2\right)\right]^{-1},$

with the R-function in this case degenerating to a function proportional to $\cos^{-1}(x_0/y_0)$ or $\cosh^{-1}(x_0/y_0)$. A judicious choice of the starting numbers $x_0$, $y_0$ yields an inverse circular, inverse hyperbolic, or logarithmic function. By accelerating the basic iteration with Richardson extrapolation, Carlson [9] obtained an efficient and stable method for the computation of these functions.
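
The whole family (9.1) fits in a few lines. The sketch below assumes the four means are $f_1 = (x + y)/2$, $f_2 = \sqrt{xy}$, $f_3 = \sqrt{x(x + y)/2}$ and $f_4 = \sqrt{y(x + y)/2}$, and that both components are updated from the old pair. The names `MEANS` and `carlson_limit` and the fixed step count are ours.

```python
import math

MEANS = {
    1: lambda x, y: 0.5 * (x + y),
    2: lambda x, y: math.sqrt(x * y),
    3: lambda x, y: math.sqrt(x * 0.5 * (x + y)),
    4: lambda x, y: math.sqrt(y * 0.5 * (x + y)),
}

def carlson_limit(i, j, x0, y0, steps=60):
    """Iterate (9.1) with the pair of means (f_i, f_j), i != j, and return the limit."""
    if i == j:
        raise ValueError("the algorithm is trivial when i == j")
    fi, fj = MEANS[i], MEANS[j]
    x, y = x0, y0
    for _ in range(steps):
        x, y = fi(x, y), fj(x, y)   # simultaneous update from the old pair
    return 0.5 * (x + y)

borchardt_value = carlson_limit(1, 4, 1.0, 2.0)      # Borchardt: i = 1, j = 4
agm_value = carlson_limit(1, 2, 1.0, math.sqrt(2.0)) # Gauss: i = 1, j = 2
```

Sixty steps is far more than the quadratically convergent case needs, but it keeps the sketch uniform across the linearly convergent members of the family.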

Certain other algorithms in the class defined by (9.1) deal with fascinating and relevant historical material. The one with i = 1 and j = 3, discovered by Carlson, has a limit

(9.4)   $L_{13}(x_0, y_0) = \left[R\!\left(\tfrac{1}{4};\ \tfrac{3}{4}, \tfrac{1}{2};\ x_0^2, y_0^2\right)\right]^{-2},$

which involves an inverse lemniscatic sine,

$\operatorname{arcsl} r = \displaystyle\int_0^r (1 - t^4)^{-1/2}\,dt, \qquad r^2 \le 1.$

This function is the length of the arc of Bernoulli’s lemniscate $r^2 = \cos 2\theta$ from the origin to the point with radial coordinate r. For a description of lemniscate functions, see Markushevich [36] or Siegel [57]. A particular case of (9.4) is

(9.5)   $L_{13}(1, 0) = 1/A^2 = 16B^2/\pi^2,$

where $A = \operatorname{arcsl} 1$ and $B = \pi/(4A)$ are the so-called lemniscate constants. A computational history of these constants is given in Todd [60]. Whereas Borchardt’s algorithm is connected with the rectification of the circle, Carlson’s algorithm is related to the rectification of the lemniscate. Moreover, the derivation of (9.4) provides a succinct proof of Fagnano’s doubling in 1718 of a lemniscatic arc with ruler and compass. This geometrical construction pointed Euler in 1753 to the addition theorem for general elliptic integrals. This result in turn led Gauss to the development of the theory of lemniscate functions in 1797-98 and to the discovery of general elliptic functions in 1799. For a lively account of the pertinent history, see [63], [53], [57, pp. 1-7].


The best known algorithm of type (9.1) occurs with i = 1 and j = 2. The corresponding limit $L_{12}(x_0, y_0)$ involves a complete elliptic integral of the first kind,

(9.6)   $I(x, y) = \displaystyle\int_0^{\pi/2} (x^2\cos^2 t + y^2\sin^2 t)^{-1/2}\,dt.$

Lagrange first recorded the algorithm in 1784-85 by repeatedly using a transformation due to Landen,

$I(x_{n+1}, y_{n+1}) = I(x_n, y_n);$

see [34, pp. 267, 272], [63, pp. 13-16]. Unaware of Lagrange’s work, Gauss in 1791, at age fourteen, experimented numerically with the algorithm using $x_0 = 1$ and $y_0 = \sqrt{2}$. He showed in 1799 that the limit in that case is

(9.7)   $L_{12}(1, \sqrt{2}) = \pi/(2A),$

where A is the lemniscate constant [25, p. 542]. This result led him from lemniscate functions to general elliptic functions. In 1818, in a work on the motion of planets, he proved the convergence for arbitrary starting values [24, pp. 352-355]. The limit, which is given by

(9.8)   $L_{12}(x_0, y_0) = \left[R\!\left(\tfrac{1}{2};\ \tfrac{1}{2}, \tfrac{1}{2};\ x_0^2, y_0^2\right)\right]^{-1} = \pi/(2I(x_0, y_0)),$

is called the arithmetic-geometric mean of $x_0$ and $y_0$.


Of all algorithms defined by (9.1), only the arithmetic-geometric algorithm of Gauss enjoys quadratic convergence, i.e., the error at each step is approximately proportional to the square of the previous error. This means that each cycle of the iteration roughly doubles the number of significant digits. This rapid convergence accounts for the continuing use of the arithmetic-geometric mean in the computation of the elliptic integral $I(x, y)$; see, e.g., [14, Algorithms 149, 155]. The next section describes a recent method for computing $\pi$ based on Gauss’s algorithm. The other algorithms in (9.1) converge with a linear rate, the error being roughly reduced by a factor of 4 at each step, except for the algorithm with i = 3 and j = 4, which has a linear rate with factor 2.

Both Gauss’s and Borchardt’s algorithms are special cases of three-dimensional algorithms 
investigated by Carlson [6], [7]. In 1979, Carlson also presented algorithms for the practical 
computation of incomplete elliptic integrals of all three kinds [11]. The iterations, based on 
successive applications of the duplication theorem, are accelerated by including terms up to fifth 
order of Taylor series of the integrals. The resulting algorithms, which require only rational 
operations and square roots, converge linearly with the error roughly decreased by a factor of 
4096 in each cycle. 


10. Extreme Accuracy Computations. A chronology of the calculation of $\pi$ can be found in Hobson [33], Schepler [52], and Wrench [65]. The advent of the digital computer allowed computations to hundreds of thousands of digits, and in 1974, the value of $\pi$ was tabulated to a million decimals [29]. Interesting discussions of such extreme computations are given in Beckmann [4, Chapters 8, 10] and Davis [15, pp. 55-75].

Borchardt’s algorithm, which is a computationally efficient form of the Archimedean algorithm, and the arithmetic-geometric algorithm of Gauss are both in the class of algorithms (9.1) investigated by Carlson. Whereas the former is only linearly convergent with factor 4, the latter is quadratically convergent. Even when accelerated, Borchardt’s algorithm in (7.6) is hopelessly slow for the computation of $\pi$ beyond a few decimals. To date, the most successful techniques for high accuracy computation are based either on Gregory’s series (6.3) applied to certain identities expressing $\pi$ as a linear combination of arctangents, or, in a rather surprising twist, on the rapidly converging algorithm of the arithmetic-geometric mean applied to a formula for $\pi$ recently presented by Salamin [51]. We briefly describe these two different techniques.

In 1961, Shanks and Wrench [56] used the relation

(10.1)   $\pi = 24\tan^{-1}\tfrac{1}{8} + 8\tan^{-1}\tfrac{1}{57} + 4\tan^{-1}\tfrac{1}{239}$

to obtain a value to over 100,200 decimals. The computation, which was verified by using the relation

(10.2)   $\pi = 48\tan^{-1}\tfrac{1}{18} + 32\tan^{-1}\tfrac{1}{57} - 20\tan^{-1}\tfrac{1}{239},$

took 8 hours 43 minutes on an IBM 7090. The two relations (10.1) and (10.2) were subsequently used by Guilloud and associates to ultimately calculate $\pi$ to 1,000,000 decimals [29]. The run time on a CDC 7600, not including the verification, was 23 hours 18 minutes, of which 1 hour 7 minutes was used to convert the final result from binary to decimal. Results of statistical tests, which generally support the conjecture that $\pi$ is normal, are also given in [29].
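
The arctangent technique can be imitated on a small scale with Python's unbounded integers. The sketch below is not the program of Shanks and Wrench; it simply sums Gregory's series for $\tan^{-1}(1/q)$ in fixed-point integer arithmetic with a few guard digits, using the coefficients of (10.1) and (10.2). The names `arctan_inverse`, `pi_scaled`, `FORMULA_10_1` and `FORMULA_10_2` are ours.

```python
def arctan_inverse(q, scale):
    """Integer approximation of scale * arctan(1/q) by Gregory's series."""
    total = 0
    power = scale // q            # scale / q^(2k+1), truncated
    q_squared = q * q
    k = 0
    while power:
        term = power // (2 * k + 1)
        total += term if k % 2 == 0 else -term
        power //= q_squared
        k += 1
    return total

def pi_scaled(digits, formula, guard=10):
    """Return the integer floor(pi * 10**digits), assuming the guard digits suffice."""
    scale = 10 ** (digits + guard)
    value = sum(coeff * arctan_inverse(q, scale) for coeff, q in formula)
    return value // 10 ** guard

FORMULA_10_1 = [(24, 8), (8, 57), (4, 239)]
FORMULA_10_2 = [(48, 18), (32, 57), (-20, 239)]

first = pi_scaled(40, FORMULA_10_1)
check = pi_scaled(40, FORMULA_10_2)
```

Running the same computation with both relations mirrors, in miniature, the way the second relation was used to verify the first.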

Salamin’s identity [51], which expresses $\pi$ in terms of two arithmetic-geometric means, is readily derived as follows. A standard relation due to Legendre can be expressed as

(10.3)   $J(1, k)\,I(1, k') + I(1, k)\,J(1, k') - I(1, k)\,I(1, k') = \pi/2, \qquad k^2 + k'^2 = 1,$

where $I(x, y)$ and $J(x, y)$ are complete elliptic integrals, defined respectively by (9.6) and

$J(x, y) = \displaystyle\int_0^{\pi/2} (x^2\cos^2 t + y^2\sin^2 t)^{1/2}\,dt.$

By Landen’s transformation, if $x_n$ and $y_n$ are generated by (9.1) with i = 1 and j = 2, then

(10.4)   $J(x_0, y_0) = \pi\left[x_0^2 - \tfrac{1}{2}\displaystyle\sum_{n=0}^{\infty} 2^n (x_n^2 - y_n^2)\right]\Big/\,2L_{12}(x_0, y_0).$

Substitution of (9.8) and (10.4) in (10.3) yields

(10.5)   $\pi = \dfrac{4\,L_{12}(1, k)\,L_{12}(1, k')}{1 - \displaystyle\sum_{n=1}^{\infty} 2^n (z_n^2 + z_n'^2)},$

$k^2 + k'^2 = 1, \qquad z_n^2 = x_n^2 - y_n^2, \qquad z_n'^2 = x_n'^2 - y_n'^2,$

where $x_n, y_n$ correspond to $L_{12}(1, k)$ and $x_n', y_n'$ correspond to $L_{12}(1, k')$.

The computation of $\pi$ by (10.5) benefits from the rapid convergence of Gauss’ algorithm. Since (10.5) defines a continuum of formulas, the computation can be done with one pair k, k' and then verified by using a different pair. The case k = k' causes the two arithmetic-geometric means to coincide, thus halving the work:

(10.6)   $\pi = \dfrac{4\,[L_{12}(1, 1/\sqrt{2})]^2}{1 - \displaystyle\sum_{n=1}^{\infty} 2^{n+1} z_n^2}.$

An interesting observation, apparently unreported before, is that the computation of $\pi$ by (10.6) yields the lemniscatic constant A as a by-product. Indeed, Gauss’ result (9.7) and the relation $L_{12}(1, 1/\sqrt{2}) = L_{12}(1, \sqrt{2})/\sqrt{2}$ imply that

(10.7)   $A = \sqrt{\pi/2D},$

where D is the denominator in (10.6). Thus, once $\pi$ is known to the desired accuracy, A can be obtained at the cost of a multiplication, a division, and a square root.

D. E. Knuth and associates at Stanford University have been trying to compute $\pi$ to 15 million digits, by programming Salamin’s identity (10.5) combined with diverse devices, including the Schönhage-Strassen algorithm for multiprecision multiplications [1, Section 7.5]. To date, the outcome of this effort has not been recorded in the literature.


11. Classroom Material. In previous sections, we established diverse historical connections and we traced the evolution of the Archimedean algorithm, from its geometrical beginning as a means to approximate $\pi$, to its gradual metamorphosis into an analytical method, in the form of Borchardt’s algorithm, for calculating certain elementary functions. Carlson showed that Borchardt’s algorithm is itself a member of a class of algorithms dependent on invariant transformations of elliptic integrals. This class of algorithms contains the lemniscatic algorithm, which proves explicitly Fagnano’s duplication of a lemniscatic arc by compass and ruler, and the arithmetic-geometric algorithm of Gauss, the only one in the class to enjoy quadratic convergence. Fagnano’s construction led Euler successively to the duplication and addition theorems for elliptic integrals. Gauss’ algorithm combined with the recent formula of Salamin provides a novel technique for computing $\pi$. Moreover, Borchardt’s algorithm is a special case of a three-dimensional algorithm based on successive applications of the duplication theorem. We see that the story of the evaluation of $\pi$ is linked in several ways to invariant transformations of elliptic integrals.

The wealth of connections in this web of old and new algorithms provides considerable 
pedagogic material. An interesting exercise is to derive the limit (7.4) of the Archimedean 
algorithm from the duplication theorem, thereby reversing the theme of this article by using a 
modern result to.derive an old one. An induction argument shows that a, and b, generated by 
(4.4) converge to a common limit a = A(do, by). The degenerate elliptic integral of the first kind, 


(11.1)  $R_C(x, y) = \frac{1}{2} \int_0^{\infty} (t + x)^{-1/2} (t + y)^{-1} \, dt, \quad x, y > 0,$ 

clearly has the properties 

(11.2)  $R_C(1, 1) = 1,$ 
(11.3)  $R_C(\mu x, \mu y) = \mu^{-1/2} R_C(x, y), \quad \mu > 0.$ 

The duplication theorem [10, p. 279] and (11.3) imply the invariance property 

(11.4)  $R_C(a^{-2}, b^{-2}) = R_C\left( \left[ \frac{2ab}{a+b} \right]^{-2}, \left[ \frac{2ab}{a+b} \, b \right]^{-1} \right).$ 


Using (11.3) and (11.2), we get 
$R_C(a_0^{-2}, b_0^{-2}) = R_C(a_1^{-2}, b_1^{-2}) = \cdots = R_C(\alpha^{-2}, \alpha^{-2}) = \alpha.$ 


Finally, to get (7.4) put $(t + a_0^{-2})/(t + b_0^{-2})$ equal to $\cos^2\theta$ or $\cosh^2\theta$ according as $a_0 > b_0$ or 
$a_0 < b_0$. 
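
The invariance (11.4) can be checked numerically. The sketch below is illustrative only: it assumes the standard closed form of the degenerate integral (an inverse cosine when x < y, an inverse hyperbolic cosine when x > y), which is what the substitution just described produces, and it assumes that the step inside (11.4) is the recurrence (4.4). The function names `rc` and `archimedean_step` are hypothetical.

```python
import math

def rc(x, y):
    """R_C(x, y) of (11.1), evaluated through its closed form (assumed, not quoted)."""
    if x == y:
        return 1.0 / math.sqrt(x)
    if x < y:
        return math.acos(math.sqrt(x / y)) / math.sqrt(y - x)
    return math.acosh(math.sqrt(x / y)) / math.sqrt(x - y)

def archimedean_step(a, b):
    """The map (a, b) -> (2ab/(a+b), sqrt(a_next * b)) that appears inside (11.4)."""
    a_next = 2.0 * a * b / (a + b)
    b_next = math.sqrt(a_next * b)
    return a_next, b_next

# Archimedes' starting values; the invariant should stay equal to the common limit.
a, b = 2.0 * math.sqrt(3.0), 3.0
invariants = []
for _ in range(8):
    invariants.append(rc(a ** -2, b ** -2))
    a, b = archimedean_step(a, b)

print(invariants[0], invariants[-1])  # both close to pi
print(b, a)                           # lower and upper bounds closing in on the same value
```

Only a few steps are taken on purpose: once a and b nearly coincide, the closed form suffers from cancellation in floating point.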

An excellent illustration of the slow rate of convergence of the lemniscatic algorithm compared 
to the quadratic convergence of the arithmetic-geometric algorithm consists of computing the 
lemniscatic constant A by (9.5) and (9.7). Applications of the arithmetic-geometric algorithm deal 
with the computation of the period of a simple pendulum of length L and amplitude $\phi$, 


$T = 2\pi \sqrt{L/g} \, / \, L_{12}(1, \cos(\phi/2)),$ 


and the computation of the circumference of an ellipse $x^2/a^2 + y^2/b^2 = 1$, 

$\ell(a, b) = 2\pi \left( a^2 - \frac{1}{2} \sum_{n=0}^{\infty} 2^n c_n^2 \right) / L_{12}(a, b).$ 


Additional applications of elliptic integrals in geometry, mechanics, and potential theory can be 
found in [10, Section 9.4]. 
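
Both applications are short to program. The following is a minimal sketch, not the article's own program: it assumes the usual arithmetic-geometric iteration (arithmetic mean, geometric mean) with $c_0^2 = a^2 - b^2$ and $c_{n+1} = (a_n - b_n)/2$, takes a ≥ b > 0, and uses g = 9.81 as an illustrative default. All function names are hypothetical.

```python
import math

def agm_with_sum(a, b, max_iter=60):
    """Return (common limit, sum over n >= 0 of 2^(n-1) * c_n^2) for a >= b > 0."""
    c_sq = a * a - b * b            # c_0^2
    weight = 0.5                    # 2^(n-1) at n = 0
    total = weight * c_sq
    for _ in range(max_iter):
        if abs(a - b) <= 1e-15 * max(a, b):
            break
        c_sq = (0.5 * (a - b)) ** 2                 # c_{n+1}^2
        a, b = 0.5 * (a + b), math.sqrt(a * b)      # arithmetic and geometric means
        weight *= 2.0
        total += weight * c_sq
    return 0.5 * (a + b), total

def pendulum_period(length, amplitude, g=9.81):
    """Period of a simple pendulum with the given length and amplitude (radians)."""
    limit, _ = agm_with_sum(1.0, math.cos(amplitude / 2.0))
    return 2.0 * math.pi * math.sqrt(length / g) / limit

def ellipse_circumference(a, b):
    """Circumference of x^2/a^2 + y^2/b^2 = 1, assuming a >= b > 0."""
    limit, total = agm_with_sum(a, b)
    return 2.0 * math.pi * (a * a - total) / limit

print(pendulum_period(1.0, math.radians(60.0)))
print(ellipse_circumference(2.0, 1.0))
```

For a circle (a = b) the sum vanishes and the second function reduces to 2πa, which is a convenient first check.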

Although not in the class of algorithms defined by (9.1), the method of Heron of Alexandria 
for finding square roots deserves special consideration [31, pp. 323-6]. Given a positive number a, 
the method is to make a first guess of $\sqrt{a}$, say $x_0$, and refine it successively by calculating 


(11.5)  $x_{n+1} = \frac{1}{2} \left( x_n + \frac{a}{x_n} \right), \quad n = 0, 1, 2, \ldots.$ 


The iteration can be rewritten in two-dimensional form, 
(11.6)  $x_{n+1} = \frac{1}{2} (x_n + y_n), \quad y_{n+1} = \frac{2 x_n y_n}{x_n + y_n},$ 


by letting $y_n = a/x_n$. Since (11.5) happens to be Newton’s method [32, Chapter 4] applied to the 
equation $x^2 - a = 0$, we immediately know that $x_n$ converges quadratically to $\sqrt{a}$, and consequently, 
that $y_n$ also approaches $\sqrt{a}$. The common limit $\sqrt{x_0 y_0}$ of the algorithm (11.6) is sometimes 
called the arithmetic-harmonic mean of $x_0$ and $y_0$. A geometrical diagram of this algorithm is 
given in Ercolano [17]. By using results given in Eves [18, p. 36], one can readily get similar 
diagrams for Archimedes’ and Borchardt’s algorithms. 
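
Heron's iteration and its two-dimensional form take only a few lines. The sketch below is an illustration under the formulas (11.5) and (11.6) as stated above; the function names and the starting values are arbitrary choices.

```python
def heron(a, x0, steps):
    """Iterates of (11.5): x_{n+1} = (x_n + a / x_n) / 2, starting from x0."""
    xs = [x0]
    for _ in range(steps):
        xs.append(0.5 * (xs[-1] + a / xs[-1]))
    return xs

def arithmetic_harmonic(x0, y0, steps):
    """Iterates of (11.6): arithmetic mean paired with harmonic mean."""
    x, y = x0, y0
    for _ in range(steps):
        x, y = 0.5 * (x + y), 2.0 * x * y / (x + y)
    return x, y

for n, x in enumerate(heron(2.0, 1.0, 5)):
    print(n, x, abs(x * x - 2.0))      # the residual roughly squares at each step

x, y = arithmetic_harmonic(1.0, 2.0, 6)
print(x, y, x * y)                     # the product x * y stays equal to x0 * y0
```

Printing the residual makes the quadratic convergence visible: the number of correct digits roughly doubles per step.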

A very effective exercise for the classroom is to program the recurrence relations (4.4) with 
$a_0 = 2\sqrt{3}$ and $b_0 = 3$, thus getting a modern version of Archimedes’ calculation of π. The 
relations (4.10) and (4.11), which are easily incorporated in the program, provide a dramatic 
illustration of the efficacy of Richardson extrapolation. We described the output of such 
programs in Section 4. Other numerical results are given in Puritz [50], Fox and Hayes [19], and 
Phillips [49]. A theoretical exercise is to use the analyticity of (tan z)/z and (sin z)/z along the 
lines of [40, p. 747] in order to find bounds for the errors 7 — T,,,. 
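
A possible classroom program is sketched below. It assumes that (4.4) is the familiar pair of steps, a harmonic mean followed by a geometric mean, which is consistent with the map appearing in (11.4). The extrapolation shown is one standard Richardson step for an error that shrinks by a factor of about 4 per iteration; it is offered as an illustration and may differ in form from (4.10) and (4.11).

```python
import math

def archimedes(steps):
    """Rows (a_n, b_n) starting from a_0 = 2*sqrt(3), b_0 = 3."""
    a, b = 2.0 * math.sqrt(3.0), 3.0
    rows = [(a, b)]
    for _ in range(steps):
        a = 2.0 * a * b / (a + b)      # harmonic mean of a_n and b_n
        b = math.sqrt(a * b)           # geometric mean of a_{n+1} and b_n
        rows.append((a, b))
    return rows

def extrapolate(values):
    """One Richardson step, assuming the error behaves like a constant times 4^(-n)."""
    return [(4.0 * later - earlier) / 3.0 for earlier, later in zip(values, values[1:])]

rows = archimedes(5)
lower = [b for _, b in rows]
improved = extrapolate(lower)
for n, (a, b) in enumerate(rows):
    print(n, b, a)
print(improved[-1])
```

Row 4 corresponds to the 96-sided polygons and reproduces Archimedes' bounds 223/71 < π < 22/7.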

The instructor should indicate the difficulties involved in programming the multiprecision 
arithmetic and the storage allocation needed for the computation of π to a high number of 
decimals. In single or double precision, however, most techniques involve straightforward pro- 
gramming: the partial sums of Gregory’s series (6.3) are easily combined with the formulas (10.1) 
and (10.2), and Salamin’s relation (10.5) is likewise readily programmed. As mentioned earlier, a 
special case of the latter yields both π and A. A description, well-suited for pedagogic purposes, of 
a class of formulas for π involving two arctangents is given in [38]. 

It should be mentioned that over 200,000 partial quotients of the simple continued fraction for 
π have been computed; see Choong et al. [12], [13] and Gosper [28], [54]. In this connection, 
instructive exercises consist of evaluating classical continued fractions, see Olds [48, pp. 135-8], 
by using 2 × 2 matrices as described in Milne-Thomson [41, pp. 109-10] or Frame [20]. 
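
The matrix technique amounts to multiplying one 2 × 2 matrix per partial quotient. The sketch below is a generic illustration, not taken from the cited references: the running product of the matrices [[a, 1], [1, 0]] holds the current and previous convergents in its columns. The partial quotients used are the well-known opening terms 3, 7, 15, 1, 292 of the simple continued fraction for π.

```python
def convergents(partial_quotients):
    """Convergents p/q from the running product of the matrices [[a, 1], [1, 0]]."""
    # The product matrix is [[p, p_prev], [q, q_prev]]; it starts as the identity.
    p, p_prev, q, q_prev = 1, 0, 0, 1
    result = []
    for a in partial_quotients:
        p, p_prev = p * a + p_prev, p
        q, q_prev = q * a + q_prev, q
        result.append((p, q))
    return result

for p, q in convergents([3, 7, 15, 1, 292]):
    print(f"{p}/{q} = {p / q:.10f}")
```

Working with integers throughout keeps every convergent exact, so the only rounding occurs in the final division used for display.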

Other classroom material on the computation of π can be found in Moakes [42], [43], 
Goldsmith [27], and Maor [35]. Buffon’s needle experiment is readily simulated on a computer [4, 
p. 164]. The original derivation of Wallis’ infinite product makes for an interesting and little-known 
application of interpolation, see Whiteside [64, p. 236ff]. Various classroom notes involving π can 
be located in a cumulative index for this MONTHLY [37, p. 245]. Further pertinent material can be 
found in Davis [15, pp. 52-82], Gardner [21], [22], [23], Anderson [2], Goggins [26], Nanjundiah 
[47], and Shanks [55]. 

In conclusion, we mention some unsolved problems. Although $e^\pi$ has been proved to be 
transcendental, it has not yet been established whether $\pi + e$, $\pi e$, or $\pi^e$ are rational or irrational. 
While it is known that almost all irrational numbers must be normal in every base, the 
normality in the decimal base of such numbers as π, A, $\sqrt{2}$, and e has yet to be proved, though 
statistical evidence supports the conjecture that they are. Shanks and Wrench [56, p. 78] expressed 
the hope for a theory that would measure the difficulty involved in computing basic mathematical 
constants; such a theory currently does not exist. Finally, on a more specific level, whereas 
iterative sequences exist which converge quadratically to π, A, and $\sqrt{2}$, an equally efficient method 
for computing e is not known, see Baxter [3]. 

UNSOLVED PROBLEMS 
EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions ordinarily 
encountered in undergraduate mathematics. Each problem should be accompanied by relevant references (if any 
are known to the author) and by a brief description of known partial results. Manuscripts should be sent to 
Richard Guy, Department of Mathematics and Statistics, The University of Calgary, Calgary, Alberta, Canada 
T2N 1N4. 


DON’T TRY TO SOLVE THESE PROBLEMS! 


RICHARD K. GUY 
Department of Mathematics and Statistics, The University of Calgary, Calgary, Alberta, Canada T2N 1N4 


Such an exhortation will likely produce the opposite effect, but I’m serious, and I'll explain 
why. This article has been in mind for some time, but its eruption is triggered by a proposal from 


Schmuel Schreiber, Department of Mathematics and Computer 
Science, Bar-Ilan University, Ramat-Gan, Israel. 


Problem 0. For an integer a define the set $S_a$ inductively by 
(1) $a \in S_a$, (2) if $k \in S_a$, then $2k + 2 \in S_a$, (3) if $k \in S_a$, then $3k + 3 \in S_a$. 
Equivalently, define a function $s_a(n)$ on the integers by 
(1) $s_a(1) = a$, (2) $s_a(2k) = 2s_a(k) + 2$, (3) $s_a(2k + 1) = 3s_a(k) + 3$. 


For $a < -3$ or $a > 2$ is $s_a$ injective? Or does $S_a$ contain repeated elements? 
Some of you are already scribbling, in spite of the warning! More cautious readers may have 
been reminded of other problems, perhaps one or more of the following. 


Problem 1. The diophantine equation $a^2 + b^2 + c^2 = 3abc$ has the singular solutions (1, 1, 1) 
and (2, 1, 1). Other solutions can be generated from these, because the equation is quadratic in 
each variable, for example, b = 2, c = 1 gives $a^2 - 6a + 5 = 0$, a = 1 or 5 and (5, 2, 1) is a 
solution. Each solution, apart from the singular ones, is a neighbor of just three others, and they 
form a binary tree. Is this a genuine tree, or can the same number be generated by two different 
routes through it? 


Problem 2. Consider the sequence $a_{n+1} = a_n/2$ ($a_n$ even), $a_{n+1} = 3a_n + 1$ ($a_n$ odd). For each 
positive integer $a_1$, is there a value of n such that $a_n = 1$? 


Problem 3. Consider the mapping 
$2m \to 3m, \quad 4m - 1 \to 3m - 1, \quad 4m + 1 \to 3m + 1.$ 
This generates the cycles (1), (2, 3), (4, 6, 9, 7, 5) and (44, 66, 99, 74, 111, 83, 62, 93, 70, 105, 79, 59). 
Are there others? 


Problem 0 can be visualized as a binary tree generated by the pair of unary functions 
$a \to 2a + 2$, $a \to 3a + 3$. For example, if a = 1, we have Figure 1. 


Fig. 1. Binary tree generated by two unary functions. 

The number 66 appears twice in Figure 1, by making three steps to the right, or by making one 
to the left, one to the right and two to the left. A right step multiplies by 3 (roughly); a left step 
multiplies by 2 (roughly); the coincidence is roughly explained by the approximation: two right = 
three left; $3^2 \approx 2^3$. Is this another example of the strong law of small numbers [11]? If we try other 
small values for a, we find similar coincidences (Figure 2). 


Fig. 2. Small values of a lead to coincidences. 


Let us look at a = −4. We’ve omitted the minus signs; alternatively, change the plus signs to 
minuses in each of conditions (2) and (3) in Problem 0. The binary tree is now as in 
Figure 3. 


Fig. 3. Binary tree generated by $a \to 2a - 2$ and $a \to 3a - 3$. 


The numbers that appear, when arranged in numerical order, are 

4,6, 9, 10, 15, 16, 18, 24, 27, 28, 30, 34, 42, 45, 46, 51, 52, 54, 58, 66, 69, 78, 81, 82, 

87, 88, 90, 99, 100, 102, 106, 114, 123, 130, 132, 135, 136, 150, 153, 154, 159, 160, 

162, 171, 172, 174, 178, 195, 196, 198, 202, 204, 210,.... 
Does a number ever occur twice? I sent an earlier draft of this article to John Leech, who 
disobeyed orders, started scribbling, and found that had I continued Figure 3, I would have 
discovered 258 and 402 on the next row, with both of these repeated two rows further down. In 
fact, reinstating the minus signs and using L for a left step and R for a right one, we can write 
these coincidences as 

$-4L^7 = -258 = -4RL^2R^2$ and $-4L^2RL^4 = -402 = -4R^2LR^2$ 
and those starting from 1 and 0 in Figures 1 and 2 as 
$1LRL^2 = 66 = 1R^3$ and $0L^4 = 30 = 0LR^2$. 

The last coincidence starts with a left step in either case, so can be regarded as a coincidence 
starting with 2: $2L^3 = 30 = 2R^2$. Leech used his computer to find coincidences starting with 3, 4 
and 5: 


$3LRLRL^{10} = 177150 = 3R^7LR^2$ 
$4L^4R^8 = 626574 = 4RL^7RLRL^4$ 
$5L^2RL^2R^6 = 241662 = 5R^2L^{12}$. 


Since 3R = 12 = 5L and $3L^2 = 18 = 5R$, this last gives another coincidence starting from 3. We 
have seen coincidences starting from —1, —2, —3 and —4; Leech also found 


$-5L^3R^4 = -1986 = -5R^2L^6$ 
$-7LRL^2RL^3RL^4 = -143250 = -7R^2L^2R^6$ 
$-8LRL^8 = -9474 = -8R^4LR^2$ 
$-9L^{10} = -7170 = -9RL^2R^4$ 
$-10L^3RL^6 = -12354 = -10R^2LR^4$ 
$-11L^5R^2L^6 = -166146 = -11R^4LRL^2R^2$ 
Fig. 4. Binary tree of Markoff triples. 


and conjectures that there will always be “duplicate fruits on the trees” no matter where you start. 
If you want a coincidence starting with −6, note that −6L = −10, but it would be cleaner to 
find one of the form —6L... = —6R... and I’m tempted to strengthen Leech’s conjecture to: 


¿For each integer n, there are strings L... and R... such that nL... = nR...? 


Stop press! In yet another letter, Leech proves his conjecture, so don’t bother to solve this 
problem! 

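
The coincidences quoted above are easy to confirm by machine. The following sketch is only an illustration (the function names are hypothetical, and nothing is assumed about how Leech organized his own search): it applies a string of L and R steps to a starting value, and runs a breadth-first search for the first repeated value down to a given depth.

```python
from collections import deque

def apply_steps(a, steps):
    """Apply a string of steps: L sends a to 2a + 2, R sends a to 3a + 3."""
    for s in steps:
        a = 2 * a + 2 if s == "L" else 3 * a + 3
    return a

def first_coincidence(a, max_depth):
    """Breadth-first search; returns (value, earlier path, later path) or None."""
    seen = {a: ""}
    queue = deque([(a, "")])
    while queue:
        value, path = queue.popleft()
        if len(path) == max_depth:
            continue
        for s in "LR":
            child = apply_steps(value, s)
            child_path = path + s
            if child in seen:
                return child, seen[child], child_path
            seen[child] = child_path
            queue.append((child, child_path))
    return None

print(first_coincidence(1, 5))                              # the value 66 of Figure 1
print(apply_steps(3, "LRLR" + "L" * 10), apply_steps(3, "R" * 7 + "LRR"))
print(apply_steps(-11, "LLLLLRRLLLLLL"), apply_steps(-11, "RRRRLRLLRR"))
```

The tree doubles in size with each row, so the exhaustive search is practical only for modest depths; checking a proposed pair of strings, by contrast, is immediate.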
In Problem 1 a binary tree is similarly generated by the pair of ternary functions 


$(a, b, c) \to (3ab - c, a, b), \quad (a, b, c) \to (3ac - b, a, c)$ 


as in Figure 4. 
Fig. 5. Simplified Markoff tree. 

Fig. 6. Does the Collatz algorithm give any more cycles? 


We can exhibit more of the tree by simplifying it as in Figure 5. To recapture the triples from this, 
choose any entry for a, and its immediate predecessor for b. Then c is found when travelling up 
the tree, just after the first step after the first rightward step. For example, a = 985 has 
predecessor b = 169. When travelling upwards from 985, the first rightward step is from 29 to 5. 
The next step is from 5 to 2, so c = 2. 

Whether or not there are repetitions in the sequence of Markoff numbers 


1, 2, 5, 13, 29, 34, 89, 169, 194, 233, 433, 610, 985, 1325, 1597, 2897, 
4181, 5741, 6466, 7561, 9077, 10946, 14701, 28657, 33461, 37666, 43261, 51641,... 


has become a notorious problem. There are occasional claims to have proved uniqueness, but 
none seem to hold water [19]. Don Zagier [23] has some results on distribution, but none on 
distinctness. He can show that the problem is equivalent to the unsolvability of a certain system of 
diophantine equations, so we may be nearing the realm of Hilbert’s tenth problem. Hence the title 
of this paper. 
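
The tree of Markoff triples can be generated directly from the two ternary functions. The sketch below is illustrative: it starts from (5, 2, 1), adds the numbers 1 and 2 from the singular solutions by hand, and prunes a branch as soon as its largest entry exceeds a chosen bound, which is safe because both functions increase the largest entry. The function name is hypothetical.

```python
def markoff_numbers(bound):
    """Sorted Markoff numbers not exceeding bound, found by walking the tree."""
    numbers = {1, 2}                 # from the singular solutions (1,1,1) and (2,1,1)
    stack = [(5, 2, 1)]
    while stack:
        a, b, c = stack.pop()
        if a > bound:
            continue                 # children are larger still, so prune here
        assert a * a + b * b + c * c == 3 * a * b * c
        numbers.add(a)
        stack.append((3 * a * b - c, a, b))
        stack.append((3 * a * c - b, a, c))
    return sorted(numbers)

print(markoff_numbers(1000))
```

Collecting the largest entries in a set hides any repetition, so a test of the uniqueness question would instead have to record which triple produced each number.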

Problem 2 is associated with various names: Collatz, Hasse, Kakutani, Syracuse. It is just as 
notorious. Lothar Collatz told me that he thought of it when a student. One of its several waves of 
popularity started when he mentioned it to several people at the 1950 International Mathematical 
Congress in Cambridge (the wrong Cambridge). Presumably some mathematicians from Syracuse 
(the wrong Syracuse) became interested in it; the boys from Syracuse can perhaps fill in that bit of 
history. 
Is the graph of the Collatz sequence unicyclic? Figure 6 includes all the numbers up to 26; the 
branch containing 27 is a much longer one, but still comes down to 1 after 111 steps. 
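
The step count for 27 is quickly confirmed. A minimal sketch, with a hypothetical function name, that counts applications of the rule in Problem 2 until the value 1 is reached:

```python
def collatz_steps(a):
    """Number of steps of the rule in Problem 2 needed to reach 1 from a."""
    steps = 0
    while a != 1:
        a = a // 2 if a % 2 == 0 else 3 * a + 1
        steps += 1
    return steps

print(collatz_steps(27))
print(max(collatz_steps(n) for n in range(1, 27)))   # longest branch among 1..26
```

The loop terminates only if the sequence does reach 1, which is exactly the open question; for the small starting values used here it is known to do so.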

After a long and inconclusive correspondence some years ago, a claimant to have a proof 
eventually admitted that “Erdős says that mathematics is not yet ripe enough for such questions.” 
Hence the title of this paper. 

Problem 3 is one of John Conway’s permutation sequences. It is similar to the Collatz problem, 
but here the function has an inverse 

$3m \to 2m, \quad 3m - 1 \to 4m - 1, \quad 3m + 1 \to 4m + 1$ 
(if the number’s a multiple of 3, take a third off; otherwise add a third on) so the sequence can be 
pursued in either direction. Its graph consists of a number of disjoint cycles and doubly infinite 
chains. But it hasn’t even been proved that an infinite chain exists! What is the status of the 
sequence containing the number 8? 


..., 41, 31, 23, 17, 13, 10, 15, 11, 8, 12, 18, 27, 20, 30, 45, 34, 51, 38, 57, 43, 32, .... 
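
The mapping of Problem 3 and its inverse can be followed in both directions by machine. The sketch below is an illustration with hypothetical function names; the residue tests simply restate the three cases of each rule.

```python
def forward(n):
    """2m -> 3m, 4m - 1 -> 3m - 1, 4m + 1 -> 3m + 1."""
    if n % 2 == 0:
        return 3 * n // 2
    if n % 4 == 3:
        return (3 * n - 1) // 4
    return (3 * n + 1) // 4

def backward(n):
    """3m -> 2m, 3m - 1 -> 4m - 1, 3m + 1 -> 4m + 1."""
    if n % 3 == 0:
        return 2 * n // 3
    if n % 3 == 2:
        return (4 * n + 1) // 3
    return (4 * n - 1) // 3

def orbit(n, step, count):
    """The starting value followed by count further terms under the given map."""
    terms = [n]
    for _ in range(count):
        terms.append(step(terms[-1]))
    return terms

def cycle_from(n, limit):
    """The cycle through n, or None if n does not recur within limit steps."""
    terms = [n]
    for _ in range(limit):
        nxt = forward(terms[-1])
        if nxt == n:
            return terms
        terms.append(nxt)
    return None

print(cycle_from(44, 100))
print(orbit(8, backward, 8)[::-1] + orbit(8, forward, 12)[1:])
print(cycle_from(8, 10000))
```

A result of None from the last call shows only that 8 does not recur within the chosen number of steps; it says nothing about whether the chain is infinite.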


What gives a cycle? Each term is either 3/2 times the previous one, or approximately 3/4 of it. 
Our best chance of getting back to an earlier value is to find a power of 3 which is close to a 
power of 2. The known cycles of lengths 1, 2, 5 and 12 correspond to the approximations of $3^1$, $3^2$, 
$3^5$ and $3^{12}$ by $2^2$, $2^3$, $2^8$ and $2^{19}$. The last is the ratio of D sharp to E flat! In fact in each of 
Problems 0, 2 and 3, the convergents 


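$\frac{1}{1}, \ \frac{2}{1}, \ \frac{3}{2}, \ \frac{8}{5}, \ \frac{19}{12}, \ \frac{65}{41}, \ \frac{84}{53}, \ \frac{485}{306}, \ \frac{1054}{665}, \ \frac{24727}{15601}, \ \frac{50508}{31867}$ 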


to the continued fraction for log 3 to the base 2 are of significance. Note that there are cycles 
corresponding to the denominators 1, 2, 5 and 12. It has been shown that there are none of length 
41, 53 or 306. Computers can push numerical results quite a long way, but it’s not clear that they 
can be of any use with such problems. 
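
The convergents listed above can be reproduced with exact rational arithmetic. The sketch below is illustrative and makes one assumption worth stating: it expands the double-precision value of log 3 to the base 2 rather than the number itself, which is adequate for the first eleven convergents because their denominators are far smaller than the reciprocal of the rounding error. The function name is hypothetical.

```python
import math
from fractions import Fraction

def convergents_of(x, count):
    """First count convergents of the simple continued fraction of the float x."""
    x = Fraction(x)                   # exact rational value of the float
    p, p_prev, q, q_prev = 1, 0, 0, 1
    out = []
    for _ in range(count):
        a = x.numerator // x.denominator      # next partial quotient
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        out.append(Fraction(p, q))
        remainder = x - a
        if remainder == 0:
            break
        x = 1 / remainder
    return out

for c in convergents_of(math.log2(3), 11):
    print(c.numerator, c.denominator)
```

A convergent p/q means that 3 to the power q is close to 2 to the power p; the length of a cycle in Problem 3 is the denominator q.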

In [5] Conway relates families of sequences similar to that in Problem 3 to the vector 
reachability problem and Minsky machines. Hence the title of this paper. 


Postscript. After I'd written this, the following problem of David A. Klarner reached me from 
two different directions. There seem to be some significant similarities. 


Problem 4. Let S be the smallest set of numbers such that $1 \in S$ and if $x \in S$, then 2x, 3x + 2 
and 6x + 3 each belong to S, i.e., 


S = {1,2,4,5, 8,9, 10, 14, 15, 16, 17, 18, 20, 26, 27, 28, 29, 30, 32, 33, 34,...}. 


Does S have positive density? That is, as $n \to \infty$, is 


$\liminf \, |S \cap \{1, 2, \ldots, n\}| / n > 0$? 
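
The set S of Problem 4 is easily generated up to any bound, which allows the counting ratio to be tabulated. The sketch below is an illustration with hypothetical function names; since all three maps increase their argument, any value above the bound can be discarded at once.

```python
def klarner_set(limit):
    """Members of S not exceeding limit, by closure under 2x, 3x + 2, 6x + 3."""
    members = set()
    stack = [1]
    while stack:
        x = stack.pop()
        if x > limit or x in members:
            continue
        members.add(x)
        stack.extend((2 * x, 3 * x + 2, 6 * x + 3))
    return sorted(members)

def counting_ratio(n):
    """The ratio |S intersected with {1, ..., n}| / n for one value of n."""
    return len(klarner_set(n)) / n

print(klarner_set(34))
for n in (10 ** 2, 10 ** 3, 10 ** 4, 10 ** 5):
    print(n, counting_ratio(n))
```

Such a table can suggest how the ratio behaves but, in the spirit of this article, it cannot settle whether the lower limit is positive.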
WELL-DISTRIBUTED MEASURABLE SETS 


WALTER RUDIN 
Department of Mathematics, University of Wisconsin, Madison, WI 53706 


THEOREM. There is a measurable set $A \subset I = [0, 1]$ such that 
$0 < m(A \cap V) < m(V)$ 
for every nonempty open set $V \subset I$. 


Proof. Let CTDP mean: Compact Totally Disconnected subset of I, having Positive measure. 

Let $\{I_n\}$ be an enumeration of all segments in I whose endpoints are rational. Construct 
sequences $\{A_n\}$, $\{B_n\}$ of CTDP’s as follows: 

Start with disjoint CTDP’s $A_1$ and $B_1$ in $I_1$. 

Once $A_1, B_1, \ldots, A_{n-1}, B_{n-1}$ are chosen, their union $C_n$ is CTD, hence $I_n \setminus C_n$ contains a 
measure theory and functional analysis. This unique notebook is reproduced here (in English translation) 
with occasional commentaries and modern references. The volume is introduced by brief anecdotal 
essays about the origins of the Scottish book by Ulam, Kac, Zygmund and Erdős. LAS 


Foundations, P, L. Friedrich Waismann, Lectures on the Philosophy of Mathematics. Ed: Wolfgang 
Grassl. Humanities Pr, 1982, 170 pp, $17 (P). [ISBN: 90-6203-983-9] Six essays from the 50’s by 
eralized inverse, combinatorial properties, and asymptotic forms with numerous excursions into 
applications. Many exercises, references, index. JS 


Algebra, P. Lecture Notes in Mathematics-907: Dualisierende Komplexe in der lokalen Algebra und 
dimension and orders of various derivatives, and solutions are given for each of the 24 cases con- 
sidered. Technical expertise is required. A short discussion of the theory follows. TAV 


Differential Equations, P. Lecture Notes in Mathematics-925: The Riemann Problem, Complete Integra- 
bility and Arithmetic Applications. Ed: D. and G. Chudnovsky. Springer-Verlag, 1982, vi + 373 pp, 
$20 (P). [ISBN: 0-387-11483-1] This volume contains a series of lectures on the Riemann boundary 
value problem which was presented at a seminar held in 1979-80 at the Institut des Hautes Études 
Scientifiques in Bures-sur-Yvette, France, and at Columbia University. CEC 


Numerical Analysis, T(16-17: 1, 2), S*, P*, L. Iterative Methods for the Solution of Equations. 
J.F. Traub. Chelsea Pub, 1982, xviii + 310 pp, $15.95. [ISBN: 0-8284-0312-0] Textually identical 
with the 1964 edition which contained much new material. Rigorous treatment of the mathematical 
foundations of a general theory of iteration algorithms for the numerical solution of real equations 
and systems of real equations with emphasis on algorithm efficiency. No problems. Excellent references 
through 1963. Dated, but very well done. An excellent buy. JK 


Numerical Analysis, T(14-16: 1). Numerical Analysis: A Practical Approach. Melvin J. Maron. Mac- 
millan Pub, 1982, xviii + 471 pp, $26.95. [ISBN: 0-02-475670-9] The topics covered in this textbook 
are the standard ones (roots of equations, linear algebra, approximation and interpolation, quadra- 
ture, and ordinary differential equations). It emphasizes modern algorithms and attempts to provide 
an intuitive understanding of how each works. Few proofs or derivations are given. AO 


Numerical Analysis, T(15-16: 1), L*. Essentials of Numerical Analysis with Pocket Calculator 
Demonstrations. Peter Henrici. Wiley, 1982, vi + 409 pp, $27.95. [ISBN: 0-471-05904-8] This is a 
very interesting new textbook which emphasizes the connections between applied mathematics and 
numerical analysis. A unique feature of the book is that all of the problems are designed to be 
carried out on a programmable pocket calculator. Most of the standard topics are covered. AO 


Numerical Analysis, S(15-17), L*. Numerical Solution of Partial Differential Equations in Science 
and Engineering. Leon Lapidus, George F. Pinder. Wiley, 1982, 677 pp, $44.95. [ISBN: 0-471-09866-3] 
The numerical methods treated in this work include finite differences, finite elements, collocation, 
and boundary elements. After introductory chapters on the basic techniques, each type of partial 
differential equations is treated in a separate chapter. Each of these chapters includes an extensive 
bibliography. AO 


Numerical Analysis, T(14-15: 1), S*, L. Numerical Methods. Robert W. Hornbeck. Quantum Pub, 1975, 
310 pp, $9.95 (P). [ISBN: 0-13-626614-2] A relative bargain. Intended for science and engineering 
students. Includes Taylor’s formula, finite differences, interpolation, approximation, nonlinear 
equations, linear systems, integration, ordinary differential equations, partial differential equations, 
matrix eigenvalue problems. Develops some of the methods. Examples, flowcharts, exercises. 
RWN 


Functional Analysis, P. Lecture Notes in Mathematics-936: Counterexamples in Topological Vector 
Spaces. S.M. Khaleelulla. Springer-Verlag, 1982, xxi + 179 pp, $10.70 (P). [ISBN: 0-387-11565-x] 
Topological vector spaces may be Fréchet, locally convex, bornological, Montel, etc. These notes 
consist of approximately 200 examples of spaces or subspaces enjoying some but not others of these 
properties. Other such counterexamples are given in the category of topological algebras. Open 
problems in the same spirit are included. PZ 


Functional Analysis, T(18), P. Continuous Semigroups in Banach Algebras. Allan M. Sinclair. London 
Math. Soc. Lect. Note Ser., No. 63. Cambridge U Pr, 1982, 145 pp, $19.95 (P). [ISBN: 0-521-28598-4] 
A discussion of the abstract theory of analytic one-parameter semigroups in Banach algebras. LCL 


Analysis, P. Equivalence of Measure Preserving Transformations. Donald S. Ornstein, Daniel J. 
Rudolph, Benjamin Weiss. Memoirs No. 262. AMS, 1982, xii + 116 pp, $7.60 (P). Almost forty years 
ago Kakutani introduced an equivalence between ergodic measure-preserving transformations and showed 
its relevance in studying the measurable cross sections of measurable flows. Only three inequivalent 
types of equivalence of measure-preserving transformations were known until J. Feldman 
resurrected Kakutani’s idea by constructing a fourth type. The present volume is a compilation of 
lectures giving a detailed exposition of Feldman’s ideas and their ramifications. PH 
Analysis, T**(17: 1, 2), S*, P, L. Fixed Point Theory, Volume I. James Dugundji, Andrzej Granas. 
PWN, 1982, 209 pp. [ISBN: 83-01-01142-4] Requires some background in topology and functional 
analysis. Beginning with the Banach contraction principle, it includes the major results with special 
attention to Leray-Schauder theory, with applications to differential and integral equations. 
Numerous exercises, notes and comments and a substantial bibliography. Should serve to standardize 
the terminology of recent results in the field. TAV 


Analysis, P. Lecture Notes in Mathematics-932: Analytic Theory of Continued Fractions. Ed: W.B. 
Jones, W.J. Thron, H. Waadeland. Springer-Verlag, 1982, 240 pp, $12.50 (P). [ISBN: 0-387-11567-6] 
This volume contains the proceedings of a seminar-workshop on recent progress in the analytic theory 
of continued fractions held at Loen, Norway from June 5-30, 1981. CEC 


Analysis, P. The Mathematics of Time: Essays on Dynamical Systems, Economic Processes, and Related 
Topics. Steve Smale. Springer-Verlag, 1980, vi + 151 pp, $16 (P). [ISBN: 0-387-90519-7] Reprints of 
10 of Smale’s papers and reviews on dynamical systems; half the volume consists of his 1967 
Bulletin survey "Differentiable Dynamical Systems." Concludes with a brief, informal autobiograph- 
ical note. LAS 


Geometry, T**(15-16), S, L**. Convex Sets and Their Applications. Steven R. Lay. Wiley, 1982, xvi 
+ 244 pp, $29.50. [ISBN: 0-471-09584-2] Generally set in R^n, this beautiful book is less abstract, 
more accessible to undergraduates than Valentine’s Convex Sets with which it invites comparison. 
Same audience, same goals, same style as Roberts and Varberg’s Convex Functions with which it is a 
natural companion volume. A must for every mathematics library. AWR 


Geometry, T(16: 1, 2), S, L. The Mathematical Theory of Chromatic Plane Ornaments. Thomas W. Wiet- 
ing. Mono. and Textbooks in Pure & Appl. Math., No. 71. Dekker, 1982, vii + 369 pp, $55. [ISBN: 
0-8247-1517-9] With the goal of developing the color theory for plane ornaments, i.e., for periodic 
tilings of the Euclidean plane, the author begins from "scratch," developing the theory of the crys- 
tallographic groups and all of the geometry and group theory necessary to reach his goal. The only 
prerequisites are elementary linear algebra and group theory. SS 


Algebraic Topology, P. Brown-Peterson Homology: An Introduction and Sampler. W. Stephen Wilson. 
CBMS Reg. Conf. Series in Math., No. 48. AMS, 1982, v + 86 pp, $9.20 (P). [ISBN: 0-8218-1699-3] 


Topology, P. G Surgery II. Karl Heinz Dovermann, Ted Petrie. Memoirs No. 260. AMS, 1982, xxiii + 
118 pp, $8 (P). An account of the work of the two authors which develops an equivariant surgery 
theory containing a generalization of the exact Wall sequence. JAS 


Topology, P. Low-Dimensional Topology. Ed: R. Brown, T.L. Thickstun. London Math. Soc. Lect. Note 
Ser., No. 48. Cambridge U Pr, 1982, 246 pp, $27.50 (P). [ISBN: 0-521-28146-6] Papers on knots, man- 
ifolds and homotopy in 2, 3 and 4 dimensions from a 1979 conference at University College of North 
Wales, Bangor, U.K. With the exception of notes by Peter Scott on William Thurston’s lectures on 
3-manifolds, the major survey lectures of the conference are not included. LAS 


Optimization, T(16-17: 3), L. Mathematical Programming and Games. Edward L. Kaplan. Wiley, 1982, 
xx + 588 pp, $34.95. [ISBN: 0-471-03632-3] Linear programming presented with early attention to 
duality and an all-integer pivoting procedure that relieves some complexity in hand calculation. 
Gets to the revised simplex method; last third of book deals with optimal paths in networks and the 
transportation problem. Accessible to a student with modest linear algebra. AWR 


Optimization, T, S*, P, L**. Maxima and Minima Without Calculus. Ivan Niven. Dolciani Math. 
Expos., No. 6. MAA, 1981, xv + 303 pp, $24.50. [ISBN: 0-88385-306-X] A compendium of algebraic, 
geometric, and inequality methods for optimization problems, excluding methods of calculus, linear 
programming, and game theory. Treats isoperimetric problems (in two and three dimensions), tiling 
problems, refractions, and various problems involving inscribed geometric figures. Exercises are 
scattered throughout each chapter, with answers at the end. LAS 


Statistics, T(14-16: 1, 2), S. Statistical Analysis: Resolving Decision Problems in Business and 
Management. Stephen A. Book, Marc J. Epstein. Scott Foresman, 1982, xii + 736 pp, $22.95. [ISBN: 
0-673-16002-5] Descriptive techniques, probability, distributions, statistical inference, prediction 
and forecasting, Bayesian techniques, and nonparametric methods. Presupposes no college mathematics. 
Examples drawn from real situations in business. FLW 


Statistics, T(13-14: 1). Introduction to Statistics, Third Edition. Ronald E. Walpole. Macmillan 
Pub, 1982, xv + 521 pp, $23.95. [ISBN: 0-02-424150-4] Revised, reorganized and expanded version of 
the author’s 1974 Second Edition (TR, December 1974). Material on nonparametric statistics has 
been increased and placed in a separate chapter. New material also includes more descriptive 
statistics, sampling procedures, Bayesian methods, partial and multiple correlation, and Latin 
square designs. (TR, First Edition, October 1968.) RSK 


Statistics, T?(14-15: 1). Elementary Statistical Methods, Third Edition. G. Barrie Wetherill. 
Chapman & Hall, 1982, 356 pp, $14.95 (RP). [ISBN: 0-412-24000-9] Modest revision of the author’s 
Second Edition (TR, March 1974). Main change is the addition of some simple Basic programs. It 
remains an old-fashioned text with a tendency toward a cookbook style. (TR, First Edition, June-July 
1968.) RSK 
Statistics, P. Statistics and Probability: Essays in Honor of C.R. Rao. Ed: G. Kallianpur, P.R. 
Krishnaiah, J.K. Ghosh. Elsevier North Holland, 1982, xi + 722 pp, $144.25. [ISBN: 0-444-86130-0] 
Collection of sixty-six papers covering a wide range of topics, both pure and applied. Also 
includes a list of all Rao’s publications through 1981. Note price. RSK 


Statistics, P. Prior Information in Linear Models. Helge Toutenburg. Wiley, 1982, ix + 215 pp, 
$39.95. [ISBN: 0-471-09974-0] In the Wiley Series in Probability and Mathematical Statistics. Uses 
restrictions, as well as prior estimates, to estimate coefficients in linear regression models. 
Concentrates on robustness of minimax estimators, restricted and two-stage least squares estimators, 
and mixed estimators which allow for part of the prior information to be incorrect. Aimed at both 
statisticians and econometricians. RSK 


Statistics, P*. The Complete Categorized Guide to Statistical Selection and Ranking Procedures. 
Edward J. Dudewicz, Joo Ok Koo. American Sci Pr, 1982, v + 627 pp, $85 (P). [ISBN: 0-935950-03-6] 
Volume 6 in the American Series in Mathematical and Management Sciences. Revised and expanded version 
of Dudewicz’ 1968 bibliography entitled A Categorized Bibliography on Multiple-Decision (Ranking 
and Selection) Procedures. Includes a complete collection of reviews from Mathematical Reviews 
and Zentralblatt für Mathematik in addition to various listings of relevant journal articles and 
other references. Note price. RSK 


Statistics, P*. Teaching of Statistics and Statistical Consulting. Ed: Jagdish S. Rustagi, Douglas 
A. Wolfe. Academic Pr, 1982, xvi + 548 pp, $36. [ISBN: 0-12-604540-2] Proceedings of an interna- 
tional conference held at The Ohio State University in November 1980. Contains invited papers, dis- 
cussions, and contributed papers in five general areas: graduate programs in statistics, teaching 
service courses and short courses, training statisticians for employment in industry and government, 
the role of statistical consulting in graduate training, and statistics at the Open University in 
England. RSK 


Statistics, T7(13: 1). A Primer in Data Reduction: An Introductory Statistics Textbook. A.S.C. 
Ehrenberg. Wiley, 1982, xviii + 305 pp, $51.95. [ISBN: 0-471-10134-6] Nonstandard introduction, 
divided into six parts: statistical data, frequency distributions (and probability), sampling (and 
inference), relationships, communicating data, and empirical generalization. "Plays 
down...statistical inference from samples and tests of significance." Discusses several advanced 
topics, but the treatment is terse. Note price. RSK 


Statistics, T(17-18: 1, 2), P. The Foundations of Multivariate Analysis: A Unified Approach by 
Means of Projection Onto Linear Subspaces. Kei Takeuchi, Haruo Yanai, Bishwa Nath Mukherjee. 
Halsted Pr, 1982, xi + 458 pp, $24.95. [ISBN: 0-85226-964-1] Emphasizes algebraic derivations and 
geometrical interpretations rather than statistical distributions. Techniques covered include 
regression analysis, analysis of variance and covariance, principal component analysis, canonical 
correlation analysis and discriminant analysis, factor analysis, analysis of categorical data, 
theory of distance and its application to classification problems, and analysis of covariance structures. 
RSK 


Statistics, P. Proceedings of the Twenty-Seventh Conference on the Design of Experiments. US Army 
Research Office (P.O. Box 12211, Research Triangle Park, NC), 1981, xvi + 511 pp, (P). Report of a 
conference held at North Carolina State University in Raleigh, North Carolina in October, 1981. 
Contains five survey lectures (including discrete distributions, exploratory data analysis, cumula- 
tive processes) and contributed papers. LAS 


Statistics, S(13-18), L. Statistical Tables for the Social, Biological and Physical Sciences. F.C. 
Powell. Cambridge U Pr, 1982, 96 pp, $15.95; $5.95 (P). [ISBN: 0-521-24141-3; 0-521-28473-2] Tail 
probabilities and/or critical values for many common distributions to help carry out statistical 
tests, parametric and non-parametric. Also brief tables of logs and random numbers. Includes good 
explanations of the use of the tables. FLW 


Statistics, S(15-18), P, L. Topics in Applied Multivariate Analysis. Ed: Douglas M. Hawkins. Cambridge 
U Pr, 1982, 362 pp, $24.95. [ISBN: 0-521-24368-8] Papers presented at a 1981 NRIMS Summer 
Seminar. Discriminant analysis, covariance structures, the log-linear model, scaling to reduce 
dimensionality, automatic interaction detection, and cluster analysis. Discusses methods that are 
widely applied and often little understood by their users. FLW 


Statistics, S(14-18), P, L*. Comparative Statistical Inference, Second Edition. Vic Barnett.
Wiley, 1982, xvi + 325 pp, $38. [ISBN: 0-471-10076-5] A thorough revision to "reflect changes of
substance and attitude." The purpose is still to "develop the various principles and methods of
inference and decision making to such a level that the readers can appreciate...the different
approaches." (TR, First Edition, November 1974.) FLW 


Statistics, T(16-18: 1), S. Design and Analysis of Experiments. M.N. Das, N.C. Giri. Halsted Pr, 
1979, vii + 295 pp, $14.95. [ISBN: 0-470-26861-1] Complete and incomplete block designs, split-plot 
designs, factorial experiments, bio-assays, response surfaces, weighing designs, analysis of covari- 
ances. Uses partial derivatives but little matrix algebra. FLW 


Computer Literacy, S. Word Processing and Text Editing. John Zarrella. Microcomputer Applications, 
1980, 148 pp, $8.95 (P). [ISBN: 0-935230-01-7] Elementary introduction to the vocabulary of word 
processing. LAS


Computer Programming, L?. Practical Pascal Programs. Gregory Davidson. Osborne/McGraw-Hill, 1982,
ix + 205 pp, $15.99 (P). [ISBN: 0-931988-74-8] A collection of programs translated into Pascal from 
Basic. The programs represent a variety of problem areas including finance, management, and statis- 
tics. AO 


Computer Programming, S(13), P. Your ATARI Computer: A Guide to ATARI 400/800 Computers. Lon Poole, 
Martin McNiff, Steven Cook. Osborne/McGraw-Hill, 1982, vi + 458 pp, $15 (P). [ISBN: 0-931988-65-9] 
A guide for users of ATARI computers. Includes a detailed introduction to programming in ATARI
Basic, an introduction to ATARI hardware and its use and a complete reference manual. CEC


Computer Programming, S*, L. Problem Solving and Computer Programming. Peter Grogono, Sharon H. 
Nelson. Addison-Wesley, 1982, xvi + 284 pp, $14.95 (P). [ISBN: 0-201-02460-8] In two distinct 
parts: four chapters on general methods of solving problems (illustrated with many classic problems) 
followed by six on specific principles of good programming (illustrated with examples in Pascal). 
Intended for students with some programming experience, the book aims to help students separate the 
task of solving a problem from the task of writing a program, so their thinking does not become
dominated by the structures of a particular language. LAS


Computer Programming, T(13: 1). A First Course in Computer Programming Using Pascal. Arthur M. 
Keller. McGraw-Hill, 1982, xiii + 306 pp, $14.95 (P). [ISBN: 0-07-033508-7] A thorough
introduction to programming in the spirit of CS 1 of Curriculum '78, emphasizing procedures as the first
control structure. The elegance of the text is matched by the elegance of its appearance: it was 
prepared and typeset at Stanford in TEX, and shows off some of TEX’s multiple font features. LAS 


Computer Science, P. Multicomputers and Image Processing: Algorithms and Programs. Ed: Kendall 
Preston, Jr., Leonard Uhr. Academic Pr, 1982, xx + 470 pp, $34. [ISBN: 0-12-564480-9] Thirty-four 
papers from a meeting at Madison, Wisconsin in May, 1981. Architecture, languages, algorithms; some 
general articles and some on special systems for image processing. RWN 


Computer Science, P. The Handbook of Random Number Generation and Testing with TESTRAND Computer 
Code. Edward J. Dudewicz, Thomas G. Ralley. American Sci Pr, 1981, xi + 634 pp, $95 (P). [ISBN:
0-935950-01-X] Mainly listings and results of the TESTRAND package which includes 20 generators and 15
tests. Some history and general discussion. Should be considered by organizations where good random
number generation is critical. RWN


Applications, S(17-18), P, L*. New Directions in Applied Mathematics. Ed: Peter J. Hilton, Gail S.
Young. Springer-Verlag, 1982, ix + 163 pp, $24. [ISBN: 0-387-90604-5] Seven lectures on modern
topics in applied mathematics--combinatorics, control theory, system theory, operations research,
Lie groups, nonlinear analysis, turbulence--employing methods as diverse as differential and
algebraic topology, theory of sets, catastrophe theory, fibre bundles, singularity theory and functional
analysis. Papers presented at Case Western Reserve in April 1980 in honor of the Case centennial. 
Concludes with an essay on the nature and pedagogy of applied mathematics by Peter Hilton. LAS 


Applications, L. Catastrophe Theory and Applications. Ed: D.K. Sinha. Halsted Pr, 1981, xii + 158 
pp, $15.95. [ISBN: 0-470-27303-8] A coherent exposition of catastrophe theory, edited from notes 
taken at a 1979 seminar held at Jadavpur University, Calcutta. Major lectures by E.C. Zeeman are 
followed by various applications and a comprehensive bibliography. LAS 


Applications (Artificial Intelligence), T(16-17: 1, 2), P, L. Principles of Artificial Intelli- 
gence. Nils J. Nilsson. Tioga Pub, 1980, xv + 476 pp, $30. [ISBN: 0-935382-01-1] Eminently readable 
introduction to fundamental ideas of artificial intelligence. Organization is based on general com- 
putational concepts involving data structures and search control strategies, rather than by areas of 
application. Stresses role of production systems and predicate calculus theorem proving. Exposition
is simple (hence fun and easy to understand), but with excellent guidance to the literature for
more details. Not enough exercises. GHM 


Applications (Behavioral Science), T(15-17), L. Measurement Theory for the Behavioral Sciences. 
Edwin E. Ghiselli, John P. Campbell, Sheldon Zedeck. WH Freeman, 1981, xv + 494 pp, $30.75; $19.95 
(P). [ISBN: 0-7167-1048-X; 0-7167-1252-0] Extensive revision of first author’s Theory of Psychologi- 
cal Measurement. Appeals to broader range of backgrounds and interests. Assumes introductory 
psychology and descriptive statistics. Equal emphasis on technical features and assumptions of 
various models. Thorough presentation covers such topics as correlation, regression, reliability 
and validity of measurement, psychological scaling and testing. JRG


Applications (Biology), P. Biomathematics in 1980. Ed: Luigi Ricciardi, Alwyn Scott. Math. Stud., 
V. 58. Elsevier North-Holland, 1982, xiv + 297 pp, $55.75 (P). [ISBN: 0-444-86355-9] Papers from an
April 1980 international workshop at the University of Salerno, on a wide range of topics: neurosci- 
ence, population dynamics, self-organizing systems, cardiovascular modelling. LAS 


Applications (Cybernetics). Progress in Cybernetics and Systems Research, V. VIII-XI. Ed: Robert
Trappl, George J. Klir, Franz R. Pichler. Hemisphere Pub, 1982, $88 each. V. VIII: General Systems
Methodology, Mathematical Systems Theory, Fuzzy Sets. xxiii + 529 pp [ISBN: 0-89116-237-2]; V. IX:
Cybernetics in Biology and Medicine, Cybernetics in Cognition and Learning, Health Care Systems.
xii + 532 pp [ISBN: 0-89116-238-0]; V. X: Structure and Dynamics of Socioeconomic Systems, Cybernetics
in Organization and Management, Engineering Systems Methodology, Systems Research on Science and
Technology. xiii + 562 pp [ISBN: 0-89116-239-9]; V. XI: Data Base Design, International Information
Systems, Semiotic Systems, Artificial Intelligence, Cybernetics and Philosophy, Special Aspects. xv
+ 601 pp. [ISBN: 0-89116-240-2] These volumes represent part of the proceedings of the Fifth European
meeting on cybernetics and systems research that took place at the University of Vienna, April
8-11, 1980. Earlier volumes represent proceedings of other European conferences on the same topics. 
JAS 


Applications (Economics), T(15-17: 1), S, P, L. Economics for Mathematicians. J.W.S. Cassels. London
Math. Soc. Lect. Note Ser., No. 62. Cambridge U Pr, 1981, xi + 145 pp, $14.95 (P). [ISBN:
0-521-28614-X] Concise notes expressing classical economic theories in standard mathematical language
(linear algebra, multivariable calculus, convex sets). A good crash course in economic theories and 
terminology, but without the real-world examples of standard treatments. LAS 


Applications (Mechanics of Solids), T?(18: 1), P. Theory of Thin Elastic Shells. M. Dikmen. Pitman
Pub, 1982, xii + 364 pp, $59.95. [ISBN: 0-273-08431-3] The objects of study are deformable, simply
connected, 3-dimensional bodies which are "thin" in that they admit a tolerable 2-dimensional
analysis. This comprehensive monograph develops several mathematical theories treating shells. The 
level is advanced, but there are several mathematical supplements. PZ 


Applications (Physics), T(14-18: 1, 2), S, L. An Introduction to Tensor Calculus, Relativity and
Cosmology, Third Edition. D.F. Lawden. Wiley, 1982, xiii + 205 pp, $35.95. [ISBN: 0-471-10082-X]
First four chapters are devoted to the special theory of relativity including Lorentz transforma- 
tions, orthogonal transformations, Cartesian tensors, mechanics, electrodynamics. Remaining three 
chapters focus on general theory of relativity: general tensor calculus, Riemannian space, cosmol- 
ogy. Chapter exercises. References, bibliography, index. RJA 


Applications (Physics), P. Complex Geometry in Mathematical Physics. R.O. Wells, Jr. Pr U Mont- 
real, 1982, 168 pp, $14 (P). [ISBN: 2-7606-0559-8] An introduction to some of the work done in 
mathematical physics in the last ten years. The underlying machinery is complex twistor geometry 
growing out of the ideas of Roger Penrose; the main thrust is the description of solutions of cer- 
tain problems in field theory. JAS 


Applications (Social Science), P. Cultural Transmission and Evolution: A Quantitative Approach. 
L.L. Cavalli-Sforza, M.W. Feldman. Mono. in Population Biology, V. 16. Princeton U Pr, 1981, xiv + 
388 pp, $25; $10.50 (P). [ISBN: 0-691-08280-4; 0-691-08283-9] An extension of many of the quantita- 
tive methods developed in population genetics to cultural evolution of learned traits. This volume 
treats vertical (parent to child) and horizontal (older teachers to younger students) transmission 
of dichotomous traits, multiple state traits, and continuous traits. Mathematical techniques are 
limited to elementary probability and calculus--to avoid limiting the audience for the volume. LAS


Applications (Social Science), P, L**. Watergate Games: Strategies, Choices, Outcomes. Douglas
Muzzio. New York U Pr, 1982, xii + 205 pp, $17.95. [ISBN: 0-8147-5384-1] Attempt to "explain" the
major developments of the Watergate investigation (but not the infamous break-in itself) by casting 
them in the language of game theory with various goals, strategies and payoff matrices. Author con- 
tends that the major "players" in these Watergate "games" acted quite "rationally" when measured by 
the optimal strategies suggested by game theory. GHM 


Reviewers 


RJA: Richard J. Allen, St. Olaf; JNC: Judith N. Cederberg, St. Olaf; JD-B: John Dyer-Bennet, Carle- 
ton; JRG: Jennifer R. Galovich, St. Olaf; SG: Steven Galovich, Carleton; JG: Jack Goldfeather, 
Carleton; PH: Paul Humke, St. Olaf; RBK: Roger B. Kirchner, Carleton; RSK: Richard S. Kleber, St.
Olaf; JK: Joseph Konhauser, Macalester; GHM: George H. Mills, Carleton; RWN: Richard W. Nau,
Carleton; AO: Arnold Ostebee, St. Olaf; AWR: A. Wayne Roberts, Macalester; JS: John Schue, Macalester;
SS: Seymour Schuster, Carleton; JAS: J. Arthur Seebach, Jr., St. Olaf; KS: Kay Smith, St. Olaf; 
LAS: Lynn Arthur Steen, St. Olaf; TAV: Theodore A. Vessey, St. Olaf; MW: Martha Wallace, St. Olaf; 
FLW: Frank L. Wolf, Carleton; PZ: Paul Zorn, St. Olaf. 


Section Reports 
An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
the author. Papers presented under special sponsorship as part of joint meetings are so noted in 
parentheses. 


Ohio Section 


The fall meeting of the Ohio Section was held October 22-23, 1982 at Youngstown State Univer- 
sity with 130 individuals in attendance. The meeting was a joint meeting with SIAM. 


Invited Addresses: 


"A Sampling of Applied Combinatorial Problems," by David P. Roselle, Secretary of MAA, Dean of 
Research and Graduate Studies, Virginia Polytechnic Institute and State University. 

C10 Center Section January 


"The Process of Revenue Estimating for State Budget Purposes," by Samuel Nemer, Office of Budget 
and Management, State of Ohio. 

"Results of Testing College-intending High School Juniors," by Bert K. Waits, Director, Early
Mathematics Placement Testing Program, Ohio State University.

"What is Applied Mathematics? What is Applied Mathematics Education?" by James M. Greenberg, Ohio 
State University; formerly NSF Program Director for Applied Mathematics. 


Panel Discussions: 


Discussion of Ohio Section Survey of Recent Mathematical Sciences Graduates, by Tom Price, Univer- 
sity of Akron, Moderator. 

Report on Retraining for Computer Science, by Zaven Karian, Denison University, Moderator. 

"Nurturing Young Talent," by Robert Dieffenbach, Miami University. 


Contributed Papers: 


"Kind-of-5 Plus Sort-of-8 is Imprecisely Close to Sort-of-5 Plus Kind-of-8," by Steven Rodabaugh, 
Youngstown State University. 

"A Non-commutative Pettis Theorem," by Nazanin Azarnia, Miami University, Hamilton. 

"Mathematics and Art," by Gus Mavrigan, Youngstown State University. 

"Almost Periodic Functions on the Real Line," by David Colella, Youngstown State University. 

"Poles and Zeros of Matrices of Functions," by Bostwick Wyman, Ohio State University. 

"Studying Curves at La," by Kenneth Cummins, Kent State University. 

"A Metrization Theorem, or Can This Dog Be Saved?" by Albert Klein, Youngstown State University. 

"Independent Partitions and the Unimodality Conjecture," by Thomas A. Dowling, Ohio State Univer- 
sity. 

"Difference Sets," by T. Arasu, Ohio State University. 

"Association Schemes," by Dijen K. Ray-Chaudhuri, Ohio State University.


Indiana Section 


The fall meeting of the Indiana Section was held at Wabash College on Saturday, October 16, 
1982 with 50 members present. 


Student Paper: 


"The German Enigma," by Brian Wade and Tony Kirk, Rose-Hulman Institute of Technology.


Contributed Presentations:


"Some Applications of Logic to Algebra," by Leonard Lipshitz, Purdue University. 

"Baire Category: A Proof in Search of a Theorem," by William Dunham, Hanover College. 

"Some Comments on Articulation," by Billy Rhodes, Indiana University.

"Numerical Solution of Two-point Boundary-value Problems: A Survey," by James Daniel, University 
of Texas. 


North Central Section 


The North Central Section held its fall meeting at the University of Minnesota at Duluth on 
October 29-30, 1982. 


Invited Addresses: 


"Know-How in Mathematics," by Loren C. Larson, St. Olaf College. 
"Applications of Graph Theory to Linear Algebra," by Michael Doob, University of Manitoba. 


Short Presentations: 


* "Another Proof that A5 is Simple," by Joseph Gallian, University of Minnesota at Duluth.
"Growth Functions of Groups," by Max Benson, University of Minnesota at Duluth. 
* "Is Time Real, If You Have to Tell It (A New Concept of Time is Needed)," by Robert L. Jacobsen. 
"Fixed Points for Set-valued Maps," by K.L. Singh, University of Minnesota at Duluth. 
* "Integer-sided Triangles Whose Ratio of Base to Altitude is an Integer," by Gerald E. Bergum, 
South Dakota State University. 
"LOGO for Learning Mathematics," by Roger Kirchner, Carleton College. 
"Iterative Methods for Solving Systems of Equations," by Larry F. Bennett, South Dakota State 
University. 
* "An Illustrated History of Remainders," by Don Mattson and James Hatzenbuhler, Moorhead State 
University. 
* "Mathletes in Minnesota," by Wayne Roberts, Macalester College. 
"Porosity, A Nice Concept in Exceptional Sets," by Ted Vessey, St. Olaf College. 
references (films, supplies, books) for teachers of such courses.


Films
A Non-Euclidean Universe. (1978; 25 Min; Color). 
University Media. 


A Time for Change--The Calculus. (1975; 25 Min;
Color) University Media.


Accidental Nuclear War. (1976; 8 Min; Color)
Pictura Film.


Adventures in Perception. (1973; 22 Min; Color) 
BFA Educational Media. (Reviews: Amer. Math. 
Monthly 84 (1977) 582.) 


An Historical Introduction to Algebra. Modern
Film Rentals.


An Introduction to Feedback. (1960; 11 Min; 
Color) Encyclopedia Britannica Educational 
Corporation. 


Anatomy of Analogy. (25 Min; BW) Open
University.


Area and Pi. (1969; 10 Min; Color) Modern Film
Rentals.

Auto Insurance. (1976; 8 Min; Color) Pictura 
Film. 

Caroms. (1971; 9 1/2 Min; Color) International
Film Bureau. (Reviews: Amer. Math. Monthly 82
(1975) 417; Math Teacher 66 (1973) 51.)


Central Perspectivities. (1971; 13 1/2 Min; 
Color) International Film Bureau. (Reviews: 
Amer. Math. Monthly 82 (1975) 419; Math. Teacher 
(1972) 733.) 


Central Similarities. (1966; 10 Min; Color) 
International Film Bureau. (Reviews: Amer. Math. 
Monthly 82 (1975) 418; Math. Teacher (1972) 643- 
644.) 


Challenge in the Classroom. (1966; 55 Min; Color)
Modern Film Rentals.


Circle Circus. (1979; 7 Min; Color)
International Film Bureau.


Common Generation of Conics. (4 Min; Color).
Educational Solutions.


Complex Numbers. (1978; 25 Min; Color).
University Media.


Computer Perspective. (1972; 8 Min; Color)
Pyramid Films.


Congruent Triangles. (1978; 7 Min; Color). 
International Film Bureau. 


Conic Sections. (1968; 11 Min; Color) BFA
Educational Media.


Conics. (1979; 10 Min; Color) Ward's Modern
Learning Aids.


Conics. (1978; 25 Min; Color). University Media.


Constructing an Algorithm. (25 Min; BW) Open
University.


Cosmic Zoom. McGraw-Hill Films.


Curves of Constant Width. (1971; 16 Min; Color)
International Film Bureau. (Reviews: Amer. Math.
Monthly 78 (1971) 539; Math. Teacher 65 (1972)
234.)


Curves. (1968; 17 Min; Color) A.I.M.S.
(Reviews: Amer. Math. Monthly 83 (1976) 71-72;
Math. Teacher 64 (1971) 525.)


Woods. (1974; 22 Min; Color) Modern Film
Rentals.


Dance Squared. (1963; 4 Min; Color) International
Film Bureau. (Review: Math. Teacher 64 (1971)
627.)


Dihedral Kaleidoscopes. (1966; 13 Min; Color)
International Film Bureau. (Review: Math.
Teacher 66 (1973) 51.)


Dimension. (1970; 13 Min; Color) A.I.M.S.
(Reviews: Amer. Math. Monthly 83 (1976) 71-72;
Math. Teacher 64 (1971) 525.)


Donald in Mathmagicland. (1960; 26 Min; Color) 
Walt Disney Educational Media Company. 


Dr. Posin’s Giants: Isaac Newton. Indiana
University Audiovisual Library.


Dragon Fold...And Other Ways to Fill Space. 
(1979; 7 1/2 Min; Color) International Film 
Bureau. 


Equidecomposable Polygons. (10 1/2 Min; Color)
International Film Bureau. (Reviews: Amer. Math.
Monthly 82 (1975) 687-688; Math. Teacher 65 (1972)
734.)


Errors That Die. (25 Min; BW) Open University. 


Flatland. (1965; 12 Min; Color) McGraw-Hill 
Films. (Review: Math. Teacher 64 (1971) 44-45.) 


Functions and Graphs. (1978; 25 Min; Color)
University Media.


Geodesic Domes: Math Raises the Roof. (1979; 20
Min; Color) David Nulsen Enterprises.


Geometric Vectors--Addition. (1971; 17 Min; 
Color) International Film Bureau. (Review: 
Amer. Math. Monthly 82 (1975) 420.) 


Geometry: Inductive and Deductive Reasoning. 
(1962; 12 1/2 Min; Color) Coronet Films. 


Good for What? (25 Min; BW) Open University.


Gottingen and New York. (1966; 43 Min; Color)
Modern Film Rentals.


How Far is Around? (1979; 7 1/2 Min; Color) 
International Film Bureau. 


Inferential Statistics, Part I: Sampling and 
Estimation. (1977; 19 Min; Color) Media Guild. 


Inferential Statistics, Part II: Hypothesis
Testing. (1977; 25 Min; Color). Media Guild.


Infinity. (1972; 17 Min; Color) A.I.M.S.
(Review: Amer. Math. Monthly 83 (1976) 71-72.)


Inversion. (12 Min; Color) International Film
Bureau. (Reviews: Amer. Math. Monthly 83 (1976)
71; Math. Teacher (1972) 644.)


Isaac Newton. (1959; 13 1/2 Min; Color) Coronet
Films.


Isn’t That the Limit! (1980; 17 Min; Color)
David Nulsen Enterprises.


Isometries. (1967; 26 Min; Color) International
Film Bureau. (Review: Math. Teacher 66 (1973) 51-
52.)


Iteration and Convergence. (1978; 25 Min; Color)
University Media.


John von Neumann, A Documentary. (1966; 63 Min; 
BW) Modern Film Rentals. (Review: Amer. Math.
Monthly 75 (1968) 435.)


Journey to the Center of a Triangle. (1977; 8 1/2
Min; Color) International Film Bureau.


Let Us Teach Guessing. (1966; 61 Min; Color)
Modern Film Rentals. (Review: Amer. Math.
Monthly 75 (1968) 219.)


(1979; 14 Min; Silent; Color) International Film
Bureau.


Limit Surfaces and Space Filling Curves. (1979;
10 1/2 Min; Silent; Color) International Film
Bureau.


Limits. (25 Min; BW) Open University.


Linear Programming. (1969; 9 Min; Color)
Macmillan Films.


Look Again. (1970; 15 Min; Color) A.I.M.S. 
(Reviews: Amer. Math. Monthly 83 (1976) 71-72; 
Math. Teacher 64 (1971) 525.) 


Love Song. (1976; 11 Min; Color) Pictura Film.


Mathematical Curves. (1977; 10 Min; Color)
Churchill Films.


Mathematical Induction. (1960; 62 Min; Color)
Modern Film Rentals.


Mathematical Induction. (1978; 25 Min; Color)
University Media.


Mathematical Peep Show. (1961; 11 Min; Color) 
Encyclopedia Britannica Educational Corporation. 
(Review: Math. Teacher 64 (1971) 625.) 


Mathematician and the River. (1959; 19 Min;
Color) No distributor.


Mathematics of the Honeycomb. (1964; 13 Min; 
Color) Moody Institute of Science. (Review: 
Math. Teacher 64 (1971) 334.) 


Matrices. (9 Min; Color) Macmillan Films.


Matrioska. Indiana University Audiovisual
Library.


of (1970; 26
1/2 Min; Color) Coronet Films. (Review: Amer.
Math. Monthly 83 (1976) 495.)


Mean, Median, Mode. McGraw-Hill Films.


Modelling Drug Therapy. (1978; 25 Min; Color).
University Media.


Modelling Pollution. (1978; 25 Min; Color).
University Media.


Modelling Surveys. (1978; 25 Min; Color).
University Media.


Modmath. (14 1/2 Min; Color) International Film
Bureau.


Mr. Simplex Saves the Aspidistra. (1966; 33 Min;
Color) Modern Film Rentals.


Networks and Matrices. (1978; 25 Min; Color) 
University Media. 


New Worlds From Old. (1975; 25 Min; Color)
University Media.


Newton’s Equal Areas. (1968; 8 Min; Color) 
International Film Bureau. (Reviews: Amer. Math. 
Monthly 79 (1972) 1054; Math. Teacher 63 (1970) 
449.) 


Nim and Other Oriented Graph Games. (1966; 63
Min; BW) Modern Film Rentals.


Notes on a Triangle. International Film Bureau.
(Review: Math. Teacher 63 (1970) 363.)


Numbers Now and Then. (1975; 25 Min; Color)
University Media.


Orthogonal Projection. (1965; 13 Min; Color) 
International Film Bureau. (Reviews: Amer. Math. 
Monthly 82 (1975) 419-420; Math. Teacher (1972) 
643.) 


Paradox Box. Scientific American. 


Pits, Peaks, and Passes (Part 1). (1966; 48 Min;
Color) Modern Film Rentals.


Plateau’s Problem. A Film by Sr. Rita Ehrmann. 


Points of View: Perspective and Projection. 
(1975; 25 Min; Color) University Media. 


Possibly So, Pythagoras. (1973; 14 Min; Color). 
International Film Bureau. (Review: Math. Teacher 
64 (1971) 626.) 


Powers of Ten. (1978; 9 Min; Color) Pyramid 
Films. (Review: Math. Teacher (1979) 388.) 


Predicting at Random. (1966; 43 Min; Color) 
Modern Film Rentals. 


Probability. (12 Min; Color) McGraw-Hill Films.


Professor George Pólya and Students, Parts I and
II. (1972; 60 Min; Color) University Media.


Professor George Pólya Talks to Professor Maxim
Bruckheimer. (1972; 60 Min; Color) University
Media.


Projective Generation of Conics. (16 Min; Color) 
International Film Bureau. (Reviews: Amer. Math. 
Monthly 82 (1975) 538-539; Math. Teacher 66 (1973)
51.)


Quaternions: A Herald of Modern Algebra. (1975;
25 Min; Color) University Media.


Rational Numbers and the Square Root of 2. (1978;
25 Min; Color) University Media.


Regular Homotopies in the Plane: Part I: (1975; 1:
Min; Color); Part II: (1975; 18 1/2 Min; Color)
International Film Bureau. (Review: Amer. Math.
Monthly 85 (1978) 212.)


Root Two: Geometry or Arithmetic? (1975; 25 Min;
Color) University Media.


Sampling. (25 Min; BW) Open University.


Sets, Crows, and Infinity. (12 Min; Color) BFA
Educational Media.


Shaking the Foundations. (1975; 25 Min; Color)
University Media.


Shapes of the Future: Some Unsolved Problems in
Geometry: Part I: Two Dimensions (1975; 22 Min;
Color); Part II: Three Dimensions (1970; 21 Min;
Color) Modern Film Rentals. (Review: Amer.
Math. Monthly 79 (1972) 1052-1053.)


Sierpinski’s Curve Fills Space. (1979; 4 1/2 Min; 
Color) International Film Bureau. 


Similar Triangles. (1976; 7 1/2 Min; Color) 
International Film Bureau. 


Space Filling Curves. (1975; 25 1/2 Min; Color) 
International Film Bureau. (Review: Math. 
Teacher 69 (1976) 164-165.) 


Sphere Eversions. (1979; 7 1/2 Min; Silent;
Color) International Film Bureau.


Spheres. International Film Bureau.


Statistics At A Glance. (1972; 28 Min; Color) 
Media Guild. (Review: Amer. Math. Monthly 82 
(1975) 312.) 


Statistics and Probability I. (15 Min; BW) Open
University.


Statistics and Probability II. (25 Min; BW) Open
University.


Statistics and Probability III. (25 Min; BW)
Open University.


Symbols, Equations and the Computer. (1978; 25
Min; Color) University Media.


Symmetries of the Cube. (1971; 13 1/2 Min; Color)
International Film Bureau. (Review: Math.
Teacher 65 (1972) 733.)


The Algebra of the Unknown. (1975; 25 Min; Color)
University Media.


The Binomial Theorem. (1978; 25 Min; BW)
University Media.


The Butterfly Catastrophe. (1979; 4 1/2 Min; 
Silent; Color) International Film Bureau. 


The Delian Problem. (1975; 25 Min; Color)
University Media.


Indiana University Audiovisual Library.


The Geometry Euclid Didn’t Know. (1979; 16 Min;
Color) David Nulsen Enterprises. (Reviews: Amer.
Math. Monthly 86 (1979) 600; Math. Teacher (1979)
300.)


The Great Art--Solving Equations. (1975; 25 Min;
Color) University Media.


The Hypercube: Projections and Slicing. (1978; 12
Min; Color) Banchoff-Strauss Productions.


The Kakeya Problem. (1962; 60 Min; Color) Modern
Film Rentals.


The Majestic Clockwork. (1974; 52 Min; Color)
Time Life Multimedia.


The Marriage Theorem, Parts I & II. (1974; 46
Min. and 47 Min; BW) Modern Film Rentals.


The Music of the Spheres. (1974; 52 Min; Color) 
Time Life Multimedia. 


The Nature of Digital Computing. (25 Min; BW)
University Media.


The Perfection of Matter. McGraw-Hill Films.


(1963; 62 Min; BW) Modern Film Rentals.


The Seven Bridges of Königsberg. (1965; 4 Min;
Color) International Film Bureau.


The Structure of a Computer. (25 Min; BW) Open
University.


Topology. (1972; 9 Min; Color) Macmillan Films.


Topology. (1966; 30 Min; BW) Modern Film
Rentals. (Review: Amer. Math. Monthly 75 (1968)
790.)

Topology: Some Historical Concepts. (21 3/4 Min; 
Color) Richard Cline Film Productions. 


Trio for Three Angles. (8 Min; Color) 
International Film Bureau. 


Turning a Sphere Inside Out. (1976; 23 Min; 
Color) International Film Bureau. (Reviews: 
Amer. Math. Monthly 86 (1979) 511-512; Math. 
Teacher 70 (1977) 55.) 


View From the People Wall. (1964; 14 Min; Color) 
Encyclopedia Britannica Educational Corporation. 


Weather by the Numbers. Indiana University
Audiovisual Library.


(25 Min; BW) 


What is a Set? Parts I & II. (1967; 15 Min;
Color) Modern Film Rentals. (Review: Amer. Math.
Monthly 75 (1968) 324.)


Open University.


What is a Limit? 


(1966; 60 Min; BW) Modern Film Rentals. 

Zooms on Self-Similar Figures. (1979; 8 Min; 
Color) International Film Bureau. 

Addresses of Distributors: 

A.I.M.S., 626 Justin Avenue, Glendale, CA 91201. 


International Film Bureau, 332 South Michigan 
Avenue, Chicago, IL 60604. 


McGraw-Hill Films, McGraw-Hill Book Company, 330 
West 42nd Street, New York, NY 10036. 


Modern Film Rentals, 2323 New Hyde Park Road, New
Hyde Park, NY 11040.


Moody Institute of Science, 12000 East Washington 
Boulevard, Whittier, CA 90606. 


Indiana University Audiovisual Library, 
Bloomington, IN 47401. 


Ward’s Modern Learning Aids Division, P.O. Box
1712, Rochester, NY 14603.


University Media, 118 South Acacia, Box 881, 
Solana Beach, CA 92075. 


Pyramid Films, P.O. Box 1048, Santa Monica, CA
90406.


Educational Solutions, Inc., 80 Fifth Avenue, New 
York, NY 10011. 


Banchoff-Strauss Productions, Inc., P.O. Box 2430, 
East Side Station, Providence, RI 02906. 


Macmillan Films, Inc., 34 MacQuesten Parkway 
South, Mt. Vernon, NY 10550. 


Encyclopedia Britannica Educational Corporation, 
425 North Michigan Avenue, Chicago, IL 60611. 


David Nulsen Enterprises, 3211 Pico Boulevard,
Santa Monica, CA 90405.


The Media Guild, 118 South Acacia, Box 881,
Solana Beach, CA 92075.


Coronet Films, 65 East South Water Street, 
Chicago, IL 60601. 


Pictura Film Co., 111 8th Avenue, New York, NY 
10011. 


BFA Educational Media, 2211 Michigan Avenue, P.O. 
Box 1795, Santa Monica, CA 90406. 


Time-Life Multimedia, Time and Life Building, New
York, NY 10020.


Churchill Films, 662 North Robertson Boulevard,
Los Angeles, CA 90069.


Walt Disney Educational Media Company, 500 South 
Buena Vista Street, Burbank, CA 91521. 


Contemporary-McGraw Hill Films, 1221 Avenue of the
Americas, New York, NY 10020.


Suppliers of Classroom Aids 


The Math Group, Inc., 396 East 79th Street,
Minneapolis, MN 55420.
Unique puzzles and card games. Designed for
elementary school enrichment, many are flexible
enough to be of interest to adults as well.


Lano Company, 9001 Gross Road, Dexter, MI 48130.
(313-426-4860)
Mathematical visual aids (solids,
transparencies, graphing aids) together with
various games and probability models.


Math Shop, Inc., 5 Bridge Street, Watertown, MA 
02172. 
A full range of curriculum enrichment material 
for elementary, junior and senior high school 
mathematics. Includes several games and 
puzzles of use in introductory college 
mathematics. 


International Film Bureau, Inc., 332 South
Michigan Avenue, Chicago, IL 60604.
The major American distributor of mathematics
films. The current list includes nearly 50
films at the high school and college level,
each of which may be either purchased or
rented.


Yoder Instruments, East Palestine, OH 44413. 
(216-426-3612; 216-426-9580). 
Two unique geometric construction sets for 
plane and solid geometry. 


Cuisenaire Co. of America, Inc., 12 Church Street, 
New Rochelle, NY 10805. 
Sensory apparatus designed for elementary 
schools, some of which (e.g., geoboards, 
polyhedral structures) would be suitable to 
courses in mathematics appreciation. 


LaPine Scientific Company, Department B43, 6005 
South Knox Avenue, Chicago, IL 60629; 373 Chestnut 
Street, Norwood, NJ 07648; 920 Parker Street, 
Berkeley, CA 94710; Box 95, Postal Station U., 
Toronto, Canada M8Z 5M4. 
An extensive offering of models, teaching aids, 
games, and audio-visual materials for 
elementary, high school and beginning college 
mathematics. Mathematics catalogue exceeds 100 
pages. 


Geyer Instructional Aids Co., Inc., P.O. Box 7306,
Fort Wayne, IN 46807.
A large collection of models, games, books and
classroom aids for high school and elementary
college courses.


Creative Publications, 3977 East Bayshore Road, 
P.O. Box 10328, Palo Alto, CA 94303. (415-968- 
3977). 
An attractive 100-page catalogue of 
mathematical books, models, games, posters, 
construction sets, and puzzles for all grade 
levels. The premier source for mathematics 
enrichment material. 


Inquiry Audio-Visuals, 1754 West Farragut Avenue,
Chicago, IL 60640.
Filmstrips and transparencies for high school
algebra topics.


Educational Audio Visual, Inc., Pleasantville, NY
10570.
Transparencies and games for algebra, geometry,
calculus and statistics.


W.H. Freeman and Company, 660 Market Street, San 
Francisco, CA 94104. 
Offprints of Scientific American articles. 


David Nulsen Enterprises, 3211 Pico Boulevard, 
Santa Monica, CA 90405. 
Three 15-minute, 16mm color films ("Curves," 
"Dimension," "Look Again") for rent or 
purchase. 


Popular Science Audio-Visuals, Inc., 5235
Ravenswood Avenue, Chicago, IL 60640.
Filmstrips and overhead transparencies,
principally in high school mathematics
(geometry, algebra, elementary functions).


Time Life Films, 43 West 16th Street, New York, NY
10011. (212-691-2930).
Numerous BBC-produced 20-minute 16mm B & W
films on topics ranging from inequalities to
matrices.


Edmund Scientific Company, 1985 Edscorp Building, 
Barrington, NJ 08007. (609-547-3488). 
WELL-DISTRIBUTED MEASURABLE SETS


WALTER RUDIN 
Department of Mathematics, University of Wisconsin, Madison, WI 53706 


THEOREM. There is a measurable set $A \subset I = [0,1]$ such that
$$0 < m(A \cap V) < m(V)$$
for every nonempty open set $V \subset I$.


Proof. Let CTDP mean: Compact Totally Disconnected subset of $I$, having Positive measure.

Let $\{I_n\}$ be an enumeration of all segments in $I$ whose endpoints are rational. Construct
sequences $\{A_n\}$, $\{B_n\}$ of CTDP’s as follows:

Start with disjoint CTDP’s $A_1$ and $B_1$ in $I_1$.

Once $A_1, B_1, \ldots, A_{n-1}, B_{n-1}$ are chosen, their union $C_n$ is CTD, hence $I_n \setminus C_n$ contains a
nonempty segment $J$, and $J$ contains a pair $A_n$, $B_n$ of disjoint CTDP’s. Continue in this way, and
put
$$A = \bigcup_{n=1}^{\infty} A_n.$$

If $V \subset I$ is open and nonempty, then $I_n \subset V$ for some $n$, hence $A_n \subset V$ and $B_n \subset V$. Thus
$$0 < m(A_n) \le m(A \cap V) < m(A \cap V) + m(B_n) \le m(V);$$
the last inequality holds because $A$ and $B_n$ are disjoint. Done.
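
The disjointness of $A$ and $B_n$ used in the last step is not spelled out in the proof. One way to check it, offered here as a sketch and not as part of the original argument, is to compare each index $k$ with $n$, recalling that $C_n$ is the union of all sets chosen before stage $n$:

```latex
\begin{align*}
k < n &: \quad A_k \subset C_n,\ B_n \subset I_n \setminus C_n
        \ \Longrightarrow\ A_k \cap B_n = \emptyset, \\
k = n &: \quad A_n \cap B_n = \emptyset \ \text{by the choice of the pair}, \\
k > n &: \quad B_n \subset C_k,\ A_k \subset I_k \setminus C_k
        \ \Longrightarrow\ A_k \cap B_n = \emptyset .
\end{align*}
Hence $A \cap B_n = \emptyset$, and since $A \cap V$ and $B_n$ are disjoint
measurable subsets of $V$,
\[
m(A \cap V) + m(B_n) = m\bigl((A \cap V) \cup B_n\bigr) \le m(V).
\]
```

Measurability of $A$ itself is immediate, since $A$ is a countable union of compact sets.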


The point of publishing this is to show that the highly computational construction of such a set 
in [1] is much more complicated than necessary. 


Reference 


1. A. Simoson, An Archimedean paradox, this MONTHLY, 89 (1982) 114-125. 


ANY QUESTIONS? 


DESMOND MACHALE 
Department of Mathematics, University College, Cork, Ireland 


Recently, I attended a mathematical lecture given by a guest speaker where absolutely nobody, 
except possibly the speaker, had the remotest idea what was going on. Normally, one can absorb 
at least some of the preliminary definitions and follow, say, the first blackboard full of 
development of the theory, but on this occasion everyone was completely lost after the first 
definition. After the speaker had finished over an hour later to an enthusiastic round of applause, 
the chairman asked for questions, and, of course, there was a deathly and highly embarrassing 
silence. Then and there I resolved to put together a collection of universal questions for use in 
such situations. Such questions must sound sensible, but they are designed to cover up the total 
ignorance of the questioner rather than to elicit information from the speaker. The following is the 
list I came up with. 


1. Can you produce a series of counterexamples to show that if any of the conditions of the 
main theorem are dropped or weakened, then the theorem no longer holds? 
[The speaker can almost always do so—if not you may have presented him with a 
stronger theorem!] 
2. What inadequacies of the classical treatment of this subject are now becoming obvious? 
3. Can your results be unified and generalized by expressing them in the language of 
Category Theory? 
[The answer to this question is always NO!] 
4. Isn’t there a suggestion of Theorem 3 in an early paper of Gauss? 
[The answer to this question is almost always YES!] 
5. Isn’t the constant 4.15 in Theorem 2 suspiciously close to $4\pi/3$?
[This question can clearly be generalized for any constant $k$: “Isn’t $k$ suspiciously close
to $(p/q)\pi$ (for suitable integers $p$ and $q$)?”]
6. I’m not sure I understand the proof of Lemma 3—could you outline it for us again? 
[Lemma 3 should be just a little nontrivial, yet not more than one third of a blackboard 
in length. ] 
7. Are you familiar with a joint paper of Besovik and Bombialdi which might explain why the
converse of Theorem 5 is false without further assumptions?
[This is a dangerous question to ask unless you like living dangerously. The answer is
always “NO” unless the speaker is playing the same game as you are, because Besovik and
Bombialdi do not exist, and even if by some unfortunate chance they do exist, it is very
unlikely that they have written a joint paper. If the speaker calls your bluff and asks for
details and a reference, tell him the paper is available only in Albanian with Portuguese
summaries. Promise to mail him a copy but forget to do so.]
8. Why not get a graduate student to perform the horrendous calculations mentioned in
Theorem 1 in the case n = 4?
[The answer is always “I’ve a student doing just that at the moment.”]
9. Could you draw us a simple diagram to show what the situation looks like for n = 2? 
[Be careful that he hasn’t already done so.] 
10. What textbook would you recommend for someone who wishes to get students interested 
in this area? 
[The speaker has almost invariably written such a textbook himself and will be delighted 
you asked this question. If he hasn’t, then you can ask the next question.] 
11. When can we expect your definitive textbook on this subject? 
12. Why do you think there was such a flurry of activity in this area around the turn of the 
century and then nothing until your paper of 1979? 
[The true answer is that people in the period in between had more sense. ] 


In general, a good ploy is to stop halfway through a totally meaningless question you are 
asking and pretend you have suddenly seen the answer yourself. However, never, never ask 
13. What are the applications of these results? 


The speaker is probably embarrassed enough already! 


AN ALGORITHM FOR THE MINIMAL POLYNOMIAL OF A MATRIX 


BERNARD R. GELBAUM 
Department of Mathematics, State University of New York at Buffalo, NY 14214 


Using nothing beyond the Gram-Schmidt orthonormalization process, we present a direct
method for finding the minimal polynomial of a matrix. We will use the following notation:
$A = (a_{ij})$ is an $n \times n$ complex matrix;

$$m_\lambda(A) = \lambda^m + a_1 \lambda^{m-1} + \cdots + a_m,$$

the unique monic polynomial of least degree such that $m_A(A) = A^m + \sum_{k=1}^{m} a_k A^{m-k} = 0$, is the
minimal polynomial for $A$. $M_n$ will denote the vector space of $n \times n$ complex matrices. $M_n$ has
dimension $n^2$ and hence is isomorphic to $\mathbb{C}^{n^2}$, the vector space of $n^2$-tuples of complex numbers.
We will use the standard isomorphism $\phi: M_n \to \mathbb{C}^{n^2}$ which spreads the rows of a matrix out in a
long line:

$$\phi((a_{ij})) = (a_{11}, \ldots, a_{1n}, a_{21}, \ldots, a_{2n}, \ldots, a_{n1}, \ldots, a_{nn}).$$

We will also use the standard inner product for $\mathbb{C}^{n^2}$: $\langle u, v \rangle = \sum_i u_i \bar{v}_i$, where $u = (u_i)$, $v = (v_i)$
and $\bar{v}_i$ denotes complex conjugation.

Now apply the Gram-Schmidt orthonormalization to the powers of a matrix $A$, considered as
vectors in $\mathbb{C}^{n^2}$, to obtain the coefficients involved in a minimal linear combination of these powers
of $A$; i.e., the coefficients of the minimal polynomial. Let $A^0$ be the identity matrix $I$ and
recall that $\phi(A^p)$ is just $A^p$ considered as a spread-out vector in $\mathbb{C}^{n^2}$. Let $O_p$ be the $p$th
orthonormalized vector produced by the Gram-Schmidt process. Then $O_0 = \phi(I)/\sqrt{n}$ and $O_{p+1}$
is found inductively by computing

$$X_{p+1} = \phi(A^{p+1}) - \sum_{k=0}^{p} \langle \phi(A^{p+1}), O_k \rangle O_k$$

and then taking $O_{p+1} = X_{p+1}/\|X_{p+1}\|$. Since we are orthonormalizing in a vector space of finite
dimension $n^2$, we eventually have $X_m = 0$ for some $m \le n^2$. The Gram-Schmidt process stops
here. Gather up the various coefficients computed in the orthonormalization process and express
each $O_k$ as a linear combination of $\phi(A^k), \ldots, \phi(A), \phi(I)$ and thereby obtain a relation

$$\phi(A^m) + \sum_{k=1}^{m} a_k \phi(A^{m-k}) = 0.$$

Since the process stops at the first power of $A$ that is linearly dependent on the preceding (lower)
powers of $A$, it is clear that

$$m_\lambda(A) = \lambda^m + \sum_{k=1}^{m} a_k \lambda^{m-k}.$$
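
The procedure just described might be sketched in Python with NumPy as follows. This is only an illustration under stated assumptions: the function name `minimal_polynomial`, the tolerance `tol`, and the bookkeeping arrays are not from the text, and in floating-point arithmetic the test "X = 0" has to be replaced by "the norm of X is below a tolerance".

```python
import numpy as np


def minimal_polynomial(A, tol=1e-10):
    """Coefficients [1, a_1, ..., a_m] of lambda^m + a_1 lambda^(m-1) + ... + a_m.

    Gram-Schmidt is run on phi(I), phi(A), phi(A^2), ... and stops at the
    first power that lies (numerically) in the span of the lower ones.
    The name and the tolerance are illustrative assumptions.
    """
    A = np.asarray(A, dtype=complex)
    n = A.shape[0]
    basis = []   # orthonormal vectors O_0, O_1, ...
    coords = []  # coords[k][j] = coefficient of phi(A^j) in O_k
    power = np.eye(n, dtype=complex)  # current power A^p, starting with A^0 = I
    for p in range(n * n + 1):
        vec = power.reshape(-1)  # phi(A^p): the rows laid end to end
        x = vec.copy()           # becomes X_p after the projections below
        c = np.zeros(p + 1, dtype=complex)
        c[p] = 1.0               # invariant: x = sum_j c[j] * phi(A^j)
        for O, oc in zip(basis, coords):
            h = np.vdot(O, vec)  # <phi(A^p), O_k> = sum_i vec_i * conj(O_i)
            x -= h * O
            c[: len(oc)] -= h * oc
        norm = np.linalg.norm(x)
        if norm <= tol * max(1.0, np.linalg.norm(vec)):
            return c[::-1]       # X_p = 0: reorder so the leading 1 comes first
        basis.append(x / norm)
        coords.append(c / norm)
        power = power @ A
    raise RuntimeError("no dependence found; the tolerance is probably too small")
```

For the matrix with rows (2, 1, 0), (0, 2, 0), (0, 0, 3) the returned coefficients should be close to 1, -7, 16, -12, matching $(\lambda - 2)^2(\lambda - 3)$. The sketch tracks each $O_k$ in terms of the powers of $A$ as it goes, which is one way of "gathering up the coefficients"; other bookkeeping schemes would serve equally well.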

We close with a few brief observations. Although it is not hard to write a computer program
for carrying out the above procedure, it is apparently not easy to compare the efficiency of this
approach to that of using the formula $m_\lambda(A) = \det(A - \lambda I)/D_{n-1}(\lambda)$, where $D_{n-1}(\lambda)$ = GCD
of all the $(n-1)$st order minors of $\det(A - \lambda I)$. Finally, if $v$ is any vector in $\mathbb{C}^n$, the
Gram-Schmidt process applied as above to the vectors $v, Av, A^2 v, \ldots$ in $\mathbb{C}^n$ leads to the so-called
order $m_{\lambda,v}(A)$ of $v$ with respect to $A$, i.e., to the unique monic polynomial

$$\lambda^\mu + \sum_{k=1}^{\mu} b_k \lambda^{\mu-k}$$

of least degree $\mu$ such that $m_{A,v}(A)v = 0$. Clearly $\mu \le n$ and then by the Cayley-Hamilton
theorem $m \le n$.
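
The same idea applied to the vectors $v, Av, A^2 v, \ldots$ might look as follows. Again this is a hedged sketch: the name `order_polynomial` and the tolerance `tol` are assumptions made for illustration, and a numerical threshold stands in for exact linear dependence.

```python
import numpy as np


def order_polynomial(A, v, tol=1e-10):
    """Coefficients [1, b_1, ..., b_mu] of the monic polynomial of least
    degree mu that annihilates v, found by Gram-Schmidt on v, Av, A^2 v, ...
    The name and the tolerance are illustrative assumptions.
    """
    A = np.asarray(A, dtype=complex)
    w = np.asarray(v, dtype=complex)  # current vector A^p v, starting with p = 0
    basis = []   # orthonormal vectors built so far
    coords = []  # coords[k][j] = coefficient of A^j v in the k-th basis vector
    for p in range(A.shape[0] + 1):
        x = w.copy()
        c = np.zeros(p + 1, dtype=complex)
        c[p] = 1.0                    # invariant: x = sum_j c[j] * A^j v
        for O, oc in zip(basis, coords):
            h = np.vdot(O, w)         # <A^p v, O_k>
            x -= h * O
            c[: len(oc)] -= h * oc
        norm = np.linalg.norm(x)
        if norm <= tol * max(1.0, np.linalg.norm(w)):
            return c[::-1]            # leading coefficient 1 comes first
        basis.append(x / norm)
        coords.append(c / norm)
        w = A @ w
    raise RuntimeError("no dependence found; the tolerance is probably too small")
```

With $A$ the diagonal matrix with entries 2, 2, 3, the vector (1, 0, 0) should give coefficients close to 1, -2, while (1, 0, 1) should give values close to 1, -5, 6. The loop runs at most $n + 1$ times, reflecting the bound $\mu \le n$.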


THE TEACHING OF MATHEMATICS 
EDITED BY MARY R. WARDROP AND ROBERT F. WARDROP 


Material for this department should be sent to Professor Robert F. Wardrop, Department of Mathematics, Central 
Michigan University, Mount Pleasant, MI 48859. 


MATHEMATICS APPRECIATION COURSES: THE REPORT OF A CUPM PANEL* 


In 1977 the Committee on the Undergraduate Program in Mathematics (CUPM) established a 
panel to consider the content of those college and university courses that treat mathematics 
appreciation for students in the arts and humanities. Such courses are taken by a large number of 
students, frequently as their last formal contact with mathematics. Yet in most institutions they 
are given very low priority; they are frequently taught perfunctorily, without a clear set of 
objectives, by faculty who lack appropriate interest or credentials. Since these courses may play a 
major role in molding nonscientists’ opinions of mathematics and its role in society, CUPM 
decided that it should call attention to the importance of these courses and offer some suggestions 
on how they may be organized and taught effectively. 

This is the report, approved by CUPM, of the CUPM Panel on Mathematics Appreciation 
Courses. While the panel has many guidelines and recommendations to offer, it does not feel that 
a particular selection of topics or teaching strategy should be universally adopted for mathematics 
appreciation courses. A main goal of such courses is to get students to appreciate the significant 
role that mathematics plays in society, both past and present. All material presented in such 
courses should be well motivated and related to the role of mathematics in culture and technology. 


*Reprints of this article and the related bibliography contained in the Center Section may be purchased for $1.00 
from MAA Publications Department, 1529 Eighteenth Street, NW, Washington, D. C. 20036. 
I. Mathematics Appreciation: A Philosophy. The inclusion of a mathematics appreciation
course in the undergraduate curriculum is common in the nation’s colleges and universities. This 
trend is a direct result of an underlying belief, held by most mathematicians, that every 
well-educated person should be mathematically literate. Whether or not a mathematics course is 
required at a particular institution often depends, among other things, upon the extent to which 
this belief is shared by the general faculty. But the ultimate success of an “appreciation” course in 
mathematics should not depend upon mandatory enrollment. Rather, the value and importance of 
such a course should be directly attributable to the care and understanding with which it is 
conceived and taught. 

If as mathematicians we accept the notion that an educated person should know something 
about mathematics, then we must also accept the responsibility for conscientiously providing 
appropriate training. Students in the mathematical, physical, life, and some social sciences, and 
usually those in business, study mathematics as an inherent part of the undergraduate curriculum. 
It is not to these students, but rather to majors in the arts, in the humanities, and in certain social 
sciences that we must direct the mathematics appreciation course. At the outset we must take into 
account the background and interests of the prospective students. In many cases they have chosen 
their majors precisely because of a weak or unpleasant mathematical background; a college course 
that reinforces this negative experience with mathematics certainly cannot be called an appreciation
course.

At most institutions the great majority of students in a mathematics appreciation course will 
have studied less than four years of high school mathematics; moreover, many of these students 
will have had poor experience in mathematics, or will have had very weak courses. However, high 
school mathematics study is predominantly concerned with developing skills, and while such skills 
are of unquestioned importance, they are not necessarily prerequisite to (nor should teaching them 
be a part of) a mathematics appreciation course. 

Among all fundamental academic disciplines, mathematics is perhaps unique in the degree to 
which it is not understood (or is misunderstood) by students and even faculty from other areas of 
study. By taking an introductory course in chemistry, history, or psychology, a student is expected 
to gain an understanding of the general techniques, accomplishments, and goals of the discipline, 
and will learn to appreciate the work of the contemporary professional practitioner of the subject, 
sometimes even to the extent of reading the current journals. But an undergraduate major in 
mathematics is unlikely to have comparable insight into mathematics. Thus the challenge of a 
mathematics appreciation course is enormous.

The ultimate goal of such courses is defined by our umbrella title—to instill in the student an 
appreciation of mathematics. For this to occur, students must come to understand the historical 
and contemporary role of mathematics, and to place the discipline properly in the context of other 
human intellectual achievements. 

From the beginning of recorded history, mathematics has proved to be an indispensable aid to 
the empirical sciences; the great successes (and failures) of mathematical reasoning in the 
furtherance of human knowledge are tales begging to be told. Even the direct impact of 
mathematics on developments in virtually all disciplines is often not realized by the mathematical 
layman. 
But of course, to mathematicians, the subject is more than a tool of applied science, more than
a universal language useful for communication and research in other disciplines. Mathematicians 
see mathematics as an intellectually exciting discipline, one that holds great aesthetic appeal for its 
practitioners. This idea of mathematics as art is often difficult for nonmathematicians to 
appreciate, yet is fundamental to understanding the development and role of the subject. 

Finally, to appreciate mathematics fully, one must recognize it as a vital, on-going discipline, 
one that is practiced by a worldwide community of dedicated, sometimes passionate, and 
frequently brilliant scholars. It is a surprise to many that mathematics is a living, changing, 
developing subject. A true appreciation of mathematics requires some knowledge of contemporary 
developments. 

The entire mathematical community should be concerned with what view educated, informed 
people have of mathematics. Thus, courses in mathematics appreciation, while presumably 
benefiting primarily the students, may also have a long-term positive effect on the discipline itself. 
Obvious benefits will accrue if leaders in education, industry, business, and government have a 
better understanding of the nature, role, and importance of contemporary mathematics. 

It is a sad commentary on the attitudes of mathematicians that courses in mathematics 
appreciation frequently command pejorative (albeit informal) labels such as “Math for Poets.” 
Even the supposedly neutral title of “Math for Liberal Arts Students” may convey the connotation
of condescension. We must recall that liberal arts education, for a large percentage of the
college educated population, is a rigorous, disciplined encounter with the best elements of man’s 
history and culture. The major clientele of the mathematics appreciation courses are liberal arts 
students, and it is from their ranks that many of society’s leaders will emerge. 

The panel believes that it is better to describe courses of this type in terms of their objectives 
rather than their audience. Since the term “mathematics appreciation” brings to mind similar 
courses in other special fields (e.g., “music appreciation”) that generally carry positive connotations
with regard to their role in general undergraduate education, and since it conveys concisely
what such courses intend to accomplish, standing as a brief reminder of this intention to both 
teachers and students, the majority of the Panel prefers this title. 


II. Things to Stress in a Mathematics Appreciation Course. 


1. The relationship between mathematics and our cultural heritage. Students enrolled in mathematics
appreciation courses are generally more interested in, as well as more knowledgeable about,
the arts and humanities than the sciences; it is natural, therefore, to capitalize on these strengths 
by appropriate illustrations of the relations between mathematics and music, art, literature, 
history, and society. 


2. The role of mathematics in history and the role of history in mathematics. Although the 
influence of mathematics is often remote, mathematical discoveries have shaped our world in 
fundamental ways, altering the course of history as well as the way we live and work. Examples of 
these influences abound, and should form a major part of any mathematics appreciation course. 
Historical developments and the evolution of mathematical concepts should be properly emphasized.


3. The nature of contemporary mathematics. The mathematics known by most humanities 
students is ancient mathematics—the geometry of ancient Greece, and the algebra of the early 
renaissance; not surprisingly, such students have the impression that mathematics is dead. 
Showing them that it is in fact a vigorous, growing discipline with considerable influence in 
contemporary society is an important aspect of any course in mathematics appreciation. 


4. The recent emergence of several mathematical sciences. While the mathematics appreciation 
course should not be devoted solely to one “modern” area such as statistics, computer science, or 
operations research, it surely provides an opportunity to use these fields as illustrations of the 
panoply of contemporary mathematical science. 
5. The necessity of doing mathematics to learn mathematics. While some parts of the mathematics
appreciation course can and should be about mathematics, it is essential that some parts
actually engage the students in doing mathematics. Only in this way can they gain a realistic sense 
of the process and nature of mathematics. Of course it is vitally important that the instructor have 
appropriate respect for the students’ interests and abilities, and that exercises be selected so as to 
maintain rather than destroy their enthusiasm. 


6. The role of mathematics as a tool for problem solving. As the language of science and industry, 
mathematical models are the tool par excellence for solving problems. Students in mathematics 
appreciation courses should be exposed to contemporary mathematical modelling, to gain some 
appreciation both of its power and its limitations. 


7. The verbalization and reasoning necessary to understand symbolism. While symbols provide 
the mathematician and scientist with great power, they obscure the meaning of mathematics from 
the uninitiated. A great service the teacher of a mathematics appreciation course can provide is to 
enable students to overcome their fear of symbols, to learn to think through arguments apart from 
the traditional symbols in which they are expressed. 


8. The existence of a large body of interesting writing about mathematics. Students in mathematics
appreciation courses generally feel comfortable with assignments such as term papers, book
reports, and library research because they have become accustomed to these in their humanities 
courses. There is much good mathematics that can be learned in this way, and assignments can be 
arranged that utilize these familiar learning tools. 


III. Things to Avoid in a Mathematics Appreciation Course. 


1. Do not leave the assignment of an instructor in the mathematics appreciation course until 
the last minute and do not assign it on the sole basis of availability. The course requires more 
planning and preparation than almost any other mathematics course if it is to be successful. 


2. Do not simply allow the students to sit back and listen. It is important that they be involved 
actively. But this need not take the form of daily homework. In fact, drill type assignments should 
be avoided. The involvement could take the form of projects, papers, book reports, “discovering” 
mathematics in class, participating in class discussions. 


3. Do not overemphasize the history of mathematics. While the history of mathematics could 
and should be used to enliven the topics covered, a student who knows (and cares) nothing about 
a mathematical topic is not likely to be interested in its history. 


4. Do not stress remedial topics. While many of the students in a mathematics appreciation 
course may need remedial work, any such material that is covered must be presented as part of a 
topic that fits into the scope of the course as a whole. 


5. Do not make a fetish of rigor; in particular do not prove things that are self-evident to the 
students. For example, a rigorous presentation of the real numbers in which one proves the 
uniqueness of zero is entirely inappropriate in courses of this type. 


6. Do not cover topics you do not yourself find interesting and important. It is hard to fool 
these students, and if the teacher does not care, they will not see why they should. 


7. Do not be condescending. While the students in such courses may not be mathematically 
inclined, this does not mean that they are unintelligent. Many who take mathematics appreciation 
courses are outstanding, creative students, who have simply concentrated in the nonquantitative 
areas of the curriculum. The attitude of the teacher can help either to open or to close their minds 
to the material. 


8. Do not cover topics which you cannot relate in some way to ideas familiar to the students. 
Clock arithmetic and symbolic logic, for example, are of little value to mathematics appreciation
courses unless you can find applications the students can appreciate and understand. 


9. Do not make the course too easy. The material should not be way over the heads of the
students, but it should not be trivial either. 


10. Do not accept anyone else’s blueprint for a mathematics appreciation course. If you can 
communicate, in your own way, why you believe that mathematics is beautiful and important, the 
course will fulfill its purpose. 


IV. Approaches to Course Organization. There are nearly as many ways to teach a course in 
mathematics appreciation as there are teachers of these courses. While some strategies will work 
superbly in some contexts, none can be recommended for all; the teacher’s enthusiasm for what is 
being done as well as the appropriateness of the strategy for the students in the course are 
generally more important than the actual strategy adopted. Nevertheless, to encourage flexibility, 
we list below some approaches to teaching mathematics appreciation that have been effective in 
certain contexts. 


1. A sampler approach, featuring a variety of more or less independent topics. The advantage 
of this method is that it covers many areas without requiring a sustained continuity of interest; 
students who fall behind or simply fail to comprehend one topic always know that they have a 
chance for a fresh start in a few days. The disadvantage is that of all survey courses: not enough 
time spent on any one thing to ensure long-term learning. 


2. A single-thread approach, built around a common theme, for example, 2 X 2 matrices, or 
algorithms, or patterns of symmetry. Doing this takes careful planning, and runs the risk of 
alienating some of the class who find the thread incomprehensible. But it guarantees a solid 
example of the intellectual coherence that is so much a part of contemporary mathematics, that 
ideas arising in one context find applications in others, and that a common abstract structure 
underlies them all. 


3. A socratic approach, in which the instructor works carefully to let the students develop their 
own reasoning. This works well in small classes with a highly motivated instructor. While the 
content of such courses is hard to guarantee in advance, the achievement for students who are able 
to think for themselves, perceiving patterns where others simply see chaos, is a worthy objective 
for a course in mathematics appreciation. 


V. Examples of Topics. The topics available for courses in mathematics appreciation are as 
diverse as mathematical science itself. Standard textbooks offer a rather traditional assortment of 
topics: probability, graph theory, finite difference equations, computers, matrices, statistics, 
exponential growth, set theory, and logic seem to dominate. But there are numerous other themes 
that can be used for large or small components of courses. Here are a few of the many possible 
examples. 


1. Understanding how to use the buttons on a pocket calculator. It used to be that the number 
e was a complete mystery to those who had not studied calculus, and that “sin” had for 
humanities students more the connotation of theology than of mathematics. But no more. 
Virtually everyone has, or has seen, inexpensive hand calculators with buttons that perform 
operations involving exponential, trigonometric, and basic statistical functions. Teaching a class 
what these buttons do is an exciting new way to explore some traditional parts of classical 
mathematics. 


2. Tracing the modern descendants of classical mathematical ideas can illustrate the power of 
mathematics to influence the real world, as well as its remoteness from it. For example, classical 
Greek geometry involving conic sections led to models for planetary motion, and ultimately to the 
possibility of space flight. And probability, which had its origins in seventeenth-century discussions
about gambling, now dominates actuarial and fiscal policy, influencing government and
corporate budgets, thus affecting the level of interest, of unemployment, and the health of the 
entire economy. 


3. Connecting mathematics with Nobel Prizes. Nobel prizes are not given in mathematics (and 
the apocryphal reasons for this are quite amusing). But the work that led to Nobel prizes (e.g., of 
Libby, of Arrow, of Lederberg, and others) often has an intrinsically mathematical basis. The 
study of this scientific work provides an opportunity to show how mathematics is important in the 
most profound discoveries of modern science. 


4. Applying exponential growth models. The applications of traditional topics from elementary 
mathematics can often be explored more fully than has usually been the case. Exponential growth 
and decay models provide a striking example. Simple noncalculus approaches to models of growth 
provide a basis for discussion not only of interest and inflation, but also of such things as 
radiocarbon dating, cooling and heating of houses, population dynamics, strategies for controlling 
epidemics, and even detection of art forgeries. 


5. Relating traditional mathematics to new applications. A discussion of beginning probability 
theory can quickly lead to a treatment of the Hardy-Weinberg law of genetics and a calculation of 
the probability of winning state lotteries. An introductory treatment of statistics can quickly lead 
to a discussion of political polls, the design and interpretation of surveys and of related 
decision-making problems. Modern applications of elementary network theory include recent 
work in computational complexity and almost unbreakable codes. 


6. Introducing problems involving decision-making. There are many situations described by 
elementary mathematics in which one must choose “rationally” among possible options. One can 
discuss quantifying risk and uncertainty, fair division schemes, applications of network flows, 
pursuit and navigation problems, game theory and numerous other topics. Political science is full 
of unexpected but usually interesting topics, including Arrow’s theorem and its offshoot theories 
of voting, the recently discovered problems associated with apportionment of legislatures, and 
strategies of fair voting in multiple candidate elections. 


7. Exploring the powers and limitations of mathematical models. Each of the modern social 
sciences abounds with applications of elementary mathematics. All of the examples mentioned 
above, and many more, involve the use of mathematical models. Sometimes these models are quite 
accurate and sometimes they are not. But even in the latter case the model can help clarify one’s 
thinking about the underlying problem. An example of this use of mathematical modelling is the 
prisoner’s dilemma argument of game theory and its possible connection with U.S.-U.S.S.R. 
relations. 
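
As a small illustration of the noncalculus exponential decay model mentioned in topic 4, the following Python sketch estimates a radiocarbon age from the fraction of carbon-14 still present. The function name and the half-life of 5730 years are assumptions introduced for this illustration; neither comes from the report.

```python
import math

# Assumed half-life of carbon-14 in years (a commonly quoted value, not from the report).
C14_HALF_LIFE_YEARS = 5730.0


def radiocarbon_age(fraction_remaining, half_life=C14_HALF_LIFE_YEARS):
    """Age in years of a sample that retains the given fraction of its carbon-14.

    Solves N(t) = N0 * (1/2) ** (t / half_life) for t, using only logarithms.
    """
    if not 0.0 < fraction_remaining <= 1.0:
        raise ValueError("fraction_remaining must lie in (0, 1]")
    return half_life * math.log2(1.0 / fraction_remaining)
```

A class can tabulate `radiocarbon_age` for fractions such as 1/2, 1/4, and 1/8 to see that each halving adds one half-life to the estimated age.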


References 


Numerous references for mathematics appreciation courses are given in Section IX of this 
report. We list here only a few specific suggestions for the numbered topics mentioned above. 


1. See the handbooks for various calculators. 

2. Much of this is in standard textbooks. Morris Kline’s Mathematics in Western Culture and 
George Polya’s Mathematical Models in Science are helpful sources. 

3. Libby’s work is briefly discussed in several elementary texts on ordinary differential 
equations, e.g., in Differential Equations with Applications and Historical Notes by George F. 
Simmons. For some work of Arrow see Edward Bender, An Introduction to Mathematical 
Modeling, Wiley, New York, 1978. Lederberg published an article in this MONTHLY entitled 
“Hamilton circuits of convex trivalent polyhedra (up to 18 vertices)” in vol. 74, pp. 522-527. 

4. See any modern text on ordinary differential equations. A particularly good one is Martin 
Braun, Differential Equations and Their Applications, Springer-Verlag, New York, 1975. 

5. The Hardy-Weinberg law appears in several texts on finite mathematics, e.g., Applied Finite 
Mathematics by Anton and Kolman. How to Lie with Statistics by Darrell Huff and Irving Geis, 
Norton, 1954, and other texts contain situational mathematics which can be discussed according 
to the interests of the audience. For the two topics mentioned last, see Scientific American, Jan. 
1978, p. 96, and Aug. 1977. 

6. See Bender (as in 3); the articles by William Lucas in vol. 2 of the forthcoming Modules in 
Applied Mathematics (Springer-Verlag, New York); M. Balinski and H. P. Young, Proc. Nat. Acad. 
Sci. U.S.A., 77 (January 1980) 1-4; H. Hamburger, J. Math. Sociology, 3 (1973) 27-48; David 
Gale, UMAP Module 317, 1978; W. Stromquist, this MONTHLY 87 (1980) 640-644; and George 
Minty’s article in M. D. Thompson, ed., Discrete Mathematics and its Applications (Indiana 
University, Bloomington, 1977). 


VI. Two-Year Colleges. Many courses that ought to follow the “mathematics appreciation” 
philosophy are taught in two-year colleges. Innovative approaches and curriculum development by 
some two-year faculty are reflected by their texts and articles in this area. Although the preceding 
sections of this report are applicable to mathematics appreciation courses in all colleges, this 
separate section appears because of the special problems created in two-year colleges by generally 
heavy teaching loads, by staffing in some cases by faculty whose mathematical experiences are not 
sufficient to make them comfortable with the broad range of topics demanded by these courses, 
and by the regrettable frequency of administrative procedures which allow students needing 
remediation to enroll in these courses. The following suggestions may help to overcome these 
impediments to two-year college implementation of the goals of mathematics appreciation 
courses. 


1. When there is a choice among faculty members for assignment to the mathematics 
appreciation course, only those having a broad range of mathematical experiences and expressing 
interest in the course should teach it. Mathematics program administrators should provide extra 
guidance to faculty teaching this course for the first time. In the two-year college there will usually 
be one text used by all teachers, often supplemented by a reading list and/or other texts; a 
description of special uses of these materials, as well as sample course outlines, supplementary and 
classwork materials, and tests, will be helpful. Entrance and/or exit requirements may be matters 
of policy and should be explained. Lists of applicable resource material owned by the school 
should be provided to new teachers, along with knowledge of the school’s film rental policies. 


2. Special attention should be paid to the needs of this course by the library, audio-visual, and 
computer facilities. The mathematics program administrator should be sure these courses are 
adequately supported. 


3. Since mathematics appreciation courses, properly taught, take an enormous amount of 
preparation time, any load relief possible would be appropriate. In a suitable lecture room, the 
course can be effectively taught to “double” sections of 60-90 students if doing so would leave the 
teacher several hours more preparation time each week. (Such a load might be counted as two or 
three sections, corresponding to the grading load.) 


4. Sharing materials and ideas and perhaps team-teaching would be reasonable for mathe- 
matics appreciation courses. One teacher at a school might be most qualified for teaching, say, a 
computer unit, and might “rotate” across several sections. Many more “hand-out” materials seem 
to be necessary for mathematics appreciation courses than for traditional courses; these might be 
used by several teachers in a given term, or re-used in succeeding terms. Faculty teaching 
mathematics appreciation courses seem to enjoy sharing materials and methods. 


5. In-course remediation should be avoided. If students are enrolled who cannot handle 
elementary operations at the level needed for the work of the course, a “math lab” facility might 
be used to design and administer individual remediation programs. It cannot be over-emphasized 
that a mathematics appreciation course cannot fulfill its goal if it degenerates into the teaching of 
arithmetic computations or pre-algebra skills, or if it is limited to a topic such as “consumer 
mathematics.” 


6. A large proportion of students enter two-year colleges with little realistic expectation 
concerning majors. Many of these students have had poor experiences with mathematics and, if 
there is a general education mathematics requirement which may be satisfied by either a 
mathematics appreciation course or a pre-calculus/calculus course, they will often elect the 
mathematics appreciation course. Well into a successful term, the student may begin to think 
realistically about mathematics requirements of various university majors. Since most majors 
outside the humanities will necessitate at least some mathematics at a technical level rarely 
achieved in the typical two-year college mathematics appreciation course, an important service of 
this course can be to channel these students back into regular sequence mathematics courses. 
Without violating the spirit of a mathematics appreciation course, it is possible to include a 
topically organized unit requiring the review and use of elementary algebra and graphing 
techniques; this may give the student a successful experience in doing mathematics that serves as 
encouragement to return to regular sequence mathematics courses. (A linear programming unit, 
for example, requires the students to review or acquire facility with graphing and algebra 
techniques. Many of the topics suitable for a mathematics appreciation course can be handled in 
this way.) Students with this experience will frequently place higher in the sequence courses than 
they would have upon original enrollment, and will go on as solid, though late-blooming, students. 


VII. Films. Since students in mathematics appreciation courses frequently have little experi- 
ence in sustaining interest in regular mathematics lectures, it is usually appropriate in these 
courses to provide a variety of class activities. Films are a useful but under-utilized medium for 
mathematics instruction generally. They are especially useful for the mathematics appreciation 
course. 

We list in the Center Section a selection of films about mathematical subjects that are suitable 
for lay audiences. (Distributor addresses are listed at the end.) Further information on these and 
other films is available in the booklet Annotated Bibliography of Films and Videotapes for College 
Mathematics by David Schneider (M.A.A., 1980). 


VIII. Classroom Aids. Certain topics treated in mathematics appreciation courses are particu- 
larly amenable to demonstration with physical or geometric devices. Useful exhibits can often be 
seen at NCTM meetings. A list of major suppliers of mathematics classroom devices is given in 
the Center Section. 


IX. References. Since many of the topics that arise in mathematics appreciation courses occur 
nowhere else in the mathematics curriculum, it is quite important that instructors be aware of the 
expository literature of mathematics that treats its relations to science and society. Student term 
papers in courses on mathematics appreciation typically tax the instructor’s knowledge of the 
literature more than any other course in the mathematics curriculum. 

To aid instructors of mathematics appreciation courses, we list in the Center Section major 
references that would be suitable for background reading, and as sources for special projects. This 
list does not include textbooks, partly because we do not wish to endorse some books over others, 
and partly because texts go in and out of print much more rapidly than the reference classics. 


ANSWER TO “PHOTO” ON PAGE I1 


Julia Robinson, current president of the American Mathematical Society. Photo by George 
Bergman. 


A MOCK SYMPOSIUM FOR YOUR CALCULUS CLASS 


DENNIS WILDFOGEL 
Mathematics Program, Stockton State College, Pomona, NJ 08240 


Dr. Gregory Campe, Professor of Organic Chemistry at Frostbite Falls (Minn.) State College. 
Dr. Linda Gillespe, Professor of Environmental Science at the Susan B. Anthony University for 
Women. Dr. Daniel Hain, Professor of Surfing at Could’s Hole Oceanographic Institute. An 
unknowing colleague of mine at Stockton State College looked at the impressive roster of 
thirty-five participants in the First Stockton Symposium on Mathematical Modeling in the Life 
Sciences and exclaimed, “You got all those people to come here?” What he didn’t know was that 
the people on the roster were the students in my Calculus For Life Scientists II class, given phony 
titles and affiliations at fictitious institutes. 

Calculus For Life Scientists is a two-semester, introductory level calculus course aimed 
primarily at students majoring in environmental science, marine science, and biology. The course 
emphasizes mathematical modeling. At the “Symposium,” which acts as a capstone to the 
students’ year-long encounter with calculus, teams of students make presentations about a 
modeling problem on which they have been working for about two weeks. The benefits of doing 
these projects are numerous: (1) the projects give the students a holistic view of the use of higher 
mathematics in their own disciplines; (2) most projects require them to learn some techniques they 
would not ordinarily encounter in a one-year calculus course; (3) they learn about the benefits and 
difficulties of working on a team under a pressing deadline; (4) they experience the gamut of 
emotions associated with the problem-solving process, from the pleasure of initial idea generation, 
through the frustration of the intermediate stages, to the triumph of the completed project; (5) 
they gain experience in making oral presentations and written reports; (6) they see how the need 
for mathematical techniques grows out of realistic problems. 

I have conducted such a symposium four times now. Students consistently rate it as one of 
their most rewarding and useful academic experiences. Several other faculty members have 
successfully adapted the symposium idea for use in their own classes. 

Here’s how it works. About two and a half weeks before the end of the term, I divide the class 
of thirty to forty students into teams of about six students each. Each team is given a fairly 
difficult modeling problem and is responsible for making both an oral presentation and a written 
report about the results of their investigation of that problem. Each team meets during class time 
for the remainder of the term while I give hints and feedback on their work. The teams invariably 
find it necessary to schedule meetings outside of class time, too. I always offer enough suggestions 
so that each team will develop a good model by the time of the symposium. 

The key aspect of running a symposium like this is the selection of appropriate problems for 
the teams. Each problem must be sufficiently challenging to occupy a team of six students for two 
weeks and yet still be within reach of their capabilities. There are a few texts ([1]-[3]) which I have 
used repeatedly as sources for problems. In a few instances I have been able to adapt material 
from books or journals in other fields, and I have made up several problems on my own. 
Colleagues in the life sciences have provided a great deal of assistance. Below is a partial list of 
titles of problems I have used. 


Competition of two species for limited resources 

Biogeography: a species equilibrium model 

The maximum brightness of Venus 

Operating strategies for publicly owned commuter bus systems 

A box model for airshed pollutant capacity estimation 

The effects of natural selection on gene frequency 

Passive transport of chemical substances through a thick section of tissue 

Determination of the shape of subterranean deposits by use of gravitational anomalies 

Parasitic relationships which are not harmful to the host 

Ventilation systems and the accumulation of toxic pollutants 

A model for the clinical detection of diabetes 

A rare example of a closed ecological system 

The chemical kinetics of bimolecular reactions 

Excretion of a drug 

An optimal inventory policy model for an import wholesaler 


The composition of the student teams is important. I make sure that each student works on a 
project in an area of his or her own interest. I have tried dividing the class into groups 
homogeneous or heterogeneous according to ability. The homogeneous groups make it easier to 
tailor the difficulty of the problem to the appropriate level; however, it is difficult to keep the least 
capable groups from becoming discouraged. In heterogeneous groupings, the less successful 
students can learn from the better ones; however, too much of the burden then falls on the better 
students in each group. The best compromise I have found so far is to have two “all-star” teams 
of the best students, and to have heterogeneous teams composed of the remaining students. 

The symposium itself occupies the last three days of the term. Each team makes a half-hour 
presentation. The entire event, always attended by several other faculty members, is done up in 
tongue-in-cheek style. I circulate in advance to all mathematics and science faculty members a 
Roster of Participants and a Schedule of the Symposium, making it look as much like a real 
scholarly meeting as possible. (I always fool at least one new faculty member!) 

At the beginning of each session, one of my mathematics or science colleagues makes a 
humorous presentation, e.g., a “double talk” address that sounded like a commentary on the 
specific models to be presented, or a short discourse on the three-body problem while juggling 
three balls. Once Miss America visited, and another time I sang a song about calculus which I 
composed. The student teams get into the act, dressing up in suits and ties or lab coats, calling 
each other “Doctor” or “Professor,” and occasionally putting on brief skits. The merriment 
makes it enjoyable without detracting from the serious work to be done and serves to alleviate 
some of the anxiety the students have about making oral presentations. Afterwards, each student 
receives a copy of the “Proceedings of the Symposium” containing the written reports of the 
several teams and a few memorable photographs of the symposium. 

The symposium creates an opportunity for students to understand the way mathematics is 
actually used in their own fields and to understand both their own potential as users of 
mathematics and the difficulties inherent in the modeling process. It is thus a valuable experience 
in their mathematical education. 


References 


1. M. Braun, Differential Equations and Their Applications: An Introduction to Applied Mathematics, 2nd ed., 
Springer-Verlag, New York, 1978. 

2. D. P. Maki and M. Thompson, Mathematical Models and Applications: With Emphasis on the Social, Life, 
and Management Sciences, Prentice-Hall, Englewood Cliffs, N. J., 1973. 

3. E. O. Wilson and W. H. Bossert, A Primer of Population Biology, Sinauer Associates, Stamford, Conn., 1971. 


- Gilbert Harman, in a review in the Times Literary Supplement, 16 April 1982. 


E 2980. Proposed by Jordi Dou, Barcelona, Spain. 


Given the points $A_1$, $A_2$, $A_3$, $M$ and the line $s$, construct $P$, $Q$ such that $PQ$ is equal and 
parallel to $A_1M$ and $P_1Q_1 = P_2Q_2 = P_3Q_3$, where $P_i$, $Q_i$ are the intersections of $PA_i$, $QA_i$ with $s$. 
Describe the locus of the point $M$ for which the problem has a solution when $A_1$, $A_2$, $A_3$ and $s$ 
are known (fixed). 


E 2981. Proposed by Murray S. Klamkin, University of Alberta, Canada. 


If the three medians of a spherical triangle are equal, must the triangle be equilateral? Note that 
the sides of a (proper) spherical triangle are minor arcs of great circles and thus its perimeter is 
$< 2\pi$. 


E 2982. Proposed by R. L. Graham, Bell Telephone Laboratories, and D. E. Knuth, Stanford 
University. 
Let $\|y\|$ denote the distance from (real) $y$ to the nearest integer. Evaluate the double infinite 
sum 

$$\cdots + 4\|x/4\|^2 + 2\|x/2\|^2 + \|x\|^2 + \|2x\|^2/2 + \|4x\|^2/4 + \cdots.$$
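
For readers who want to experiment before attempting E 2982, here is a minimal Python sketch that evaluates symmetric partial sums of the series, assuming its general term is $\|2^k x\|^2 / 2^k$ with $k$ running over all integers. The function names and the truncation parameter `K` are illustrative choices, not part of the problem.

```python
def dist_to_nearest_int(y):
    """Distance from the real number y to the nearest integer."""
    return abs(y - round(y))


def partial_sum(x, K):
    """Sum of ||2**k * x||**2 / 2**k over k = -K, ..., K (floating point)."""
    return sum(dist_to_nearest_int(2.0**k * x) ** 2 / 2.0**k
               for k in range(-K, K + 1))
```

Tabulating `partial_sum(x, K)` for several values of `x` and increasing `K` is one way to form a conjecture about the value of the sum.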


E 2983. Proposed by E. Ehrhart, University of Strasbourg, France. 
Let ABC be an equilateral triangle of perimeter 3a. Calculate the area of the convex region 
consisting of all points P such that PA + PB + PC < 2a. 


E 2984. Proposed by Roger Maddux, Iowa State University. 


Characterize the logically valid sentences of the form 
$$Q_1x_1 \cdots Q_nx_n\, R\,x_1 \ldots x_n \;\rightarrow\; Q_{i_1}x_{i_1} \cdots Q_{i_n}x_{i_n}\, R\,x_1 \ldots x_n,$$
where each $Q_i$ is $\forall$ or $\exists$, $i_1 \ldots i_n$ is a permutation of $1 \ldots n$, and $R$ is a relation. 


E 2985. Proposed by P. J. Forrester, University of Melbourne and M. L. Glasser, Clarkson 
College. 


Show that for $0 < x < 1$ 

$$\prod_{k\ \mathrm{odd}} \left[\frac{1 + x^k}{1 - x^k}\right]^{\phi(k)/k} = \exp\left[2/(x^{-1} - x)\right]$$
where $\phi(k)$ is Euler’s totient. 
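
A numerical check of E 2985 can be sketched in Python as follows, assuming the identity reads: the product over odd $k$ of $[(1+x^k)/(1-x^k)]^{\phi(k)/k}$ equals $\exp[2/(x^{-1}-x)]$. The helper names and the cutoff `terms` are hypothetical, and a numerical comparison is of course not a proof.

```python
import math


def totient(k):
    """Euler's totient of k, computed by trial division."""
    result, m, p = k, k, 2
    while p * p <= m:
        if m % p == 0:
            while m % p == 0:
                m //= p
            result -= result // p
        p += 1
    if m > 1:
        result -= result // m
    return result


def lhs(x, terms=200):
    """Partial product over the first `terms` odd k, accumulated in logarithms."""
    log_total = 0.0
    for k in range(1, 2 * terms, 2):
        log_total += totient(k) / k * math.log((1 + x**k) / (1 - x**k))
    return math.exp(log_total)


def rhs(x):
    """Right-hand side exp(2 / (1/x - x))."""
    return math.exp(2.0 / (1.0 / x - x))
```

For `x` well inside the interval (0, 1) the factors approach 1 geometrically, so a few hundred odd `k` already give agreement to many digits.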


SOLUTIONS OF ELEMENTARY PROBLEMS 
Equivalence of Two Triangles 


E 2727* [1978, 594; 1979, 790]. Proposed by David P. Robbins, Hamilton College. 


Two triangles $A_1A_2A_3$ and $B_1B_2B_3$ in $\mathbb{R}^3$ are equivalent if there exist three different parallel 
lines $p_1, p_2, p_3$ and rigid motions $\sigma, \tau$ such that $\sigma(A_i)$ and $\tau(B_i)$ lie on $p_i$ ($i = 1, 2, 3$). 
Find necessary and sufficient conditions for equivalence of two triangles. 


Solution by M. J. Pelling, Balliol College, Oxford. Let the triangles $A_1A_2A_3$, $B_1B_2B_3$ have 
respective sides $a, b, c$ and $d, e, f$. We shall show that $A_1A_2A_3$, $B_1B_2B_3$ are equivalent unless 
either 

$$\sqrt{a^2 - d^2},\ \sqrt{b^2 - e^2},\ \sqrt{c^2 - f^2} \quad\text{or}\quad \sqrt{d^2 - a^2},\ \sqrt{e^2 - b^2},\ \sqrt{f^2 - c^2}$$

form the sides of a proper triangle. From consideration of the normal section with sides $\alpha, \beta, \gamma$ of 
the triangular prism bounded by parallel lines $p_1, p_2, p_3$ and from the published solution [1979, 
790] we see the triangles are equivalent if and only if $\alpha, \beta, \gamma > 0$ can be found forming the sides of 
a proper triangle, with $\alpha^2 < \min(a^2, d^2)$ etc. and 

$$\begin{aligned}
\pm\sqrt{a^2 - \alpha^2} \pm \sqrt{b^2 - \beta^2} \pm \sqrt{c^2 - \gamma^2} &= 0,\\
\pm\sqrt{d^2 - \alpha^2} \pm \sqrt{e^2 - \beta^2} \pm \sqrt{f^2 - \gamma^2} &= 0.
\end{aligned} \tag{1}$$

Now the surface $\pm\sqrt{x} \pm \sqrt{y} \pm \sqrt{z} = 0$ is the right circular cone 

$$\Gamma:\ x^2 + y^2 + z^2 - 2yz - 2zx - 2xy = 0$$
with axis in direction $(1, 1, 1)$, vertex $O$ and semiangle $\tan^{-1}(1/\sqrt{2})$. One half-cone $\Gamma_+$ lies within the 
octant $x, y, z > 0$ and the other half-cone $\Gamma_-$ lies in the octant $x, y, z < 0$. If $\operatorname{Int}\Gamma_+$ denotes the 
open region bounded by $\Gamma_+$, then our key observation is that $(x, y, z) \in \operatorname{Int}\Gamma_+$ if and only if 
$\sqrt{x}, \sqrt{y}, \sqrt{z}$ are the sides of a proper triangle. Denote by $\Gamma(p, q, r)$ the cone $\Gamma$ translated to have 
vertex at $(p, q, r)$ with $\Gamma_\pm(p, q, r)$ analogously defined. 

The conditions on $\alpha, \beta, \gamma$ are satisfied if and only if 

$$\Gamma_-(a^2, b^2, c^2) \cap \Gamma_-(d^2, e^2, f^2) \cap \operatorname{Int}\Gamma_+ \ne \emptyset \tag{2}$$

and we shall show that if $\Gamma_-(a^2, b^2, c^2)$ and $\Gamma_-(d^2, e^2, f^2)$ intersect at all, then already they have 
nonempty intersection within $\operatorname{Int}\Gamma_+$. 

Let $\pi$ be the plane through $O$ normal to the axis of $\Gamma$ and let the axes of $\Gamma_-(a^2, b^2, c^2)$, 
$\Gamma_-(d^2, e^2, f^2)$ meet $\pi$ in $O_1$, $O_2$ respectively. These two half-cones cut $\pi$ in respective circles $C_1$ 
centre $O_1$ radius $R_1 > 0$ and $C_2$ centre $O_2$ radius $R_2 > 0$, and 

$$\Gamma_-(a^2, b^2, c^2) \cap \Gamma_-(d^2, e^2, f^2) \ne \emptyset$$
if and only if $|R_1 - R_2| < O_1O_2$. The parallel plane $\pi_t$ at distance $t\sqrt{2}$ from $\pi$ (on the $\Gamma_+$ side) 
cuts $\Gamma_+$, $\Gamma_-(a^2, b^2, c^2)$, $\Gamma_-(d^2, e^2, f^2)$ in 3 circles which project orthogonally onto $\pi$ into 3 
circles $C(t)$, $C_1(t)$, $C_2(t)$ with respective centres $O$, $O_1$, $O_2$ and radii $t$, $R_1 - t$, $R_2 - t$. The part of 
the curve $\Gamma_-(a^2, b^2, c^2) \cap \Gamma_-(d^2, e^2, f^2)$ on the $\Gamma_+$ side of $\pi$ projects into the locus of 
$C_1(t) \cap C_2(t)$ as $t$ varies between 0 and $\min(R_1, R_2)$, and this locus is the portion $H$ of the 
hyperbola in $\pi$ defined by $O_1P - O_2P = R_1 - R_2$ (assuming $R_1 > R_2$) for $O_1P < R_1$, $O_2P < R_2$. 
We have to show there is some value of $t$ for which one of the points $P_t$ of $C_1(t) \cap C_2(t)$ exists 
and is inside the circle $C(t)$, i.e., $OP_t < t$. 

Now $O \in \operatorname{Int} C_1 \cap \operatorname{Int} C_2$ since $(a^2, b^2, c^2), (d^2, e^2, f^2) \in \operatorname{Int}\Gamma_+$ (which implies $C_1(0) \cap 
C_2(0)$ contains 2 points and $H$ is not empty) and as $t$ increases from 0 there is a first value $t^* > 0$ 
for which $C(t^*)$ just touches $C_1(t^*)$ or $C_2(t^*)$, say $C_1(t^*)$ at $Q$. Let $H^*$ denote the portion of $H$ 
defined by $C_1(t) \cap C_2(t)$ for $t \ge t^*$: then $H^*$ must meet $O_1Q$ internally (draw a diagram, noting 
that $C(t^*)$ lies inside both $C_1(t^*)$ and $C_2(t^*)$) in a point $P_t$ with $t > t^*$ and 

$$OP_t = |OQ - (t - t^*)| = |2t^* - t| < t.$$
It follows that (2) is equivalent simply to 
$$\Gamma_-(a^2, b^2, c^2) \cap \Gamma_-(d^2, e^2, f^2) \ne \emptyset \tag{3}$$

and this holds if and only if $(a^2, b^2, c^2) \notin \operatorname{Int}\Gamma_+(d^2, e^2, f^2)$ and $(d^2, e^2, f^2) \notin 
\operatorname{Int}\Gamma_+(a^2, b^2, c^2)$, which is equivalent to 

$$(a^2 - d^2,\ b^2 - e^2,\ c^2 - f^2),\ (d^2 - a^2,\ e^2 - b^2,\ f^2 - c^2) \notin \operatorname{Int}\Gamma_+.$$
So $A_1A_2A_3$, $B_1B_2B_3$ are equivalent unless either 

$$\sqrt{a^2 - d^2},\ \sqrt{b^2 - e^2},\ \sqrt{c^2 - f^2} \quad\text{or}\quad \sqrt{d^2 - a^2},\ \sqrt{e^2 - b^2},\ \sqrt{f^2 - c^2}$$

form the sides of a proper triangle. 
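
The final criterion is easy to test mechanically. The Python sketch below assumes the sides are paired as in the solution ($a$ with $d$, $b$ with $e$, $c$ with $f$) and uses the fact that $\sqrt{x}, \sqrt{y}, \sqrt{z}$ form a proper triangle exactly when $x, y, z > 0$ and $2(xy + yz + zx) - x^2 - y^2 - z^2 > 0$. The function names are illustrative, not the solver's.

```python
def in_open_cone(x, y, z):
    """True when sqrt(x), sqrt(y), sqrt(z) are the sides of a proper triangle.

    This is membership of (x, y, z) in the open region Int Gamma_+ of the solution.
    """
    inside = 2 * (x * y + y * z + z * x) - (x * x + y * y + z * z) > 0
    return min(x, y, z) > 0 and inside


def are_equivalent(sides_a, sides_b):
    """Apply the criterion: equivalent unless the squared-side differences,
    taken in either order, lie in the open cone."""
    a, b, c = sides_a
    d, e, f = sides_b
    diff = (a * a - d * d, b * b - e * e, c * c - f * f)
    negated = tuple(-t for t in diff)
    return not (in_open_cone(*diff) or in_open_cone(*negated))
```

For example, a triangle is equivalent to itself, while under this criterion the triangles with sides (6, 8, 10) and (3, 4, 5) are not equivalent, since the squared differences 27, 48, 75 lie in the open cone.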
A Third Degree Congruence with Nine Solutions 


E 2763 [1979, 223]. Proposed by Lorraine L. Foster, California State University, Northridge. 


Let $f(n) = n^3 + 396n^2 - 111n + 38$. Prove that the congruence $f(n) \equiv 0 \pmod{3^a}$ has 
precisely 9 solutions (mod $3^a$) for all integers $a \ge 5$. 


Solution by L. E. Mattics, University of South Alabama. 


THEOREM. If $s \ge 3$, then there is a $d$ such that $f(d) \equiv 0 \pmod{3^{s+2}}$, and if $f(b) \equiv 0 \pmod{3^{s+2}}$ then 
$b \equiv d \pmod{3^s}$. 


Indeed, $f(n) \equiv 0 \pmod{243}$ if and only if $n \equiv 19 \pmod{27}$ and we proceed by induction 
assuming the theorem to be true for some $s \ge 3$. Then 

$$f(d + k3^s) \equiv f(d) + k3^s f'(d) \pmod{3^{s+3}}$$
and since $d \equiv 1 \pmod 9$, we can show that $3^2 \mid f'(d)$, but $3^3 \nmid f'(d)$. Hence the congruence 
$$f(d) + k3^s f'(d) \equiv 0 \pmod{3^{s+3}}$$

can be solved uniquely for $k$ modulo 3. If $f(c) \equiv 0 \pmod{3^{s+3}}$, then by induction hypothesis 
$c = d + t \cdot 3^s$, so $t \equiv k \pmod 3$, and $c \equiv d + k3^s \pmod{3^{s+1}}$. The induction is complete and the 
theorem is proved. 

Now if $s \ge 5$ and $f(b) \equiv 0 \pmod{3^s}$, then 

$$f(b + m3^{s-2} + r3^{s-1}) \equiv f(b) + m3^{s-2}f'(b) + r3^{s-1}f'(b + m \cdot 3^{s-2}),$$

and this quantity is $\equiv 0 \pmod{3^s}$ because $3^2 \mid f'(b)$ and $3 \mid f'(b + m3^{s-2})$. Therefore, since $b$ is 
unique modulo $3^{s-2}$, there are precisely nine solutions modulo $3^s$ to the congruence $f(n) \equiv 0 \pmod{3^s}$. 
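
The count can be confirmed by brute force for small exponents. The following Python sketch assumes the cubic reads $f(n) = n^3 + 396n^2 - 111n + 38$ and simply enumerates residues; the function names are illustrative, and the enumeration is a check on the statement, not a substitute for the induction.

```python
def f(n):
    """The cubic of E 2763 (as read from the problem statement)."""
    return n**3 + 396 * n**2 - 111 * n + 38


def solutions_mod(a):
    """All residues n modulo 3**a with f(n) congruent to 0 modulo 3**a."""
    modulus = 3**a
    return [n for n in range(modulus) if f(n) % modulus == 0]
```

Calling `solutions_mod(a)` for `a = 5, 6, 7, 8` lists the solutions at each level, and every one of them is congruent to 19 modulo 27, in line with the base case of the theorem.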


Also solved by H. L. Abbott (Canada), J. Ampe (student), L. Kuipers (Switzerland), Jinku Lee (student), N. 
Miku (Netherlands), A. M. Rockett, R. 8. Stacy, P. J. Zwier, and the proposer. 


A Product of Powers of an Integer 


E 2852 [1980, 672; 1982, 213]. Proposed by Jan Mycielski, University of Colorado. 


For any positive integer $n$ let $w(n)$ be the product of all positive integers $k$ such that $n$ is a 
power of $k$. Prove that 

$$\prod_{n=1}^{\infty} \left(\frac{w(n)}{n+1}\right)^{1/n} = 1.$$

Remark by William A. Newcomb, Lawrence Livermore National Laboratory. Neither of the 
published solutions to this problem [1982, 213] is entirely correct as it stands. They are vitiated, 
respectively, by the divergence of the series $\sum n^{-1}\log(n + 1)$, and of the corresponding infinite 
product, $\prod (n + 1)^{1/n} = \prod k^{1/(k-1)}$. 

A corrected version of Solution I may be given as follows. Let $w(n)$ be rewritten as $n\rho(n)$, so 
that $\rho(n)$ denotes the product of all integers $k < n$ such that $n$ is a power of $k$. One form of the 
proposition to be established is 

$$\sum_{n=1}^{\infty} \frac{1}{n}\log\rho(n) = \sum_{n=1}^{\infty} \frac{1}{n}\log\frac{n+1}{n}. \tag{1}$$

But now, by essentially the same calculation as in the original Solution I (with $\rho$ replacing $w$, and 
the index $a$ now running from 2 to $\infty$), one easily obtains 

$$\sum_{n=1}^{\infty} \frac{1}{n}\log\rho(n) = \sum_{n=1}^{\infty} \frac{1}{n(n+1)}\log(n+1),$$

from which (1) will follow directly by an application of the Abel transformation. In this version of 
the argument, all the relevant series are convergent (and indeed, absolutely convergent). 

A similar correction may be made to Solution II. 

Edmund A. Butler, New Carrollton, Maryland, and the proposer also noticed the omissions. 
The proposer’s solution is as follows. 

$$\begin{aligned}
\prod_{n=1}^{\infty} \left(\frac{w(n)}{n}\right)^{1/n}
&= \prod_{\substack{n\ (n \text{ is a power}\\ \text{with exponent} > 1)}}\ \prod_{\substack{k^s = n\\ s > 1}} k^{1/n}\\
&= \prod_{k=2}^{\infty} \prod_{s=2}^{\infty} k^{1/k^s}\\
&= \prod_{k=2}^{\infty} k^{1/(k(k-1))}\\
&= \prod_{n=1}^{\infty} \left(\frac{n+1}{n}\right)^{1/n}.
\end{aligned} \tag{2}$$

All the products in this calculation converge absolutely; the last step is also correct because 
$k^{1/(k-1)} \to 1$. Now we can divide by $\prod \left(\frac{n+1}{n}\right)^{1/n}$ and get the desired result. 


(This and related identities are stated without proofs in J. Mycielski “Quelques identités de la 
théorie analytique des nombres,” Coll. Math., 4 (1956) 68-70.) 
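
The summation by parts (Abel transformation) behind the last step can be checked on finite sums. Taking logarithms, the step compares the sum of $\log k/(k(k-1))$ over $k \ge 2$ with the sum of $\frac{1}{n}\log\frac{n+1}{n}$ over $n \ge 1$; for finite $N$ the two differ exactly by the boundary term $\log(N+1)/N$, which tends to 0. The Python sketch below, with hypothetical function names, verifies this finite identity numerically.

```python
import math


def power_side(N):
    """Sum of log(k) / (k (k - 1)) for k = 2, ..., N."""
    return sum(math.log(k) / (k * (k - 1)) for k in range(2, N + 1))


def ratio_side(N):
    """Sum of (1/n) log((n + 1)/n) for n = 1, ..., N."""
    return sum(math.log((n + 1) / n) / n for n in range(1, N + 1))


def abel_remainder(N):
    """Boundary term log(N + 1) / N left over by summation by parts."""
    return math.log(N + 1) / N
```

For any `N`, `ratio_side(N)` should agree with `power_side(N) + abel_remainder(N)` up to rounding error.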


Subsets $F$ of Integers for Which $F \subset F + F$ 


E 2899 [1981, 538]. Proposed by Gerard Letac, Université Paul-Sabatier, Toulouse, France. 


Let $n$ be a positive integer. Consider a set $F$ of distinct integers such that every element of $F$ is 
the sum of two elements of $F$ and the sum of $k$ elements of $F$ is never 0 for $k = 1, 2, \ldots, n$. Prove 
that $F$ must have at least $2n + 2$ elements. (*) Find all $F$ with $|F| = 2n + 2$. 


Solution by Allen J. Schwenk, University of Waterloo. Partition F into a positive and a negative
subset, $F = F^{+} \cup F^{-}$. Define a directed graph on $F^{+}$ by inserting an arc from x to y if y = x + z
for some $z \in F$. (Note that x and y are positive but z may be positive or negative.) By hypothesis,
every vertex has positive indegree. Such a graph must contain a directed cycle, say $p_1 p_2 \cdots p_k p_1$.
Now we have

$$
\begin{aligned}
p_1 &= p_k + z_k \\
    &= p_{k-1} + z_{k-1} + z_k \\
    &= p_{k-2} + z_{k-2} + z_{k-1} + z_k \\
    &\;\;\vdots \\
    &= p_1 + z_1 + z_2 + \cdots + z_k.
\end{aligned}
$$

Subtracting $p_1$ yields 0 as a sum of k elements, so $k \ge n + 1$. Thus $|F^{+}| \ge n + 1$. A similar
argument applies to $F^{-}$.

Now if |F| = 2n + 2, the directed graphs on $F^{+}$ and $F^{-}$ must each be precisely a directed
(n + 1)-cycle, since any additional arc would create a shorter directed cycle and hence a shorter 0
sum. Consequently, each element in the set is realized as a sum in a unique way, and if y = x + z
with x, y, and z all having the same sign, then x = z, for otherwise y has indegree 2, violating the
structure of $F^{+}$ or $F^{-}$. Observe that if F is any set with the prescribed properties, then so is mF for
any nonzero multiplier m.

To construct such sets for each positive n, first select $p_0$ to be a positive integer such that
$p_0 < 2^{n+1} - 1$ and $\gcd(p_0, 2^{n+1} - 1) = 1$. Define $p_k$ by

$$p_k \equiv 2^{k} p_0 \pmod{2^{n+1} - 1} \quad \text{and} \quad 0 < p_k < 2^{n+1} - 1.$$

Let $F^{+} = \{p_0, p_1, \ldots, p_n\}$ and set $F^{-} = \{q_k \mid q_k = p_k + 1 - 2^{n+1}\}$. To see that each element can
be realized as a sum, observe that if $p_k < 2^{n}$, then $p_{k+1} = 2p_k$ and

$$
\begin{aligned}
q_{k+1} &= p_{k+1} - (2^{n+1} - 1) \\
        &= 2p_k - (2^{n+1} - 1) \\
        &= q_k + p_k.
\end{aligned}
$$

On the other hand, if $p_k \ge 2^{n}$, then $p_{k+1} = 2p_k - (2^{n+1} - 1) = p_k + q_k$ and

$$
\begin{aligned}
q_{k+1} &= p_{k+1} - (2^{n+1} - 1) \\
        &= 2p_k - 2(2^{n+1} - 1) \\
        &= 2q_k.
\end{aligned}
$$

To show that there are no forbidden zero sums, suppose $x_1 + x_2 + \cdots + x_k = 0$ with each
$x_i \in F$. Read this equation modulo $2^{n+1} - 1$. After multiplying by $p_0^{-1}$, each term is a power $2^{i}$
with $0 \le i \le n$. Clearly at least n + 1 terms must be present to produce 0 mod $2^{n+1} - 1$. It
remains to decide whether the sets just constructed and their multiples are the only possible sets of
minimum order 2n + 2.
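
The construction above can be checked by brute force for small n. The following is a minimal sketch, not part of the published solution: the function names are hypothetical, and it assumes that "the sum of k elements" allows repeated elements (as the cycle argument, where x = z is possible, suggests).

```python
from itertools import combinations_with_replacement
from math import gcd


def construct(n, p0):
    """Return (F_plus, F_minus) built from the seed p0 as in the text."""
    modulus = 2 ** (n + 1) - 1
    if not (0 < p0 < modulus and gcd(p0, modulus) == 1):
        raise ValueError("need 0 < p0 < 2^(n+1) - 1 and gcd(p0, 2^(n+1) - 1) = 1")
    # p_k is 2^k * p0 reduced modulo 2^(n+1) - 1, for k = 0, ..., n
    f_plus = [(2 ** k * p0) % modulus for k in range(n + 1)]
    # q_k = p_k + 1 - 2^(n+1)
    f_minus = [p - modulus for p in f_plus]
    return f_plus, f_minus


def every_element_is_a_sum(F):
    """True if each y in F equals x + z with x, z in F (x = z allowed)."""
    members = set(F)
    return all(any(y - x in members for x in members) for y in members)


def has_short_zero_sum(F, n):
    """True if some k elements of F (repetition allowed), 1 <= k <= n, sum to 0."""
    return any(
        sum(combo) == 0
        for k in range(1, n + 1)
        for combo in combinations_with_replacement(F, k)
    )


def check(n, p0):
    """Verify both defining properties and the size 2n + 2 for one seed."""
    f_plus, f_minus = construct(n, p0)
    F = f_plus + f_minus
    return (
        len(set(F)) == 2 * n + 2
        and every_element_is_a_sum(F)
        and not has_short_zero_sum(F, n)
    )


if __name__ == "__main__":
    print(construct(4, 3))
    print(all(check(4, p0) for p0 in range(1, 31)))
```

The exhaustive search over k-term sums grows quickly with n, so this is only meant as a sanity check for the first few cases.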

Let $F_n = F_n^{+} \cup F_n^{-}$ denote a set with the prescribed properties and order 2n + 2. By scaling
with a suitable multiplier, we may presume that the element of smallest magnitude is +1. Let
$p_0 = 1$. Since $p_1 = p_0 + z_0$, we cannot have $z_0 < 0$, for then $|z_0| < 1$; so $z_0 = 1$ and $p_1$ must be 2.

Now if n = 1, $p_0 = p_1 + z_1$ implies $-1 \in F_1^{-}$, so $q_0 = -1$. Then $q_1 = q_0 + w_0$ implies $q_1 = -2$.
Thus $F_1 = m\{1, 2, -1, -2\}$ is essentially unique.

For $n \ge 2$, we may continue with $p_2 = p_1 + z_1$. If $z_1 < 0$, then $|z_1| < 1$ is a contradiction.
Thus $z_1 = p_1$ and $p_2 = 4$. If n = 2, $p_0 = p_2 + z_2$ implies $-3 \in F_2^{-}$. Since $F_2^{+}$ and one element of
$F_2^{-}$ are all integral, the cyclic generation of the rest of $F_2^{-}$ assures that it too is integral. (Our earlier
rescaling could have destroyed the hypothesized integrality.) Since −1 and −2 would produce
two-term 0 sums, −3 is the element of least magnitude in $F_2^{-}$. Thus $q_0 = -3$ implies that
$q_1 = 2q_0 = -6$ and then $q_2 = q_1 + w_1$. If $w_1 = q_1$ we cannot cycle back to $q_0 = -3$ in one more
step. Thus $w_1 \in F_2^{+}$. The only choice that avoids contradiction is $q_2 = -5$. Thus $F_2 =
m\{1, 2, 4, -3, -6, -5\}$ is unique.

For n = 3, we have $p_3 = p_2 + z_2$. If $z_2 < 0$, then $-2 < z_2 < -1$ since $p_0 = p_3 + z_3$ with
$z_3 < 0$. Now $z_2 = z_3 = -1.5$ implies that $q_0 = -1.5$ and so $q_1 = -3$, yielding the sum 0 = 1 + 2
− 3. Thus $z_2$ and $z_3$ are distinct elements of $F_3^{-}$ which sum to −3. Consequently, $|z_2 - z_3| < 1$,
which requires that these elements not be consecutive on the cycle in $F_3^{-}$. Thus $\{z_2, z_3\} = \{q_0, q_2\}$.
Now $q_1$ must be $2q_0$ and $q_2 = q_1 + w_1$ with $w_1 \in F_3^{+}$. The only legal choice is $w_1 = 1$, so
$q_2 = 2q_0 + 1$ with $-2 < q_2 < -1$. Next, $q_3 = q_2 + w_2$ cannot tolerate a positive $w_2$, so $q_3 = 2q_2
= 4q_0 + 2$. Finally, $q_0 = q_3 + w_3$ requires $-3q_0 - 2 \in F_3^{+}$. Selecting $-3q_0 - 2 = 1$ or 4 yields
$q_0$ outside the required range $-1.5 < q_0 < -1$. If we select $-3q_0 - 2 = 2$ we find $F_3^{-} = \{-\tfrac{4}{3},
-\tfrac{8}{3}, -\tfrac{5}{3}, -\tfrac{10}{3}\}$. But now $p_3 = \tfrac{8}{3}$ or $\tfrac{7}{3}$, with each choice producing a forbidden 0 sum. Thus,
$-3q_0 - 2 = p_3 = 4 + z_2$. Either choice $z_2 = q_0$ or $q_2$ yields $q_0 < -2$, so the initial assumption
$z_2 < 0$ cannot occur. Thus $F_3^{+}$ must be {1, 2, 4, 8} and $-7 \in F_3^{-}$. As with $F_2$, we observe that $F_3$
must be integral with $q_0 = -7$ being the element of smallest magnitude. Then $q_1 = -14$,
$q_2 = -13$, and $q_3 = -11$ are all forced. Consequently, $F_3 = m\{1, 2, 4, 8, -7, -14, -13, -11\}$ is
unique.

For each $n \ge 4$ there will be more than one possible set. For example, with n = 4 we have
already constructed:

$$
\begin{aligned}
F_4 &= m\{1, 2, 4, 8, 16, -30, -29, -27, -23, -15\}, \\
F_4 &= m\{3, 6, 12, 24, 17, -28, -25, -19, -7, -14\}, \\
F_4 &= m\{5, 10, 20, 9, 18, -26, -21, -11, -22, -13\}.
\end{aligned}
$$

It is tempting to conjecture that these will be the only possible sets. For $n \ge 4$ this question
remains open.
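
As an illustration of why n = 4 is the first case with several sets, one can enumerate the sets produced by the construction for every admissible seed and identify each set with its negative (multiplier m = −1). This is a sketch under those assumptions only; `minimal_sets` is a hypothetical helper, and it says nothing about sets that do not arise from the construction.

```python
from math import gcd


def minimal_sets(n):
    """List the constructed sets for a given n, one per class under F -> -F."""
    modulus = 2 ** (n + 1) - 1
    seen = set()
    classes = []
    for p0 in range(1, modulus):
        if gcd(p0, modulus) != 1:
            continue  # the construction requires p0 coprime to 2^(n+1) - 1
        f_plus = [(2 ** k * p0) % modulus for k in range(n + 1)]
        F = frozenset(f_plus) | frozenset(p - modulus for p in f_plus)
        if F in seen:
            continue  # same set reached from another seed, or its negative
        seen.add(F)
        seen.add(frozenset(-x for x in F))
        classes.append(sorted(F))
    return classes


if __name__ == "__main__":
    for n in range(1, 5):
        print(n, len(minimal_sets(n)))
```

For n = 1, 2, 3 this should report a single class, and for n = 4 three classes, matching the three sets listed above.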


Also solved by F. Gerrish (England).

Also solved by M. W. Ecker, J. E. Morrill. 

Also solved by R. Breusch, D. M. Broline, M. W. Ecker, M. Gerstell, E. Hensley, R. Hill (United 
Kingdom), M. S. Jacobson, S. B. Maurer, E. Trost (Switzerland), G. Shulman, S. Wagon.

Also solved by Robert Patenaude.

Also solved by N. E. Frangos, V. Pambuccian (Rumania). 

Also solved by H. N. Gupta, R. J. Tomkins, P. Zwier. 

Also solved by D. K. Ross (Australia). 

Also solved by S. A. Heuer, D. B. Shapiro. 

Also solved by Elgin Johnston. 

Also solved by R. Bournas. 

Also solved by S. A. Belbas, K. L. Bernstein, J. Binz (Switzerland), J. N. Boyd, N. Brown (student, 
Australia), P. S. Bruckman, M. P. Eisner, H. Eves, H. E. Fettis, C. Gengzhe, P. Hajnal (Hungary), H. 
Kappus (Germany), E. L. Koh (Saudi Arabia), D. Logothetti, O. P. Lossers (Netherlands), H. J. 
Ludwig, H. M. Marston, N. Martin (Canada), W. W. Meyer, M. W. Meyerson, W. Moser (Canada), 
W. A. Newcomb, J. Papes-F. (Spain), J. P. Robertson, I. A. Sakmar (Canada), B. Shelupsky, N. S. 
Shou (Hong Kong), J. Suck (Germany), H. R. van der Vaart, an anonymous solver, and the proposer.

E 2950. Also solved by N. Franceschine, III, O. P. Lossers (Netherlands), L. E. Mattics, N. Miku (Netherlands),
D. E. Orr, and the proposer. 
6274. Also solved by V. Pambuccian (Rumania). 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor David Borwein, Department of 
Mathematics, University of Western Ontario, London, Ontario, Canada N6A 5B9, by May 31, 1983. The solver’s 
full post-office address should be on each sheet. 


6415. Proposed by Raphael M. Robinson, University of California, Berkeley. 


If $P(x_1, x_2, \ldots, x_n)$ is a polynomial with integer coefficients, find m and a homogeneous
polynomial $Q(x_1, x_2, \ldots, x_m)$ with integer coefficients, so that P = 0 has a solution in rationals if
and only if Q = 0 has a solution in integers not all zero.


6416. Proposed by Sanford S. Miller, SUNY, Brockport, NY. 


Let $p(z) = 1 + p_1 z + \cdots$ be regular in the unit disc U, and let a be a real number such that
a < 1. Show that if

$$\operatorname{Re}\left[ z^{2} p''(z) + 3 z p'(z) - a p^{2}(z) \right] > -1$$

for $z \in U$, then $\operatorname{Re} p(z) > 0$.


6417. Proposed by R. P. Boas, Northwestern University. 


It is a well-known theorem of Pólya that although the cosine transform of a decreasing positive
function is not necessarily positive, the cosine transform of a positive convex function is positive.
[Math. Z., 2 (1918) 352-382 (378) = Collected Papers, vol. 2, M.I.T. Press, 1974, p. 192.] Show
that there is a positive number σ, $0 < \sigma < \tfrac{1}{2}$, such that the cosine transform of a positive function φ
with $t^{\mu} \varphi(t)$ decreasing is positive provided that $\mu \ge \sigma$ but not necessarily when $\mu < \sigma$. (Note that,
although φ(t) necessarily becomes infinite at t = 0, φ(t) might be 0 for $t > t_0$.)


6418. Proposed by George Benke, Georgetown University. 


Prove that 
2 
2nN-1 Sin oN _y 
=| Sin 24 oo 
" 2N 


6419. Proposed by Wim Vervaat, Cornell University. 


Let f be a real-valued Lebesgue measurable function on R. 

(a) Suppose that f(qx) = q f(x) for all $q \in \mathbb{Z}$, $x \in \mathbb{R}$. Prove that f(x) = bx almost everywhere
on $\mathbb{R}$ for some real b.

(b) If f(x + y) = f(x) + f(y) for all $x, y \in \mathbb{R}$, then there is a real b such that f(x) = bx for
all $x \in \mathbb{R}$. Many proofs are known for this result. Prove it this time as a corollary of (a).


6420. Proposed by Barry Powell, Kirkland, WA. 


For any odd prime p, let $g_p$ and $G_p$ be the least positive and greatest negative primitive root of
p respectively. Prove that for any positive integer M, there exist infinitely many primes p for which
$M < g_p < (p - M)$ and $(-p + M) < G_p < -M$.


SOLUTIONS OF ADVANCED PROBLEMS
Collinear Algebraic Units 


6341 [1981, 294]. Proposed by J. C. Lagarias, Bell Laboratories, and H. W. Lenstra, University 
of Amsterdam. 


Let L be a line in the complex plane C passing through two relatively prime algebraic integers.
(Two algebraic integers α, β are relatively prime if there exist algebraic integers γ, δ such that
αδ + βγ = 1.) Prove that there are infinitely many algebraic units lying on the line L.


Solution by David G. Cantor, University of California, Los Angeles. Suppose α, β are relatively
prime algebraic integers. It is enough to prove there exist infinitely many $t \in \mathbb{R}$ such that α + βt
is a unit. Let K be the (real) field obtained by adjoining the real and imaginary parts of α and β to
$\mathbb{Q}$. Let L be a real finite extension of K containing at least two Archimedean valuations. Let $v_1$ be
the valuation of L obtained by the given embedding of L into $\mathbb{R}$. Let $v_2$ be another Archimedean
valuation of L. It suffices to prove there exist infinitely many algebraic $t \in \mathbb{R}$ such that
$|\alpha + \beta t|_v = 1$ for all non-Archimedean valuations v. Since no restriction is placed on the value of t
under $v_2$, we can even specify that t lie in an arbitrarily small interval (r, s). The result is now an
extremely special case of Theorem 5.1.1 of D. G. Cantor, On an extension of the definition of
transfinite diameter and some applications, J. für die reine und angewandte Mathematik, 316 (1980)
160-207.


Also solved by the proposers. 
Units of Chains 


6342 [1981, 294]. Proposed by Richard Stanley, Massachusetts Institute of Technology. 


Let f(n) be the number of nonisomorphic n-element partially ordered sets P which do not
contain three pairwise incomparable elements. (Equivalently, P is a union of two chains.) Let

$$F(x) = 1 + \sum_{n \ge 1} f(n) x^{n} = 1 + x + 2x^{2} + 4x^{3} + 10x^{4} + \cdots.$$

Show that

$$F(x) = \frac{4}{2 - 2x + \sqrt{1 - 4x} + \sqrt{1 - 4x^{2}}}.$$


Solution by the proposer. Let g(n) be the number of such P that do not contain a proper subset
Q such that x < y for all $x \in Q$ and $y \notin Q$. Let

$$G(x) = \sum_{n \ge 1} g(n) x^{n} = x + x^{2} + x^{3} + 3x^{4} + \cdots.$$

It is easily seen that

$$F(x) = (1 - G(x))^{-1}.$$

A poset (= partially ordered set) P enumerated by g(n) can be written uniquely (apart from
order) as the disjoint union of two chains $C_1$ and $C_2$. Let $C_1$ be given by $x_1 < x_2 < \cdots < x_r$ and
$C_2$ by $y_1 < y_2 < \cdots < y_s$. Define $a_1, a_2, \ldots, a_n$ and $b_1, b_2, \ldots, b_n$ by

$$
a_i = \begin{cases} 1, & \text{if for some } x_j,\ i = |\{x \in P : x \le x_j\}|, \\ -1, & \text{otherwise,} \end{cases}
\qquad
b_i = \begin{cases} -1, & \text{if for some } y_j,\ i = |\{y \in P : y \le y_j\}|, \\ 1, & \text{otherwise.} \end{cases}
$$

Then $a_1, a_2, \ldots, a_n$ and $b_1, b_2, \ldots, b_n$ satisfy $a_1 = 1$, $b_1 = -1$,

$$a_1 + \cdots + a_i \ne b_1 + \cdots + b_i \quad \text{for } 1 \le i < n,$$

and

$$a_1 + \cdots + a_n = b_1 + \cdots + b_n.$$

The number of such pairs of sequences is known to be the Catalan number

$$C_{n-1} = \frac{1}{n} \binom{2n-2}{n-1}.$$

Each such pair yields a poset P enumerated by g(n), and two sequences yield the same poset P if
and only if $a_i = -b_i$ for all i. The number of sequences $a_1, a_2, \ldots, a_n$ such that $a_1 = 1$,

$$a_1 + \cdots + a_i \ne -(a_1 + \cdots + a_i) \quad \text{for } 1 \le i < n,$$

and

$$a_1 + \cdots + a_n = -(a_1 + \cdots + a_n)$$

is 0 if n is odd and is $C_{n/2 - 1}$ if n is even. Hence

$$g(2n) = \tfrac{1}{2} (C_{2n-1} + C_{n-1}), \qquad g(2n+1) = \tfrac{1}{2} C_{2n}.$$

The proof now follows straightforwardly from the well-known generating function

$$\sum_{n \ge 0} C_n x^{n+1} = \frac{1 - \sqrt{1 - 4x}}{2}.$$
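
The relation between g(n) and F(x) can be sanity-checked on the first few coefficients. The sketch below is only an illustration with hypothetical function names: it takes the formulas for g(2n) and g(2n+1) in terms of Catalan numbers, treats g(1) = 1 as a special case read off from the series for G(x) (an assumption, since the halved formula is not integral there), and expands F(x) = 1/(1 − G(x)) through the recurrence f(m) = g(1) f(m−1) + ... + g(m) f(0).

```python
from math import comb


def catalan(m):
    """Catalan number C_m = binom(2m, m) / (m + 1)."""
    return comb(2 * m, m) // (m + 1)


def g(n):
    """Number of indecomposable posets of size n, per the formulas in the solution."""
    if n == 1:
        return 1  # assumption: taken from G(x) = x + x^2 + ..., not from the halved formula
    if n % 2 == 0:
        return (catalan(n - 1) + catalan(n // 2 - 1)) // 2
    return catalan(n - 1) // 2


def f_coefficients(limit):
    """Coefficients f(0..limit) of F(x) = 1 / (1 - G(x))."""
    f = [1]
    for m in range(1, limit + 1):
        f.append(sum(g(j) * f[m - j] for j in range(1, m + 1)))
    return f


if __name__ == "__main__":
    print([g(n) for n in range(1, 5)])
    print(f_coefficients(4))
```

The two printed lists should agree with the leading terms of G(x) and F(x) quoted in the problem.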


A Translation Invariant Measure 


6348 [1981, 446]. Proposed by James Henle and Stanley Wagon, Smith College. 


Prove or disprove: If μ is a countably additive translation invariant measure from the Borel
subsets of $\mathbb{R}$ to [0, ∞], then μ is invariant under all isometries of $\mathbb{R}$; i.e., it is also invariant under
reflections.


Solution by the proposers. The assertion is false, although it is true under the additional
hypothesis μ([0, 1]) < ∞, for then μ is a scalar multiple of Lebesgue measure. To get a
counterexample it suffices to construct a Borel set $C \subseteq [0, 1]$ such that for any countable set, $\{a_n\}$, of reals,

$$-C \not\subseteq \bigcup \{C + a_n : n < \omega\}.$$

For then we may define a σ-ideal I of Borel sets by:

$$I = \Big\{ A \subseteq \mathbb{R} : A \text{ is Borel and } A \subseteq \bigcup_{n=1}^{\infty} (C + a_n) \text{ for some sequence of reals } a_n \Big\}.$$

Then let μ be the measure assigning measure 0 to all sets in I, and measure ∞ to all other Borel
sets. This is the desired measure since μ is translation invariant and countably additive, μ(C) = 0,
and μ(−C) = ∞.

It remains to construct C. Simply let C consist of those reals in [0, 1] having a base 4 expansion
using no 2’s. Then C is an asymmetric Cantor set, which is closed, and hence Borel. To prove that
C has the desired property it suffices to show that, for any sequence $\{a_n\}$,

$$1 - C \not\subseteq \bigcup_{n=1}^{\infty} (C + a_n).$$

Note that 1 − C consists of reals in [0, 1] having a base 4 expansion using no 1’s. Suppose $\{a_n\}$ is
given. It is a simple matter to get a real x in [0, 1] by building its base 4 expansion inductively so
that the expansion has no 1’s (and hence x is in 1 − C) but yet, for each n, $x - a_n$ has two 2’s in a
base 4 expansion, and therefore at least one 2 in any of its base 4 expansions. Namely, suppose
the first r base 4 digits of x have been constructed guaranteeing that x is as required with respect
to $a_m$, for m < n, and let $x_{n-1}$ denote the number beginning with the already constructed r digits
of x and ending in a string of 0’s.

Consider the base 4 expansion (not ending in all 3’s) of $x_{n-1} - a_n$. If this expansion ends in all
1’s, add a string of the form 0...03030 to the already constructed digits of x, where the string
3030 corresponds to positions beyond the rth where $x_{n-1} - a_n$ has 1’s. Otherwise add a string of
the form 0...0i0...0...0j0...0 to x where i, j are in positions beyond the rth where
$x_{n-1} - a_n$ has a 0, 2 or 3, and i, j equal 2, 0 or 3 accordingly; moreover, the final 0 and at least
one of the 0’s between i and j are in positions where $x_{n-1} - a_n$ has a digit different from 3 (this is
so that carrying will not enter into the arithmetic). These additions guarantee that $x - a_n$ will
have a 2 in any of its base 4 expansions, and hence that x will not lie in $C + a_n$.


K. E. Hirst points out that a counterexample is given in his paper “Translation invariant measures which are not 
Hausdorff measures,” Proc. Cambridge Philos. Soc., 62 (1966) 693-698. 


Countable Sets of Linear Functions 


6349 [1981, 446]. Proposed by J.-B. Hiriart-Urruty, University of Clermont-Ferrand II, France, 
and T. H. Sweetser, University of California at San Diego. 


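Let $\mathcal{A}$ be a countable set in $L(\mathbb{R}^{n}, \mathbb{R}^{m})$, the space of all linear functions from $\mathbb{R}^{n}$ to $\mathbb{R}^{m}$; we
define $\mathcal{A}x = \{Ax \mid A \in \mathcal{A}\}$. Does $B \in L(\mathbb{R}^{n}, \mathbb{R}^{m})$ satisfying $Bx \in \mathcal{A}x$ for all $x \in \mathbb{R}^{n}$ necessarily belong
to $\mathcal{A}$?


Solution by Chin-Chi Shan, Ohio State University. The answer is “yes.” Let $\{e_i\}_{i=1}^{n}$ be a basis of
$\mathbb{R}^{n}$ and

$$W = \Big\{ \sum_{i=1}^{n} a^{i-1} e_i : a \in \mathbb{R} \setminus \{0\} \Big\}.$$

Then any n distinct vectors in W constitute a basis of $\mathbb{R}^{n}$ since their coefficients form a Vandermonde
determinant. Since W is uncountable and $\mathcal{A}$ is countable, there exists at least one $A \in \mathcal{A}$ such that
$|\{x \in W : Bx = Ax\}| \ge n$. Hence B = A on a basis of $\mathbb{R}^{n}$, and so B = A on $\mathbb{R}^{n}$.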


Also solved by V. Anantharam, Carlos Biasi and Wilson Mauricio Tadini (Brazil), Jozef Bobok (Czechoslovakia), 
Jerzy Browkin (Poland), F. S$. Cater, Chico Problem Group, John Cobb, G. Crofts, Patrick R. Gardner, Jaroslav 
Gurican (Czechoslovakia), William H. Gustafson, Dinh The Hung and Benny Cheng, Elgin Johnston, Joel Levy, 
J. G. Mauldon, William Myers, Ivan Netuka (Czechoslovakia), Nicholas Passell, Eero Posti (Finland), Adam Riese, 
C. Ray Rosentrater, A. R. Sourour (Canada), R. A. Vitale, David M. Wells, J. B. Wilker (Canada), Pei Yuan Wu 
(Taiwan), and the proposers. 


Separation of Edges of a Square by a Closed Set 


6351 [1981, 540]. Proposed by James W. Walker, MIT. 


Let F be a closed subset of the unit square $I^{2} = [0, 1] \times [0, 1]$ such that F contains the closed
horizontal edges $[0, 1] \times \{0\}$ and $[0, 1] \times \{1\}$, but does not intersect the open vertical edges
$\{0\} \times (0, 1)$ and $\{1\} \times (0, 1)$. Show that either both horizontal edges belong to the same connected
component of F, or else both vertical edges belong to the same component of $I^{2} \setminus F$.


Solution by John Cobb, University of Idaho. Suppose no component of $I^{2} \setminus F$ contains both
vertical edges. Then F separates these two edges and hence, by the unicoherence of $I^{2}$, some
component G of F separates them also. (See M. H. A. Newman, Elements of the Topology of Plane
Sets of Points, Cambridge University Press, 2nd ed., Theorem 14.3.) If G fails to contain one of
the horizontal edges, say the top, then the connected set

$$\{0\} \times (0, 1) \;\cup\; [0, 1] \times \{1\} \;\cup\; \{1\} \times (0, 1)$$

lies in $I^{2} \setminus G$, contradicting the separation of the vertical edges by G.

Also solved by the Chico Problem Group, Jay Hook, Terry L. Lay, Mark D. Meyerson, and the proposer.


Can $S_n$ be Embedded in $A_{n+1}$?


6353 [1981, 541]. Proposed by the editors.


“Everyone” knows that the symmetric group $S_n$ can be embedded in $A_{n+2}$. Can $S_n$ be
embedded in $A_{n+1}$ (alternating group)?


I. Solution by Lenny Jones, Shippensburg State College, Shippensburg, Pennsylvania. Yes, for
n = 1, and no, for n > 1. For n = 1 this is obvious. For $n \ge 2$, by way of contradiction, assume $S_n$
can be embedded in $A_{n+1}$. Then $[A_{n+1} : S_n] = (n + 1)/2$, which yields a contradiction unless n is
odd and at least 3. (The case n = 3 is disposed of using the “well-known” fact that $A_4$ has no
subgroup of order 6.) The action of $A_{n+1}$ on the cosets of $S_n$ gives a homomorphism from $A_{n+1}$
into $S_{(n+1)/2}$. This homomorphism has a nontrivial kernel since $(n + 1)!/2 > ((n + 1)/2)!$ for
odd $n \ge 5$. But this contradicts the fact that $A_{n+1}$ is simple for $n \ge 5$.


II. Solution by James Riddell, University of Victoria, Canada. First consider cases $n \ge 13$. Such
an n can be expressed $n = 2 + p + q + \cdots$ where p, q, ... are distinct odd primes, since every
integer > 10 can be partitioned into distinct odd primes (see Sierpiński, “Elementary Theory of
Numbers,” Warsaw, 1964, p. 144, Theorem 12). Then $S_n$ contains a permutation of order
$2 \cdot p \cdot q \cdots$, namely the product of disjoint cycles of the respective lengths 2, p, q, .... But $A_{n+1}$
cannot contain a permutation of this order, since otherwise n + 1 would have to be expressible
$n + 1 = 2 + 2 + p + q + \cdots$.

For smaller n we need consider only n odd, since for n even, $n! \nmid (n + 1)!/2$. The cases
n = 5 = 2 + 3, n = 7 = 2 + 5, n = 9 = 2 + 7, and n = 11 = 2 + 9 can be argued similarly to
the above, and the case n = 3 by noticing that $S_3$ has two elements of period 2 whose product has
period 3 (and $A_4$ does not have such a pair).
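
The element-order argument of Solution II can be tried out for small odd n by listing cycle types. The sketch below is an illustration with hypothetical function names, not part of either solution: it compares the set of element orders of $S_n$ with that of $A_{n+1}$, using the standard facts that the order of a permutation is the least common multiple of its cycle lengths and that a permutation is even exactly when it has an even number of even-length cycles.

```python
from functools import reduce
from math import gcd


def partitions(n, largest=None):
    """Yield the partitions of n as non-increasing tuples (cycle types)."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for part in range(min(n, largest), 0, -1):
        for rest in partitions(n - part, part):
            yield (part,) + rest


def lcm_of(parts):
    """Least common multiple of a tuple of positive integers (1 if empty)."""
    return reduce(lambda a, b: a * b // gcd(a, b), parts, 1)


def element_orders(n, even_only=False):
    """Orders of elements of S_n, or of A_n when even_only is True."""
    orders = set()
    for cycle_type in partitions(n):
        even_length_cycles = sum(1 for length in cycle_type if length % 2 == 0)
        if even_only and even_length_cycles % 2 == 1:
            continue  # odd permutation, not in the alternating group
        orders.add(lcm_of(cycle_type))
    return orders


def orders_obstruct_embedding(n):
    """True if S_n has an element order that does not occur in A_(n+1)."""
    return not element_orders(n) <= element_orders(n + 1, even_only=True)


if __name__ == "__main__":
    for n in (3, 5, 7, 9, 11, 13):
        print(n, orders_obstruct_embedding(n))
```

For n = 3 the orders alone give no obstruction, which is consistent with the separate argument the solution uses for that case.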


Also solved by D. M. Bloom, Duane M. Broline, Jerzy Browkin (Poland), D. W. Brown, Paul L. Chabot,
Lorraine L. Foster, Robert Gilmer, James Henle & Stan Wagon, A. A. Jagers (The Netherlands), Michael Josephy
(Costa Rica), John Kahila, Eugene Levine, Desmond MacHale (Ireland), Joseph J. Rotman, Claudia Spiro, Rony
Teitler (England), Gary L. Walls, and Michael Woltermann. Partially solved by Peter R. Atwood and Kenneth
Klinger.


MISCELLANEA 


91. If I had been trained in research—that ridiculous contradiction in terms—should I have 
done more than produce additions to the existing store of little papers with blunted conclusions, 
of which there are already too many? 


“George Ponderevo” in H. G. Wells, Tono-Bungay, Book 2, Chapter 2.


Topologie. Fonctions réelles d’une variable réelle. Second Edition. By A. Doneddu. (Nouveau cours
de mathématiques, vol. 4.) Vuibert, Paris, France, 1979. 315 pp.


R. P. Boas 
Department of Mathematics, Northwestern University, Evanston, IL 60201 


We all know that Mathematics has changed greatly during the last half-million class 
periods—fifty years, that is. It has grown at an accelerated rate, and many important problems 
have been solved in our time after resisting many efforts. Has Mathematics Education kept up? 

Well, to some extent. Half a century ago I was a student in a course, more or less at the level of 
Doneddu’s book. The instructor had been specifically warned by the Department not to mention
uniform convergence (it was too difficult a concept!). Doneddu does discuss uniform convergence, 
in fact, in exactly the terms in which I was to encounter it the next year (although I think, now, 
that there are better ways). In half a century we have gained perhaps a year. Shouldn’t we expect 
more than that from a field that is growing explosively? 

Doneddu says that his book was written as one of a series of texts prescribed by a decree of the 
French government in 1972. The other volumes are: (1) Structures fondamentales, (2) Polynômes et
algèbre linéaire, (3) Espaces euclidiens et hermitiens, Géométries, (5) Fonctions vectorielles, Séries,
Équations différentielles, (6) Géométrie différentielle, Intégrales multiples. Give or take a few topics,
this could be an outline of an advanced calculus course of the 1930’s, except for the first part of 
volume 4. I presume that these volumes are not intended as a first course for beginners, although 
technically they could be. (In fact I know a professional mathematician who taught himself 
calculus from a three-volume Vorlesungen über Differential- und Integralrechnung.) Volume 4 could
be used as a handy reference in which you could look up concise and reliable statements and 
proofs of calculus theorems. But be warned: Doneddu’s exposition suffers from his apparent 
belief that more terminology means more rigor. Presumably this will not trouble the intended 
audience, who must have been trained at Bourbaki’s knee, but it does create trouble for outsiders. 
For example, Doneddu provides no index entries for some of Bourbaki’s favorite words and no 
index of symbols. I had to turn a good many pages, one by one, before I found out for sure what 
%(E, R) stood for. 

If the students who use this book ever learn that calculus is used for something, whether in or 
out of mathematics, they will not learn it from the book itself. Considering that the directions 
taken by calculus have been largely determined by its applications to geometry, physical sciences, 
and more recently to social sciences, this omission of applications seems hard to justify. Calculus 
is too close to the real world to be presented as an abstract game. 

It is clearly too much to hope that textbooks on calculus, at whatever level, will start including 
very much new mathematics very soon, but they could at least show some awareness of what has 
been going on. Doneddu, for example, mentions computers and calculators in passing, but gives 
no indication of the influence that these tools have had on many parts of mathematics, and on 
numerical analysis in particular. Indeed, he has a chapter on numerical analysis that could have 
been written 50 years ago, except that it is much less sophisticated than what would have been 
acceptable then. 

Anyone ought to be able to think of a dozen significant topics at the advanced calculus level 
that could reasonably count as advanced calculus, but are represented in few, if any, current 
textbooks on advanced calculus (or introductory real analysis, or whatever it happens to be 
called). (My own list would include generalized functions (distributions), constrained extremal
problems with inequality constraints, Abel’s theorem for power series, inequalities, and methods
of approximating infinite series and integrals.) Are such topics omitted because they are not 
traditional? They do not appear in books written for engineers and scientists, either; is this 
because the corresponding faculties themselves use only more traditional mathematics? I am 
rather afraid so. 

You are probably going to ask how more topics can be put into a course in advanced calculus 
that is already too crowded. My guess is that it is too crowded because people are trying to tell 
everything that they know about each topic. It is harder, but more rewarding, to try to pare each 
topic down to essentials, and then cover more topics. 


Pathways to Solutions, Fixed Points, and Equilibria. By C. B. Garcia and W. I. Zangwill. 
Prentice-Hall, Englewood Cliffs, New Jersey, 1981. xvi + 479 pp., $32.00. 


WERNER C. RHEINBOLDT 
Department of Mathematics and Statistics, University of Pittsburgh, Pittsburgh, PA 15261 


For over a century the continuation principle has proved to be an important tool in 
mathematics. To illustrate the idea, suppose that for a point y(0) on some manifold M₁, exactly 
one point x(0) with a specific property is known to exist on another manifold M₂. Then the same 
assertion may be shown to hold for another point y(1) of M₁, by using continuity arguments to 
prove it for all points y(s), 0 ≤ s ≤ 1, of a path on M₁ connecting y(0) with y(1). Early uses of 
this approach date at least to H. A. Schwarz and his work on conformal mappings around 1869. 
Then J. Hadamard and M. Levy applied it in connection with the inversion of nonlinear 
mappings, and it is also a basic tool in J. Leray’s and J. Schauder’s celebrated work on differential 
equations. These are only a few of many possible examples. 

Until a few decades ago the continuation principle was strictly considered a theoretical tool; 
for example, there would have been no interest in computing explicitly the paths in our sketchy 
illustration. The first application of the approach to a numerical problem appears to go back to a 
little noticed paper by E. Lahaye in 1932, but it was not until the early fifties and the advent of 
automatic computing that continuation techniques began to be used widely for numerical work. 

One of the main problems to which these techniques are applied concerns the solution of a 
nonlinear equation Fx = 0 in Rⁿ. In order to compute a specific solution x*, a typical approach is 
to imbed the given equation into a family of equations H(x, t) = 0, 0 ≤ t ≤ 1, for which there 
exists a computable solution path x = x(t), 0 ≤ t ≤ 1, beginning at a known point x(0) and 
ending at x(1) = x*. A related but conceptually distinct problem arises in many areas of 
engineering, notably structural mechanics, where the equation under consideration always in- 
cludes a number of physically important, intrinsic parameters. In these problems it is of little 
interest to determine only a few specific solutions. Instead, the requirement is to follow paths on 
the solution-manifold in the space of all state and parameter variables and thereby to detect 
specific features of the manifold which signify a change of behavior of the system under study. 
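
As a rough illustration of the first problem, the following sketch follows a solution path numerically in the simplest one-dimensional case. The particular imbedding H(x, t) = t F(x) + (1 - t)(x - x0), the test function F, and the names continuation_solve, dF, steps, and newton_iters are assumptions chosen for this example only; they are not taken from the book under review. At each value of t the previous path point serves as the starting guess for a few Newton steps on H(., t).

```python
def continuation_solve(F, dF, x0, steps=20, newton_iters=10, tol=1e-12):
    """Follow the zero of H(x, t) = t*F(x) + (1 - t)*(x - x0) from t = 0 to t = 1.

    F and dF are a scalar function and its derivative (hypothetical inputs);
    x0 is the known starting point, since H(x0, 0) = 0.
    """
    x = x0
    for k in range(1, steps + 1):
        t = k / steps
        # Corrector: Newton's method on H(., t), started at the previous path point.
        for _ in range(newton_iters):
            h = t * F(x) + (1.0 - t) * (x - x0)
            if abs(h) < tol:
                break
            dh = t * dF(x) + (1.0 - t)  # derivative of H with respect to x
            x -= h / dh
    return x


def F(x):
    # Illustrative equation F(x) = 0; F is increasing, so the path has no turning points.
    return x**3 + x - 3.0


def dF(x):
    return 3.0 * x**2 + 1.0


root = continuation_solve(F, dF, x0=0.0)
print(root, F(root))
```

Because the derivative of H with respect to x stays positive for this choice of F, the path x(t) is smooth and the simple stepping in t suffices; equations with turning points would need a different parametrization of the path.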

For the first of these problems, namely the computation of a specific solution of some 
equation, two distinct continuation techniques are available. The first one involves the case when 
the solution paths are differentiable which in turn allows their representation as a solution of a 
differential equation. The second approach was initiated by H. Scarf in 1967 in a very different 
setting. In fact, his technique is based on homological rather than homotopic concepts and—as 
observed by H. Kuhn—made use of a numerical form of Sperner’s lemma. Since then this 
approach has been reformulated and is now considered principally in the form of algorithms 
involving piecewise-linear solution paths. 
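
For the first of these techniques, the differential equation in question can be written out under the usual assumptions. The notation H_x and H_t for the partial derivatives of H is introduced here for illustration, and the derivation assumes that H is continuously differentiable and that H_x is nonsingular along the path; neither assumption is stated in the review.

```latex
\[
  H\bigl(x(t),t\bigr) = 0 \quad (0 \le t \le 1)
  \;\Longrightarrow\;
  H_x\bigl(x(t),t\bigr)\,\dot{x}(t) + H_t\bigl(x(t),t\bigr) = 0 ,
\]
\[
  \dot{x}(t) = -\,H_x\bigl(x(t),t\bigr)^{-1}\, H_t\bigl(x(t),t\bigr),
  \qquad x(0)\ \text{given}.
\]
```

Integrating this initial value problem from t = 0 to t = 1 then yields an approximation to x(1) = x*.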

Up to now most of this material on the numerical analysis of both of these forms of 
continuation algorithms could be found mainly in the journal literature or in conference 
proceedings. This book represents a first introduction to the area on a level accessible to graduate 
students and to all those interested in applying these methods in their own fields. 

The authors’ own area of interest centers around applications in mathematical programming 
and economics. It is therefore not surprising that the corresponding chapters are probably the 
strongest ones in the book. There are very nice discussions of applications to nonlinear program- 
ming, economic equilibria, and game theory. Correspondingly, the presentations of the piecewise 
linear algorithms and of some of the underlying concepts, such as linear complementarity, are 
especially well done. 

In contrast, the case of differentiable paths is dealt with in a somewhat less rounded form, and, 
in fact, the discussion of the corresponding algorithms seems rather sketchy. There is no 
consideration of the indicated problem of using continuation algorithms for analyzing solution 
manifolds, and very little mention is made of the extensive engineering literature in this area. Only 
a few elementary examples of structural equilibrium problems are included. Similarly, the brief 
discussion of the ideas behind catastrophe theory only skirts the central points of the theory, and 
the considerations of the loss of stability of the equilibria under study are not related to any 
numerical algorithms. 

However, these last observations should not detract from the overall value of the book. In fact, 
any book which attempts to cover a fairly large area with relatively elementary mathematical 
techniques cannot satisfy every reader. This text is indeed a very nice introduction to the field 
which should serve its intended purpose well, namely, to make this circle of concepts and 
techniques accessible to a wider audience, and to provide a possible text for graduate courses and 
seminars on these topics. For this it deserves a warm recommendation. 


LETTERS TO THE EDITOR 


Material for this department should be sent to Professor P. R. Halmos, Department of Mathematics, Indiana 
University, Bloomington, IN 47405. 


7 June 1982 


Editor: 


I am, and have been since 1962, the mathematics department at this small (student body about 
400) “liberal arts” college. My normal teaching load is anywhere from 14 to 17 hours each 
semester—4 to 5 different math classes. As a result, and I suppose by choice, my mathematical 
involvement is other than research. I happen to like (math) teaching, and, even more, I am most 
happy doing or reading some sort of mathematics. 

About 80% of my math majors go on to graduate school. My course work for them involves 
everything from calculus to topology, complex variables, and operations research. 

All this is prelude to justify what I want in the MONTHLY; namely “folklore” ideas in various 
fields. Examples: proving the vector product distributes over addition by pseudo-division, an 
accessible (to junior and senior math majors) topological discussion of orientation relative to 
Gauss’ theorem (manifold theory), use of the Sylow theorems and other techniques in showing the 
actual construction of non-isomorphic subgroups of groups of order 10 to 17, and developments 
in operations research explained in less than 500 pages. 

I do like the history of development of mathematical concepts. I think the students really profit 
from learning about the difficult times the “early day” people had in smoothing out the ideas so 
glibly presented in our short class periods. I like to use the concept of the evolution of normal 
subgroups in this context. 

Book reviews are important to me. Publishers do not have much to do with small colleges. How 
my colleagues react to current texts is important to me. Examples: I switched from Thomas & 
Finney to Philip Gillett’s Calculus, I use Boas’ A Primer of Real Functions in my one semester 
introduction to real variables course—how do others react to these texts? I refuse to teach a 
course in computer programming, yet I have my students learn (by some instruction in first year 
calculus) how to program and run the Hewlett-Packard 41. I feel as if any more is not too 
productive—am I way off base? 


William H. Jamison 

Physics and Mathematics Department 
Rocky Mountain College 

Billings, Montana 59102 


Comments, anyone? Ed. 


Letter from the editor. 


One complaint about the MONTHLY keeps recurring to almost every editor almost every year: 
the articles are too hard, too technical, too specialized. Perhaps only a few think that, but, if so, 
they are vocal out of proportion to their numbers; if most readers (possible readers) think that, 
something ought to be done about it. 

I reject research papers out of hand; the MONTHLY is not a research outlet. If the referees (and 
my own instincts) tell me that a would-be expository paper is just not expository enough, many 
people (referees, authors, associate editors, editorial assistants) go to work and try to prepare a 
good revision. 

What else can be done? An editor doesn’t write the articles. I sometimes solicit them, but 
mostly I just have to wait till they come in. I would like to hear from the readers: do you agree 
with the complaint, and do you have a suggestion for a cure? Would you be willing to write the 
proper kind of article—because you have something to say, because you can say something better 
than the last person who tried it, or just to show that it can be done? 


Paul Halmos 


Mathematics from MACMILLAN in 1983 


Introductory Algebra and 
Intermediate Algebra 


PATRICIA HOOPER and LINDA PULSINELLI, both Western Kentucky University 

Two paperback Text/Workbooks. 

Motivation is the key. The pedagogical approach leads students and professors to better 
results. 


College Algebra and 
College Algebra and Trigonometry 


CHRIS VANCIL, University of Kentucky 
A fresh and concise presentation of the topics that develop and strengthen mathematical 
skills and reasoning. 


Trigonometry An Analytic Approach, Fourth Edition 


IRVING DROOYAN, WALTER HADEL, and CHARLES C. CARICO, all, Los Angeles Pierce 
College. 

Introducing circular functions in the first chapter, this text emphasizes preparation for 
calculus throughout. 


Trigonometry with Calculators 


LAWRENCE S. LEVY, University of Wisconsin, Madison 
Thoroughly covers trigonometry in a relaxed, personal style that begins with the right- 
triangle approach, while integrating regular use of the calculator. 


Fundamentals of Mathematics Third Edition 


WILLIAM M. SETEK Jr, Monroe Community College 

This new edition of a consistent best seller features a chapter on computers and 
computer programming. Its exposition is patient and its worked-out examples and 
problem sets are without peer. 


Introduction to Statistics Third Edition 


RONALD E. WALPOLE, Roanoke College 
This third edition has been revised to include varied real-life situations while retaining 
the excellent pedagogical approach of previous editions. 


Elementary Statistical Concepts Second Edition 


RONALD E. WALPOLE, Roanoke College 
Translates difficult statistical concepts into hundreds of concrete, easy-to-understand 
examples and methods for students with a limited mathematics background. 


Probability and Statistical Inference 


Second Edition 
ROBERT V. HOGG, University of Iowa, and ELLIOT A. TANIS, Hope College 


MACMILLAN PUBLISHING CO., INC., COLLEGE DIVISION 
866 THIRD AVENUE, NEW YORK, N.Y. 10022 


CHICAGO 


A CALCULATING PEOPLE Patricia Cline Cohen 


In this absorbing and original work, Cohen traces the development of 
numeracy in America and links quantification with changes in the way 
men thought about their social, political, and economic relations. 

Cloth $22.50 (est.) 272 pages (est.) 


LOGIC MACHINES AND DIAGRAMS 
Second Edition Martin Gardner With a Foreword by Donald Michie 


In this book, Gardner provides the only history in English that details 
the methods of solving logic problems by diagrams and by machines 
other than digital computers. He discusses everything from Ramon 
Lull’s thirteenth-century diagrams to the Stanhope Demonstrator, 
Jevon’s Logic Machine, the Marquand Machine, and even simple logic 
card windows. 

Cloth $16.00 Paper $5.95 176 pages Illus. 


MATHEMATICS AND HUMOR John Allen Paulos 
Paulos studies the formal properties of humor by plotting, in 
mathematically graphic terms, the structure of jokes and humorous 
thoughts. He looks at jokes and nonstandard geometry, self-reference 

in logic and humor, repetition and rules, combinatorics, Wittgenstein, 
and Woody Allen, among others. 

“His approach is unorthodox and fresh.”—Douglas Hofstadter, New 
York Times Book Review 

Paper $4.95 124 pages Illus. 


INFINITE-DIMENSIONAL OPTIMIZATION AND 
CONVEXITY Ivar Ekeland and Thomas Turnbull 


The authors seek to determine whether, when given an optimization 
problem consisting of minimizing a functional over some feasible set, 
an optimal solution—a minimizer—may be found. 


Chicago Lectures in Mathematics series 
Cloth $15.00 (est.) Paper $7.00 (est.) 200 pages (est.) 


from Midway Reprints 

LECTURES ON LIE GROUPS 
J. Frank Adams 
Paper $10.00 198 pages 

SIMPLICIAL OBJECTS IN ALGEBRAIC TOPOLOGY 
Peter May 
Paper $9.00 168 pages 


The University of Chicago Press 
5801 South Ellis Avenue, Chicago, IL 60637 


HOUGHTON MIFFLIN 


The Mainstream of Algebra and 


Trigonometry, Second Edition 
A. W. Goodman, University of South Florida 
506 pages • cloth • Solutions Manual • 1980 


Business Statistics: 
Basic Concepts and Methodology 
Third Edition 


Wayne W. Daniel and James C. Terrell 
Both of Georgia State University 

About 800 pages • cloth • Study Guide 
Instructor’s Resource Manual • Solutions 
Manual • Computerized Test Program 
Computerized Data Base • Just published 


Statistical Analysis for 


Business Decisions 

Gary E. Meek, University of Akron 
Stephen J. Turner, Babson College 
About 768 pages • cloth • Instructor’s 
Manual • Just published 


Practical Math for Business 
Third Edition 


Alan R. Curtis, Lansing Community College 
About 368 punched, perforated pages 
paper • Instructor's Annotated Edition 
Early 1983 


Mathematics of Finance 
Sixth Edition 


Robert Cissell and Helen Cissell 

David C. Flaspohler, Xavier University 
cloth • Instructor’s Manual 


Basic Behavioral Statistics 
Dennis E. Hinkle, Virginia Polytechnic 
Institute and State University 

William Wiersma and Stephen G. Jurs 
Both of University of Toledo 

410 pages • cloth • Student Workbook 
Solutions Manual • 1982 


Also by Hinkle, Wiersma, and Jurs 
Applied Statistics for the 
Behavioral Sciences 


489 pages • cloth • Workbook • Solutions 
Manual • 1979 


Teaching Elementary School 
Mathematics: An Active 

Learning Approach 

Harold H. Lerch 

University of Illinois, Urbana 

398 pages • cloth • Activities Handbook 
(see below) • Instructor's Manual • 1981 


Also by Lerch 
Active Learning Experiences 
for Teaching Elementary 


School Mathematics 
461 pages • paper • 1981 


Teaching Mathematics: 
Methods and Content 
Second Edition 


Fredricka K. Reisman, University of Georgia 
cloth • Instructor's Manual 


Basic Statistics for Education and 

the Behavioral Sciences 

Sharon L. Weinberg and Kenneth P. Goldberg 
Both of New York University 

cloth • Instructor's Manual 


Overcoming Math Anxiety 
Sheila Tobias 
278 pages • paper • 1979 


For adoption consideration, request examination copies 
from your regional Houghton Mifflin office. 


Houghton Mifflin 


13400 Midway Rd., Dallas, TX 75234 

1900 So. Batavia Ave., Geneva, IL 60134 
Pennington-Hopewell Rd., Hopewell, NJ 08525 
777 California Ave., Palo Alto, CA 94304 


Developmental 
Delivery 
Systems 


That Deliver 


A diagnostic/prescriptive approach. . . 


PCSP: Personalized Computational Skills Program 


A Brush-Up Book® 

Bryce R. Shaw, Senior Author 

Gwen Shelton, Bee County College, Texas 
Sandra Pryor Clarkson, Hunter College, New York 


VOLUME 1 COMPUTATIONAL SKILLS • 1980 
Module A: Whole Numbers 

Module B: Fractions 

Module C: Decimals, Proportions, and Percents 


VOLUME 2 APPLICATIONS • 1981 

Module D: Applications Using Computational Skills 
Module E: Special Skills with Applications 

Module F: Beginning Algebra with Applications 


PCSP COMBINED EDITION • 1982 


PCSP uses skill models, worked examples, and thousands of practice 
problems and applications to improve computational competency. Built- 
in diagnostic and mastery tests pinpoint students’ problem areas and 
measure their learning progress. 

The six modules are available separately or in two volumes, which 
may be used independently, concurrently, or in sequence. The Combined 
Edition offers a quick brush-up on basic skills together with their 
applications in a single volume. 


An interactive approach... 


Basic College Mathematics: An Applied Approach 
Richard N. Aufmann and Vernon C. Barker 

Both of Palomar College 

512 pages • paper • Instructor's Annotated Edition 
Alternate Testing Program • Solutions Manual • 1982 


A mathematically sound review of basic computation, applications, and 
beginning algebra. 


Introductory Algebra: An Applied Approach 
Richard N. Aufmann and Vernon C. Barker 

About 512 pages • paper • Instructor's Annotated Edition 
Alternate Testing Program • Solutions Manual • Early 1983 


A mathematically sound and comprehensive coverage of the skills and 
applications typically found in an elementary algebra course. 


Each of the above texts in the Aufmann/Barker series features an 
interactive delivery system that results in a higher level of mastery with 
less demand on the instructor's time. An objective-specific approach facil- 
itates management of instruction, and an applied approach enhances 
students’ insight into the value of the mathematics they're learning. 


Essential Mathematics with Applications 
Vernon C. Barker and Richard N. Aufmann 
About 288 pages • paper • Answer Booklet • Just published 


Here is a new program especially designed for the mature student who 
desires a mathematically sound yet economical review of the 
computational skills and applications generally regarded as essential 
for adult life and/or continued study. 

While the objective-by-objective organization helps students and/or 
the instructor to manage instruction, the interactive delivery system 
helps students to learn on their own. Emphasis on applications helps 
students to use the mathematics they have learned. 


For adoption consideration, request examination material from your regional Houghton Mifflin 
office. 


Houghton Mifflin 


13400 Midway Road, Dallas, TX 75234 

1900 South Batavia Avenue, Geneva, IL 60134 
Pennington-Hopewell Road, Hopewell, NJ 08525 
777 California Avenue, Palo Alto, CA 94304 


AMONG THE NEW BOOKS IN 1983 
THAT ARE GOING TO MAKE A DIFFERENCE 


Linear Algebra with Applications 

Second Edition 

by Jeanne Agnew, Oklahoma State University; and R. C. Knapp, Herkimer County 
Community College. An informal, highly applied introduction for mathematics, 
computer science, engineering, and physical science students. 


Elements of Calculus 

by Don Allen, Charles Chur, and William Perry, Texas A & M University. Fulfills the 
elementary calculus requirements for students majoring in business, biology, and 
the social sciences. 


College Algebra and Trigonometry 
by David Gustafson and Peter Frisk, Rock Valley College. Covers topics with 
precision and detail in a lively and accessible style. 


Precalculus Mathematics 

A Functional Approach, Second Edition 

by Karl J. Smith, Santa Rosa Junior College. Highly student motivated—includes 
an innovative treatment of the graphing of trigonometric functions. 


BROOKS/COLE PUBLISHING COMPANY 


For further information write Craig Barth, Editor 
555 Abrego Street, Monterey, California 93940 


RICE UNIVERSITY Position in Operations Research 


The Mathematical Sciences Department of Rice University expects 
to have a position in Operations Research available beginning Fall, 
1983. Although the position is expected to be at a senior level, applica- 
tions at all levels are invited. Primary duties include providing leader- 
ship in teaching, research, and direction of graduate students in the 
department’s Operations Research program. The department currently 
has responsibilities in the areas of computer science, numerical analy- 
sis, operations research, probability, statistics, and physical mathe- 
matics and biomathematics. The department has an active undergrad- 
uate and graduate program in these areas. If interested, please send 
resume and names of four references to 


Professor Richard Tapia 


Chairman 
Mathematical Sciences Dept. 
Rice University 
P.O. Box 1892 
Houston, TX 77251 


WILEY 
has the right solutions 
for all your textbook problems. 


INVITATION TO GEOMETRY 

Z. Alex Melzak 

A lively, well illustrated treatment of geome- 
try designed for students with little back- 
ground in math. Emphasizes spatial relation- 
ships and application of geometry. 

(1-09209-6) December 1982 

approx. 272 pp. In Press 


CASE STUDIES IN 
MATHEMATICAL MODELLING 
Edited by D. J. G. James and J. J. McDonald 
A practical, authoritative volume that brings 
together 16 incisive articles—each written by 
a top expert in the field—on the applications of 
modelling techniques. 

(0-27177-9) 1981 214pp. $17.95 Paper 


INTRODUCTION TO THE THEORY 
OF NUMBERS 

Harold N. Shapiro 

Here is a sophisticated introduction to the 
problems and properties of integers that 
details basic number theory. Brings together a 
variety of applications within a general philoso- 
phy and approach to counting problems. 
(1-86737-3) December 1982 

approx. 488 pp. In Press 


FUNCTION THEORY OF SEVERAL 
COMPLEX VARIABLES 

Steven G. Krantz 

The first book to address the interplay 
between several complex variables and har- 
monic analysis, partial differential equations, 
and functional analysis. 
(1-09324-6) 1982 437pp. $39.95 


AN INTRODUCTION TO TENSOR 
CALCULUS, RELATIVITY AND 
COSMOLOGY, 3RD ED. 

D. F. Lawden 

The third edition of this highly readable text 
contains sections on black holes and gravita- 
tional waves, with a new chapter covering the 
application of general relativity to cosmology. 
(1-10082-X) 1982 205pp. $35.95 Cloth 
(1-10096-X) $16.95 Paper 


COMPUTATIONAL 
METHODS IN ELEMENTARY 
NUMERICAL ANALYSIS 

J. L. Morris 

This self-contained introductory text stresses 
a “learn by doing” approach to numerical anal- 
ysis with exercises to be done on a computer 
or micro-computer. 

(1-10419-1) March 1983 450 pp. $41.95 Cloth 
(1-10420-5) 450 pp. $24.95 Paper 


ALGEBRA 

Volume 1, 2nd Ed. 

P. M. Cohn 

Reviews important background in algebra and 
introduces more advanced topics, emphasiz- 
ing linear algebra and the properties of groups 
and rings. 

(1-10169-9) 1982 410pp. $23.95 Paper 


ELEMENTS OF GROUP THEORY 
FOR PHYSICISTS 

Third Edition 

A.W. Joshi 

Develops the mathematics of group theory 
and its application to quantum mechanics, 
crystallography, and solid state physics. 
(0-27306-2) 1982 334pp. $16.95 


COMBINATORIAL INTEGRAL 
GEOMETRY 

R.V. Ambartzumian 

This groundbreaking work presents a meth- 
odology by which the probabilities associated 
with random geometrical objects are related 
to measurements in the underlying space. 


(1-27977-3) 1982 240 pp. $45.00 


CONVEX SETS AND THEIR 
APPLICATIONS 


Steven R. Lay 
Designed for upper level undergraduate and 
first year graduate students. This is a unified 
approach to the theory and applications of con- 
vex sets. Contains over 90 illustrations and 
450 exercises. 


(1-09584-2) 1982 244pp. $29.50 


To order a 30-day examination copy of any of these books, write to Lisa Sullivan, Dept. 3-8415 or CALL 
TOLL FREE (800) 526-5368. In New Jersey, call collect (201) 797-7809. 


605 Third Avenue, New York, N.Y. 10158 


Prices subject to change without notice. 


WILEY-INTERSCIENCE a division of John Wiley & Sons, Inc. 


In Canada: 22 Worcester Road, Rexdale, Ontario M9W 1L1 


3-8415 


A Transition to Advanced Mathematics 
by Douglas Smith, Central Michigan University, Maurice Eggen, 
Trinity University, and Richard St. Andre, Central Michigan University 


A new book for the course that bridges the gap between the calculus sequence 
and the upper division mathematics courses—offers a core of five chapters— 
beginning with coverage of logic and proofs and ending with cardinality—that 
introduce the basic tools and principles of proof 


The core chapters are followed by independent chapters on algebra and analysis 
that cover some of the key theorems in these areas to give students a chance to 
practice newly acquired conceptual skills. 


For further information write Craig Barth, Editor 
BROOKS/COLE PUBLISHING COMPANY 
555 Abrego Street, Monterey, California 93940 


“open up fields of seemingly inexhaustible wealth.” 
Prof. Alexander Grothendieck 


SSS’ HIGHLY ACCLAIMED companion volumes on 
new properties & methods of analysis of general algebra- 
ic curves & their invariants under inversion. 


SYMMETRY, An Analytical Treatment 
by J. Lee Kavanau 
August, 1980, 656 pp., illus., $29.95 

“One of the most original treatments of plane curves to appear in modern times. The author’s 
new and deeper studies...reveal a great number of beautiful & heretofore hidden properties of 
algebraic plane curves.” Prof. Basil Gordon 

“Provides sharp new tools for studying the properties of general algebraic curves.” 
Prof. Richard Fowler 

“Striking new results on symmetry & classification of curves...Read this book for more in 
symmetry than meets the eye.” Amer. Math. Monthly, 1981 

“extremely rich in content...” Nordisk Matem. Tids., 1981 


CURVES & SYMMETRY, vol. 1 
by J. Lee Kavanau 
Jan., 1982, 448 pp., over 1,000 indiv. curves, $21.95, $47 the set 

“Casts much new light on inversion & its generalization, the linear fractional (Moebius) 
transformation, with promise of increasing their utility by an order of magnitude.” 
Prof. Richard Fowler 

“Replete with fascinating new findings...accompanied by a wealth of beautiful & instructive 
illustrations.” Prof. Basil Gordon 


Write for details about our 
Open Competition in Geometry 

Science Software Systems, Inc., 
11899 W. Pico Blvd., Los Angeles, Calif. 90064 
BankAmericard, MasterCharge - 213-477-8541 


Figure on Success 
with Scott, Foresman Mathematics Texts! 


MILLER /LIAL 


[NEW] Introductory Algebra 
A Text /Workbook 

Second Edition 

© 1983, softbound, 544 pp. 


[NEW] 


Intermediate Algebra 
A Text /Workbook 

Second Edition 

© 1983, softbound, 656 pp. 


Both texts with Instructor's Guide, 
Audio Tapes, and new Video Tapes. 
(Rounding out the series: Arithmetic: 
A Text/Workbook by Miller / Salzman, 
©1981, softbound, 531 pp.) 


HELTON/LIAL 


[NEW] Precalculus Mathematics 
A Functions Approach 


© 1983, hardbound, 512 pp. 


Instructor’s Guide with Test Bank 


LIAL/MILLER 


[NEW] Algebra and Trigonometry 
Third Edition 
©1983, hardbound, 640 pp. 


Study Guide and Instructor’s Manual 
with Solutions and Tests 

(Part of the Lial/Miller hardbound 
precalculus series.) 


For more information write 
Meredith Hellestrae 
Department SA 

1900 East Lake Avenue 
Glenview, Illinois 60025 


STEFFENSEN/JOHNSON 


Trigonometry with Analytic 


Geometry 

© 1983, softbound, 448 pp. 

(The newest of six books in the 
Steffensen/Johnson softbound series.) 


All Steffensen/Johnson series texts include 
Supplementary Exercises, Instructor’s 
Guide, Chapter Tests and Final Exam, and 
computer-oriented test items 


LIAL/MILLER 


Mathematics with Applications 
in the Management, Natural, 
and Social Sciences 

©1983, hardbound, 704 pp. 


Instructor's Guide with Solutions and Tests, 
answers to even-numbered exercises 
and computer program 


MILLER/SALZMAN 


[NEW] Business Mathematics 
Third Edition 
©1983, softbound, 480 pp. 


Annotated Instructor's Edition, Test 
Supplement with Transparency Masters 


SIMON 


Calculus with Analytic 


Geometry 
© 1982, hardbound, 926 pp. 
Answer Book and Solutions Manual 




Scott, Foresman and Company 


Algorithms for successful teaching— 
Coming soon from Harper & Row. 


Mary Kay Beavers 


ESSENTIAL MATHEMATICS 
Instructor's Manual and Test Bank. 560 pages/tent. Paper. 


Ignacio Bello 


CONTEMPORARY 
BASIC MATHEMATICAL SKILLS 


Second Edition 
Instructor's Manual and Test Bank. 432 pages/tent. Paper. 


Patrick J. Boyle 


TRIGONOMETRY WITH 
APPLICATIONS 


Instructor's Manual. 448 pages/tent. 


Elizabeth Difanis Phillips, 
Thomas Butts, 
Michael Shaughnessy 


INTERMEDIATE ALGEBRA 
Applications and Problem Solving 
Instructor's Manual. 560 pages/tent. 


To request examination copies, write to Suite 3D, Harper & Row, 10 East 53d 
Street, New York, N.Y. 10022. Please include course title, enrollment, and current text. 


AWARD FOR DISTINGUISHED SERVICE TO 
PROFESSOR EDWIN F. BECKENBACH 


IVAN NIVEN 
Department of Mathematics, University of Oregon, Eugene OR 97403 


This year’s recipient of the Award for Distinguished Service to Mathematics was a widely-known 
member of the mathematical community, active for many decades in research, in teaching, in 
expository writing, and in dedicated work on behalf of professional organizations. Today we 
honor Edwin F. Beckenbach for the long and outstanding service he gave in such a variety of 
ways. His influence has been widespread, significant, and lasting. 

Born in Dallas in 1906, Ed Beckenbach attended school there, graduating as valedictorian of 
his high school class. In the fall of 1924 Ed entered Rice University in Houston, where he 
continued to his Ph.D. in 1931. His dissertation was written under the direction of Lester R. Ford, 
Sr., a distinguished mathematician who served as editor of the MONTHLY some forty years ago. 
Tibor Rado also strongly influenced his work. 

The depression year 1931 was not an auspicious time to move into the job market, but Ed 
survived those tough times of the 1930’s very well. After two years as a National Research Fellow 
at Princeton, Ohio State, and Chicago, he accepted a faculty position at Rice University. He 
moved on later to the University of Michigan, then to the University of Texas, and finally to 
UCLA in 1945, where he served for three decades. Ed Beckenbach is one of the scholars who 
brought UCLA into the national prominence it now enjoys. Among other contributions, Ed 
played a central role in the establishment of the Institute for Numerical Analysis there. 

Ed wrote extensively on inequalities, on geometric complex variable theory, and on its 
extension, through subharmonic functions, to the theory of surfaces. For example, he has written 
two books on inequalities with Richard Bellman, one in the New Mathematical Library Series of 
the Association, and a more advanced book which has become a standard research reference 
work. He edited Modern Mathematics for the Engineer, vols. 1 and 2, Applied Combinatorial 
Mathematics, and Concepts of Communication, with various translations into eight foreign languages, 
and is coeditor of a mathematics dictionary. In addition to these advanced works, Ed is 
author or coauthor of several books at the elementary, intermediate, and collegiate levels. He 
served on writing teams for the School Mathematics Study Group, the National Council of 
Teachers of Mathematics, and the African Education Program. In addition to all this writing and 
editing, Ed contributed his time and talent to professional organizations, including of course the 
Association. 

He was a member of the Council of the American Mathematical Society, and chairman of a 
special blue ribbon Committee on Mathematical Reviews at a time when MR was in considerable 
distress. He served on the membership committee of the Society for Industrial and Applied 
Mathematics, and was chairman of a special committee to reorganize the election procedures of 
the Conference Board of the Mathematical Sciences. Ed was a trustee of the Association of 
Members of the Institute for Advanced Study, and chairperson of its Western Committee. He was 
one of the organizers of a week-long conference on mathematical inequalities, held biennially at 
Oberwolfach in Germany. He edited the Proceedings of these international conferences. 

Lengthy listings tend to become dreary, so this report is confined to a sample of Ed 
Beckenbach’s many and varied contributions to the world of mathematics. No report would be 
complete, however, without drawing attention to Ed’s role in the founding and editing of the 
Pacific Journal of Mathematics. Indeed, the January 1982 issue of that journal is dedicated to the 
two pioneers who were most instrumental in getting the enterprise off the ground, Ed Beckenbach 
and Frantisek Wolf of Berkeley. Few of us play central roles in the founding of a journal. Fewer 
still have the vision, the dedication and the energy to launch not one but two journals, and Ed is 
one of this distinguished group. Ed’s urging over many years that the Association should have a 
newsletter culminated in the creation of our very successful, very well-received newsletter, 
FOCUS. Its editor, Marcia P. Sward, calls Ed the “Father of FOCUS”. 

Let me conclude by emphasizing other service to the Association. Ed was elected chairman of 
the Texas Section, but he left for Michigan before he took office. He was a visiting lecturer for the 
MAA, and served a five-year term as editor of the Mathematical Notes Section of the MONTHLY. 
As chairman of the Committee on Publications since 1971, he guided the unprecedented growth of 
our journals and our series of books and monographs with skill, determination, and enthusiasm. 

In all his activities, Ed Beckenbach enlisted the cooperation of his colleagues by his skill at 
negotiation, his unfailing courtesy and consideration toward others, and his common sense and 
good humor. But Ed’s cooperative and accommodating spirit at the committee table completely 
disappeared in another of his roles. On the tennis court Ed was a hard contender, a tough 
adversary who showed ’em no mercy. Captain of the tennis team at Rice University back in the 
twenties, and later the coach of the team, he spanned six decades with his favorite sport. Even 
recently Ed and his wife Alice competed in national tournaments of “superseniors”; in more 
peaceful moments they indulged their lifelong hobby, tending their hillside acre of plants ranging 
from apricots to orchids. Incidentally, Alice surely holds a record for faithful attendance at 
meetings of mathematicians, having started at the age of eleven as daughter of a one-time 
president of the Association. 

Ed Beckenbach clearly served the mathematical community very well. We are all indebted to 
him for his preeminent leadership. Ed Beckenbach is indeed a most worthy recipient of the Award 
for Distinguished Service to Mathematics. 

* * * 


Edwin F. Beckenbach died on September 5, 1982. 


SINGULAR VALUE ANALYSIS OF CRYPTOGRAMS 


Department of Computer Science, University of New Mexico, Albuquerque, NM 87131 


1. Singular Value Analysis and Cryptanalysis. The singular value decomposition is a matrix 
factorization which can produce approximations to large arrays. Cryptanalysis is the task of 
breaking coded messages. In this paper, we present an unusual merger of the two in which the 
singular value decomposition may aid the cryptanalyst in discovering vowels and consonants in 
messages coded in certain variations of simple substitution ciphers. 

Texts in many languages, including English, have the property that vowels are frequently 
followed by consonants, and consonants are frequently followed by vowels. We say a text is a 
“vowel-follows-consonant” text, or more briefly, a “vfc” text, if the proportion of vowels 
following vowels is less than the proportion of vowels following consonants. That is, if 

$$\frac{\text{number of vowel-vowel pairs}}{\text{number of vowels}} < \frac{\text{number of consonant-vowel pairs}}{\text{number of consonants}}. \tag{1}$$


Some languages produce better vfc texts than others and some deviations by individual letters 
can be expected. In English text, the letter h is often preceded by other consonants to form a 
single sound, as in ch, gh, ph, sh, and, especially, th. The letters l, n, m, and r are often followed by 
consonants. Nevertheless, English is a predominantly vfc language. In Hawaiian, every consonant 
is followed by a vowel, so there are no consonant-consonant pairs. In the romaji transliteration of 
Japanese, the only consonant-consonant pairs are ch, sh, ts, n followed by a consonant, and 
several double consonants. Russian has many sounds which, in transliteration, are of the 
consonant-consonant or vowel-vowel form, such as sh, ch, ts, ya, and ye, but in the Cyrillic 
alphabet, these are represented as single letters. Thus, Russian is also a predominantly vfc 
language. 
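
As a rough illustration of rule (1), the two proportions can be counted directly from a text. The sketch below is not taken from the paper: the function names, the default vowel set "aeiou", and the choice to let the first letter follow the last (the wrap-around convention used for digram counts in Section 2) are assumptions made for the example.

```python
def vfc_proportions(text, vowels="aeiou"):
    """Return the two sides of rule (1) for a text.

    Left side:  vowel-vowel pairs / number of vowels.
    Right side: consonant-vowel pairs / number of consonants.
    Assumes the text contains at least one vowel and one consonant.
    """
    letters = [ch for ch in text.lower() if ch.isalpha()]
    n_vowels = sum(ch in vowels for ch in letters)
    n_consonants = len(letters) - n_vowels
    vv = cv = 0
    # Pair every letter with its successor; the first letter follows the last.
    for first, second in zip(letters, letters[1:] + letters[:1]):
        if second in vowels:
            if first in vowels:
                vv += 1
            else:
                cv += 1
    return vv / n_vowels, cv / n_consonants


def is_vfc(text, vowels="aeiou"):
    """True when the text satisfies the strict inequality in rule (1)."""
    left, right = vfc_proportions(text, vowels)
    return left < right
```

For instance, "banana" has no vowel-vowel pairs and every consonant is followed by a vowel, so it is a vfc text in this sense, while "aabb" gives equal proportions on both sides and is not.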


2. The Cryptanalyst’s Problem. A simple substitution cryptogram is a coded message in which 
the individual letters have been replaced by a permutation of themselves. When faced with such a 
message, a cryptanalyst might first count the occurrences of individual letters in the message and 
compare the frequencies with the known frequencies in typical uncoded text. However, a fairly 
long message is required before this technique has much chance of success. 

Another aid in breaking the code is a partitioning of the alphabet of the cryptogram into two 
subsets, representing the vowels and the consonants. In order that such a partition be plausible, it 
ought to satisfy the vfc rule (1). This is the main task which we wish to consider here—and is what 
we call the cryptanalyst’s problem. A trial and error solution, which tries all possible partitions 
until one satisfying the vfc rule is found, is clearly prohibitive. 

Let n be the number of letters in the alphabet. For any text, the digram frequency matrix is the 
n-by-n array A with $a_{ij}$ = the number of occurrences of the ith letter followed by the jth letter. 
Blanks and punctuation, if present, are ignored and the first letter of the text is assumed to follow 
the last letter. In general, the matrix is not symmetric, but for each i 

$$\sum_j a_{ij} = \sum_j a_{ji} = f_i,$$

where $f_i$ = the number of occurrences of the ith letter. 
For any proposed partitioning of the alphabet into vowels and consonants, two column 
vectors, v and c, can be defined by 


$v_i$ = 1 if the ith letter is a vowel, 0 otherwise, 
$c_i$ = 1 if the ith letter is a consonant, 0 otherwise. 


Note that v + c is a vector with all 1’s and that the inner product $v^T c$ is zero. Also note that the 
value of the quadratic form $v^T A v$ is the number of vowel-vowel pairs in the text. 
Using A, v and c, the vfc rule (1) can be stated 

$$\frac{v^T A v}{v^T A (v + c)} < \frac{c^T A v}{c^T A (v + c)}.$$

Cross-multiplying and cancelling the common term $(v^T A v)(c^T A v)$, we obtain 

$$(v^T A v)(c^T A c) - (v^T A c)(c^T A v) < 0. \tag{2}$$

The cryptanalyst’s problem, then, is: Given A, find a partitioning v and c so that (2) holds. 
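
The quantities in (2) are easy to compute once the digram frequency matrix is in hand. The following is a minimal sketch, assuming a 26-letter Latin alphabet; the names `digram_matrix`, `bilinear`, and `vfc_D` are hypothetical helpers introduced only for this illustration.

```python
import string

ALPHABET = string.ascii_lowercase  # assumption: 26-letter Latin alphabet


def digram_matrix(text, alphabet=ALPHABET):
    """A[i][j] = number of times letter i is immediately followed by letter j."""
    index = {ch: i for i, ch in enumerate(alphabet)}
    letters = [index[ch] for ch in text.lower() if ch in index]
    n = len(alphabet)
    A = [[0] * n for _ in range(n)]
    # Blanks and punctuation are ignored; the first letter follows the last.
    for i, j in zip(letters, letters[1:] + letters[:1]):
        A[i][j] += 1
    return A


def bilinear(u, A, w):
    """The scalar u^T A w for 0/1 vectors u and w."""
    return sum(u[i] * A[i][j] * w[j]
               for i in range(len(u)) for j in range(len(w)))


def vfc_D(A, vowels, alphabet=ALPHABET):
    """Left side of (2) for the partition that calls `vowels` the vowels."""
    v = [1 if ch in vowels else 0 for ch in alphabet]
    c = [1 - vi for vi in v]
    return (bilinear(v, A, v) * bilinear(c, A, c)
            - bilinear(v, A, c) * bilinear(c, A, v))
```

A proposed partition satisfies the vfc rule exactly when `vfc_D` returns a negative number. Because of the wrap-around, each row sum of the matrix equals the matching column sum, as noted in the text.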


3. The Singular Value Decomposition. The singular value decomposition, or SVD, is a matrix 
factorization which numerical analysts use in a wide variety of ways. Although its primary uses are 
in the analysis of systems of simultaneous linear equations and in the computation of pseudoinverses, 
we will use it here to obtain “simple” approximations to the digram frequency matrix. For 
this purpose, we express the SVD as a sum of rank one matrices of the form 

$$A = \sigma_1 x_1 y_1^T + \sigma_2 x_2 y_2^T + \cdots + \sigma_n x_n y_n^T, \tag{3}$$

where 

$$\sigma_1 \ge \sigma_2 \ge \cdots \ge \sigma_n \ge 0,$$

$$x_i^T x_j = \delta_{ij} \quad \text{(the Kronecker delta)},$$

$$y_i^T y_j = \delta_{ij}.$$

The coefficients $\sigma_i$ are known as the singular values and $x_i$ and $y_i$ are the left and right singular 
vectors, respectively. They can also be characterized in terms of the solutions to the symmetric 
eigenvalue problem 

$$\begin{pmatrix} 0 & A \\ A^T & 0 \end{pmatrix} \begin{pmatrix} x_j \\ y_j \end{pmatrix} = \sigma_j \begin{pmatrix} x_j \\ y_j \end{pmatrix}.$$



It is not hard to see that the rank of a matrix is the number of nonzero singular values. (In fact, 
this is a particularly useful way to define rank.) 

There is a fast, reliable algorithm for computing the SVD [3], [6]. For a 26-by-26 matrix, the 
computation of the singular values and corresponding left and right vectors takes only a few 
seconds on a medium speed modern computer. 

The normalization of the vectors $x_i$ and $y_i$ ensures that all of the rank one matrices $x_i y_i^T$ have 
the same norm (that is, the sum of the squares of the elements). Consequently, the numerical 
importance of each term in the sum (3) can be measured by the size of the coefficient $\sigma_i$. If the $\sigma_i$ 
decrease fairly rapidly as i increases, then an accurate approximation to A can be obtained by 
truncating the series after only a few terms. Truncating the series after k nonzero terms provides a 
rank k approximation to A. Such approximations are related to those obtained by factor analysis 
and have a wide variety of applications. One unusual application is to digital image processing [1]. 
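
The truncation described above can be sketched with NumPy's `numpy.linalg.svd`, which returns the left vectors as columns of `U`, the singular values in decreasing order, and the right vectors as rows of `Vt`. The function names are hypothetical, and the routine is a modern stand-in for the subroutine cited in [3], [6], not that code itself.

```python
import numpy as np


def rank_k_approximation(A, k):
    """Sum of the first k terms sigma_i * x_i * y_i^T of expansion (3)."""
    U, s, Vt = np.linalg.svd(np.asarray(A, dtype=float))
    # Columns of U are the left vectors x_i; rows of Vt are the right vectors y_i^T.
    return (U[:, :k] * s[:k]) @ Vt[:k, :]


def truncation_error(A, k):
    """Frobenius norm of A minus its rank-k approximation."""
    A = np.asarray(A, dtype=float)
    return np.linalg.norm(A - rank_k_approximation(A, k))
```

For the diagonal matrix with entries 3 and 1, the rank one approximation keeps the 3 and drops the 1, so the truncation error is 1, the size of the discarded singular value.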


4. Rank One Approximation. A rough approximation to a digram frequency matrix can be 
obtained by taking only the first term in its SVD: 

$$A \approx A_1 = \sigma_1 x_1 y_1^T.$$

Let e be the vector of all 1’s and let f be the vector with $f_i$ = the number of occurrences of the ith 
letter in the text. Then $Ae = A^T e = f$ and so 

$$\sigma_1 (y_1^T e)\, x_1 \approx \sigma_1 (x_1^T e)\, y_1 \approx f.$$

Consequently, if we were to assume that the digram frequency matrix were only rank one, we 
would conclude that it was symmetric and that each row and column was proportional to the 
frequency vector f. 
Of course, in practice, A is not rank one. Nevertheless, the first left and right singular vectors 
tend to be approximately equal and reflect the frequencies of the letters in the text. 
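
One way to check this numerically is to measure the angle between the frequency vector and each of the first singular vectors. The sketch below assumes NumPy; the function name is hypothetical, and the sign flip is needed only because an SVD routine determines $x_1$ and $y_1$ up to a common sign.

```python
import numpy as np


def first_vectors_vs_frequency(A):
    """Cosines of the angles between f and the first singular vectors x_1, y_1."""
    A = np.asarray(A, dtype=float)
    U, s, Vt = np.linalg.svd(A)
    x1, y1 = U[:, 0], Vt[0, :]
    # The SVD fixes x_1 and y_1 only up to a common sign; make them nonnegative.
    if x1.sum() < 0:
        x1, y1 = -x1, -y1
    f = A.sum(axis=1)  # row sums: the letter frequencies
    f_unit = f / np.linalg.norm(f)
    return float(x1 @ f_unit), float(y1 @ f_unit)
```

For a matrix that really is rank one and symmetric, such as the outer product of a vector with itself, both cosines equal 1. For a real digram matrix they should merely be close to 1.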


5. Rank Two Approximation. The rank two approximation obtained from the first two terms 
of the SVD is the simplest approximation which takes into account the correlation between pairs 
of letters in the text. The second left and right singular vectors contain the key to the solution of 
the cryptanalyst’s problem. 

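Assume that A is approximated by a matrix of rank two, so that 

$$A \approx A_2 = \sigma_1 x_1 y_1^T + \sigma_2 x_2 y_2^T. \tag{4}$$

We propose to use the signs of the components of $x_2$ and $y_2$ to partition the alphabet as follows: 

$$\begin{aligned}
v_i &= \begin{cases} 1 & \text{if } x_{i2} > 0 \text{ and } y_{i2} < 0, \\ 0 & \text{otherwise,} \end{cases} \\
c_i &= \begin{cases} 1 & \text{if } x_{i2} < 0 \text{ and } y_{i2} > 0, \\ 0 & \text{otherwise,} \end{cases} \\
n_i &= \begin{cases} 1 & \text{if } \operatorname{sign}(x_{i2}) = \operatorname{sign}(y_{i2}), \\ 0 & \text{otherwise.} \end{cases}
\end{aligned} \tag{5}$$
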
The third category—the “neuter” letters—are the ones that cannot be classified as either vowels 
or consonants. It turns out in practice that few letters fall into this category. In English text, for 
example, the letter h is usually neuter. It might be possible to consider a finer partition involving 
“left vowel, right consonant” and so on, but we have not pursued this idea. 
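
The sign rule for v, c, and n given above can be sketched as follows, assuming NumPy. The function and key names are hypothetical. Since an SVD routine may return $(-x_2, -y_2)$ in place of $(x_2, y_2)$, the two sign classes are reported under neutral names, and deciding which class holds the vowels is left to the analyst.

```python
import numpy as np


def classify_letters(A, alphabet):
    """Partition the alphabet by the signs of the second singular vectors."""
    A = np.asarray(A, dtype=float)
    U, s, Vt = np.linalg.svd(A)
    x2, y2 = U[:, 1], Vt[1, :]
    # A letter that never occurs has a zero row and a zero column.
    present = (A.sum(axis=1) + A.sum(axis=0)) > 0
    groups = {"plus_minus": [], "minus_plus": [], "neuter": [], "absent": []}
    for letter, x, y, seen in zip(alphabet, x2, y2, present):
        if not seen:
            groups["absent"].append(letter)
        elif x > 0 and y < 0:
            groups["plus_minus"].append(letter)
        elif x < 0 and y > 0:
            groups["minus_plus"].append(letter)
        else:
            groups["neuter"].append(letter)  # the two signs agree
    return groups
```

As a small check, take a four-letter alphabet a, e, b, d whose digram counts are 2 within {a, e} and within {b, d}, and 4 between the two sets. The matrix has singular values 12 and 4, and the second singular vectors separate {a, e} from {b, d} with no neuter letters.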

Using these definitions, we obtain a solution to the cryptanalyst’s problem as follows. 


THEOREM. Let $A = A_2$ be a nonnegative rank 2 matrix with the SVD expansion in (4). Let v and 
c be defined by (5). Then the vfc rule, 

$$D = (v^T A v)(c^T A c) - (v^T A c)(c^T A v) < 0, \tag{6}$$

is satisfied. 


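Proof. Since A is a nonnegative matrix, it follows from the Perron-Frobenius theorem [4] that 
$x_1$ and $y_1$ have nonnegative components. Placing (4) in (2) and expanding produces eight terms. 
The two terms involving only subscript 1 cancel, and so do the two terms involving only subscript 2: 

$$\begin{aligned}
D = \sigma_1 \sigma_2 \bigl( & v^T x_1\, y_1^T v\; c^T x_2\, y_2^T c + v^T x_2\, y_2^T v\; c^T x_1\, y_1^T c \\
& - v^T x_1\, y_1^T c\; c^T x_2\, y_2^T v - v^T x_2\, y_2^T c\; c^T x_1\, y_1^T v \bigr).
\end{aligned}$$

Of all the different inner products appearing in this expression, only two, namely $y_2^T v$ and 
$c^T x_2$, are negative. The first two products each contain exactly one negative factor, while the 
last two, which are subtracted, contain two and none respectively. Consequently, all four signed 
terms in the parentheses are negative and D is negative. 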
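
The theorem can be checked numerically on a small nonnegative matrix of rank two. This is a sketch assuming NumPy, with a hypothetical function name; it forms v and c from the signs of the second singular vectors and evaluates D directly from A.

```python
import numpy as np


def vfc_D_from_second_vectors(A):
    """Evaluate D in (6) with v and c taken from the signs of x_2 and y_2."""
    A = np.asarray(A, dtype=float)
    U, s, Vt = np.linalg.svd(A)
    x2, y2 = U[:, 1], Vt[1, :]
    v = ((x2 > 0) & (y2 < 0)).astype(float)  # "vowel" indicator
    c = ((x2 < 0) & (y2 > 0)).astype(float)  # "consonant" indicator
    return (v @ A @ v) * (c @ A @ c) - (v @ A @ c) * (c @ A @ v)
```

D is unchanged when v and c are interchanged, so the sign ambiguity of the SVD does not affect the result. For the 4-by-4 matrix with entries 2 on the two diagonal 2-by-2 blocks and 4 elsewhere, the four bilinear forms are 8, 8, 16, and 16, giving D = 64 - 256 = -192.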


6. Effects of Encipherment. When a message M is encoded by a simple substitution cipher c 
(where c is a permutation of the integers 1 to n), each occurrence of the ith letter $u_i$ is replaced by 
the c(i)th letter, $u_{c(i)}$. The resulting cryptogram is called $M_c$. If A is the digram frequency matrix 
of M, and C is the permutation matrix $C = (\delta_{c(i),j})$ which has in its ith row the c(i)th row of the 
identity matrix, then it is not difficult to show that the digram frequency matrix of $M_c$ is $CAC^T$. 
This matrix has in row c(i), column c(j), the frequency of the digram $u_i u_j$ in M, which it 
represents. It follows that the singular values of $CAC^T$ are the same as those of A, and the jth left 
and right singular vectors are, respectively, $Cx_j$ and $Cy_j$. These have the same coefficients as $x_j$ 
and $y_j$, but they are permuted by C; the coefficients appearing in row i in $x_j$ and $y_j$ appear in row 
c(i) in $Cx_j$ and $Cy_j$. If the scheme described above, applied to A, classifies the letter $u_i$ as a vowel 
in M, then applied to $CAC^T$, it will classify $u_{c(i)}$, the encoding of $u_i$, as a vowel in $M_c$. 
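
The effect of a simple substitution on the digram matrix can be verified directly. The sketch below assumes NumPy and a 26-letter alphabet, with hypothetical helper names. It takes C to be the matrix that moves entry i of a vector to entry c(i), which is the convention under which row c(i), column c(j) of the transformed matrix holds the count for the digram $u_i u_j$.

```python
import string
import numpy as np

ALPHABET = string.ascii_lowercase  # assumption: 26-letter Latin alphabet


def digram_matrix(text):
    """26-by-26 digram counts; the first letter is taken to follow the last."""
    idx = [ALPHABET.index(ch) for ch in text.lower() if ch in ALPHABET]
    A = np.zeros((26, 26), dtype=int)
    for i, j in zip(idx, idx[1:] + idx[:1]):
        A[i, j] += 1
    return A


def encipher(text, perm):
    """Replace letter number i by letter number perm[i]; keep other characters."""
    return "".join(ALPHABET[perm[ALPHABET.index(ch)]] if ch in ALPHABET else ch
                   for ch in text.lower())


def permutation_matrix(perm):
    """Matrix C with C[perm[i], i] = 1, so that (C x)[perm[i]] = x[i]."""
    C = np.zeros((len(perm), len(perm)), dtype=int)
    for i, target in enumerate(perm):
        C[target, i] = 1
    return C
```

With any permutation `perm` of 0, ..., 25, the digram matrix of `encipher(text, perm)` should equal `C @ A @ C.T`, and the two matrices should share their singular values.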

Simple substitution ciphers are, of course, very simple ciphers, and no self-respecting cryptanalyst 
regards them as a challenge. A more sophisticated cipher is the k-alphabetic cipher (k is a 
positive integer). In this cipher, k permutations, $c_1, c_2, \ldots, c_k$, are used in cyclic fashion to encode 
the letters in M to produce the cryptogram $M_c$. A letter is said to be in position p in M 
($1 \le p \le k$) if it is the mth letter of M and m is congruent to p modulo k. If the letter $u_i$ occurs in 
position p, it is encoded as $u_{c_p(i)}$. Thus, the encoding of a letter depends not only on what letter it 
is, but also on its position in M. 

The digram frequency matrix of a cryptogram encoded by a k-alphabetic cipher is of little use in 
decoding it, since the various occurrences of a digram in $M_c$ do not represent the same digram in 
M if they do not occur in the same position. Cryptanalysts have some clever ways to deduce the 
probable value of the cycle length k. See, for example, Gaines [2] or Sinkov [5]. Armed with this 
information, a cryptanalyst can calculate k digram frequency matrices, $A_p = (a_{ij}^{(p)})$, where $a_{ij}^{(p)}$ is 
the frequency with which the digram $u_i u_j$ occurs in positions p, p + 1 modulo k. All of these 
occurrences are encoded into the same digram $u_{c_p(i)} u_{c_{p+1}(j)}$ in the cryptogram $M_c$. By an argument 
similar to that given above for the simple substitution, if $A_p$ is the digram frequency matrix for 
digrams occurring in position p, p + 1 in M, then $C_p A_p C_{p+1}^T$ is the corresponding digram 
frequency matrix for the cryptogram $M_c$. It has the same coefficients as $A_p$, but its rows are 
permuted by $C_p$, and its columns by $C_{p+1}$. Its singular values are the same as those of $A_p$, and its 
jth left and right singular vectors are, respectively, $C_p x_{pj}$ and $C_{p+1} y_{pj}$, where $x_{pj}$ and $y_{pj}$ are those 
of $A_p$. If the classification scheme described above, applied to $x_{p2}$ and $y_{(p-1)2}$, classifies $u_i$ as a 
vowel, then it will also, applied to $C_p x_{p2}$ and $C_p y_{(p-1)2}$, classify its encoding $u_{c_p(i)}$ as a vowel. 


FIG. 1. The digram frequency matrix for Lincoln’s Gettysburg Address. 

If this is done for each p, then the cryptanalyst has, for each p, a classification of the letters in 
position p of the cryptogram, into vowels and consonants. The classification is, of course, only as 
reliable as it would have been if it had been applied to the original message M, or, more 
particularly, to those subsequences of M consisting of the digrams occurring in a particular 
position p, p + 1. If these are representative samples of the digrams occurring in the language of 
M, then the classification is as reliable as it is when applied to equally representative plaintexts, as 
we have done in sections 7 and 8. 
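
The position-by-position digram matrices can be tallied as in the following sketch, which assumes NumPy and a 26-letter alphabet and uses a hypothetical function name. Unlike the single-matrix case, no wrap-around pair is counted here, since the last and first letters need not fall in consecutive positions modulo k.

```python
import string
import numpy as np

ALPHABET = string.ascii_lowercase  # assumption: 26-letter Latin alphabet


def positional_digram_matrices(text, k):
    """Return [A_1, ..., A_k]; A_p counts digrams whose first letter is in position p."""
    idx = [ALPHABET.index(ch) for ch in text.lower() if ch in ALPHABET]
    matrices = [np.zeros((26, 26), dtype=int) for _ in range(k)]
    # m is the 0-based index of the first letter, so its position is (m mod k) + 1
    # and its matrix sits at list index m mod k.
    for m, (i, j) in enumerate(zip(idx, idx[1:])):
        matrices[m % k][i, j] += 1
    return matrices
```

Applied to a cryptogram with known cycle length k, each of the k matrices can then be passed separately to the singular value analysis.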


 i    σ_i
 1    81.7256 
 2    53.5189 
 3    45.1604 
 4    31.1073 
 5    19.9258 
 6    18.8196 
 7    16.8777 
 8    12.1174 
 9    10.1647 
10     9.4667 
11     7.6957 
12     6.2491 
13     4.7882 
14     2.7120 
15     2.1048 
16     1.7556 
17     1.6395 
18     0.9360 
19     0.8129 
20     0.4996 
21     0.3232 
22     0.2069 
23     0.0148 
24     0.0000 
25     0.0 
26     0.0 


FIG. 2. The singular values. 


7. Experimental Results. Fig. 1 is the digram frequency matrix for Lincoln’s Gettysburg 
Address, a text of 1,148 characters. Notice, for example, that “th,” with 47 occurrences, is the 
most frequent pair, that “q” occurs only once, and that “j,” “x” and “z” do not occur at all. 

Fig. 2 gives the singular values of the matrix in Fig. 1. Since three letters are missing, the matrix 
has rank 23 at most, and 3 of the singular values should be zero. The subroutine finds two exact 
zero values and one value the size of roundoff error on the computer. (Exact zeros are printed as 
0.0, while numbers less than $10^{-5}$ are printed as 0.00000.) 

It certainly cannot be claimed that the singular values decrease rapidly. In fact, the rank two 
approximation only vaguely resembles the original matrix. Nevertheless, useful information can be 
obtained from the first two pairs of singular vectors. 
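
The link between missing letters and zero singular values can be reproduced on any text. The sketch below assumes NumPy and a 26-letter alphabet; `rank_report` is a hypothetical helper, and `numpy.linalg.matrix_rank` counts the singular values above a roundoff-sized tolerance.

```python
import string
import numpy as np

ALPHABET = string.ascii_lowercase  # assumption: 26-letter Latin alphabet


def digram_matrix(text):
    """26-by-26 digram counts; the first letter is taken to follow the last."""
    idx = [ALPHABET.index(ch) for ch in text.lower() if ch in ALPHABET]
    A = np.zeros((26, 26), dtype=int)
    for i, j in zip(idx, idx[1:] + idx[:1]):
        A[i, j] += 1
    return A


def rank_report(text):
    """Return (number of absent letters, numerical rank, singular values)."""
    A = digram_matrix(text)
    absent = int(np.sum(A.sum(axis=1) == 0))  # letters with a zero row
    sigma = np.linalg.svd(A, compute_uv=False)
    return absent, int(np.linalg.matrix_rank(A)), sigma
```

The rank can never exceed 26 minus the number of absent letters, and it may be smaller. For the word "banana", 23 letters are absent, yet the rank is only 2 because the rows for b and n are proportional.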

Fig. 3 shows the first singular vectors $x_1$ and $y_1$, together with the frequency vector f. It can be 
seen that the components of the two singular vectors are roughly equal, and are roughly 
proportional to the components of the frequency vector. Thus, even though the matrix is not 
particularly well approximated by the first term of its SVD, the singular vectors still retain the 
properties predicted by the rank one theory. 


Letter     x_1       y_1       f
A        0.3275    0.3221    102
B        0.0470    0.0441     14
C        0.1200    0.1135     31
D        0.2011    0.2259     58
E        0.4394    0.4517    165
F        0.0875    0.1065     27
G        0.0966    0.0799     28
H        0.3481    0.3378     80
I        0.1830    0.2339     68
J        0.0000    0.0000      0
K        0.0099    0.0097      3
L        0.1203    0.1218     42
M        0.0607    0.0468     13
N        0.2165    0.2435     77
O        0.2387    0.2563     93
P        0.0522    0.0564     15
Q        0.0007    0.0054      1
R        0.2954    0.2493     79
S        0.1683    0.1391     44
T        0.4453    0.4367    126
U        0.0532    0.0597     21
V        0.1167    0.0817     24
W        0.1210    0.1044     28
X        0.0       0.0         0
Y        0.0339    0.0344     10
Z        0.0       0.0         0

FIG. 3. The first right and left singular vectors and the letter frequency vector. 


Letter     x_2       y_2
A       -0.5097    0.1574
B        0.0412   -0.0386
C        0.0719   -0.0788
D        0.1436   -0.2163
E       -0.3306    0.5305
F       -0.0006   -0.0198
G        0.0521   -0.0623
H        0.3877    0.3293
I       -0.2070    0.1791
J        0.0000   -0.0000
K        0.0033   -0.0049
L        0.0298   -0.0853
M        0.0634   -0.0613
N       -0.0649   -0.3983
O       -0.3891    0.1070
P        0.0385   -0.0535
Q       -0.0010   -0.0062
R        0.1692   -0.3402
S        0.0801   -0.1462
T        0.3785   -0.3878
U       -0.0860   -0.0516
V        0.1884   -0.1405
W        0.1559   -0.0112
X        0.0       0.0
Y       -0.0045   -0.0215
Z        0.0       0.0

FIG. 4. The second right and left singular vectors. The sign patterns identify vowels and consonants. 


Fig. 4 shows the second singular vectors $x_2$ and $y_2$. The alternating sign patterns predicted by 
the rank two theory are clearly evident. Figs. 5 and 6 give a graphical summary of the quantitative 
information in Fig. 4 by plotting each letter at a point in the two-dimensional plane determined 
by its components in $x_2$ and $y_2$. Thus, “a” is plotted at coordinates (-0.5097, 0.1574), “b” at 
coordinates (0.0412, -0.0386), and so on. The more frequent letters are in Fig. 5 and the less 
frequent letters in Fig. 6. The box in both figures has corners at (±0.1, ±0.1). 

Since they fall in the same quadrant (the second), the letters “a,” “e,” “i” and “o” should all 
be classified as either vowels or consonants, and we have, of course, chosen to call them vowels. 

The letters “h,” “n,” “u” and “y” must be called neuter because the corresponding signs in the 
two vectors agree. The letter “q” occurred only once. The classification of q as neuter is 
nevertheless interesting. Its one occurrence is in the word “equal.” One might expect it to be 
classified as a consonant, since it occurs between two vowels. Note, however, that the algorithm 
does not recognize u as a vowel. It is classified as neuter. This is, no doubt, attributable to the very 
high frequency of the digram ou, which accounts for 7 of the 21 occurrences of u. The letter “h” 
clearly shows a tendency to be followed by a vowel, and to be preceded by a consonant. The letter 
“n” shows a weak tendency to be followed by a consonant and a strong tendency to be preceded 
by a vowel. The other three neuter letters occur infrequently. The letters “j,” “x,” and “z” are seen 
not to occur at all. The remaining 14 letters are classified as consonants. 


FIG. 5. The more frequent letters, plotted with coordinates from Fig. 4. 

If a simple substitution cryptogram were made from text such as the Gettysburg Address, the 
digram matrix A would be replaced by $PAP^T$ for some unknown permutation matrix P. As shown 
in Section 6, the singular vectors of the transformed matrix would classify the encoding of each 
letter in the same way as $x_2$ and $y_2$ classified the letters of the plaintext. 


8. Other Languages. The same experiment was performed on texts of approximately one 
thousand characters each, written in five other languages selected for their diversity. The 
classifications of letters resulting from these tests are tabulated below. 


Language   Vowels   Consonants      Neuter      Absent
Hawaiian   AEIOU    HKLMNPW         (none)      BCDFGJQRSTVXYZ
Japanese   AEIOU    BDGHKMNRSTWYZ   (none)      CFJLPQVX
German     AEFOPU   BHIKLNRV        CDGJMSTWZ   QXY
Spanish    AEO      CDFGLMNQRSXYZ   BHIJPTUV    KW
Finnish    AEIOUY   DHJKLMNPRSTV    B           CFGQWXZ


FIG. 6. The less frequent letters. The scale is 5 times that of Fig. 5. 


None of the discrepancies is very surprising. Several of them are associated with letters which 
occurred so infrequently that no significance can be attached to their classification. In the Finnish 
example, B occurred only once, at the beginning of a word of foreign origin. It followed a final N 
in the preceding word and preceded an E. In the German text, P occurred only once, and that 
occurrence was in the very Germanic trigram SPR. Occurring only between two consonants, it is 
not surprising that the analysis classified it as a vowel. The most common neighbor of F, on both 
sides, in the selected sample, is R. The classification of I is confused because of the very high 
frequency of the digrams IE and EI. More generally, the German language does not share the 
aversion of the other languages to consecutive consonants, and consequently many German letters 
fall into the neuter category. The classification of Y as a vowel in the Finnish sample is not 
surprising; Y has a value in Finnish that is hardly distinguishable to a non-Finnish ear, from that 
of U. The neuter classification of I and U in the Spanish example is clearly attributable to the high 
frequency of vowel-vowel digrams in which U or I is the first letter, having a value equivalent to 
W or Y in English. The perfect performance of the algorithm in Japanese and Hawaiian is clearly 
the result of their rigorously observed exclusion of consonant-consonant digrams. Most of the 
Japanese Hiragana characters are transliterated as a consonant-vowel digram. 


9. Conclusions. The second singular vectors in the singular value decomposition of the digram 
frequency matrix provide the cryptanalyst with a helpful and surprisingly reliable way to classify 
the letters in a cryptogram as vowels or consonants, if the encoding algorithm is simple 
substitution or k-alphabetic substitution, with k known, and if the text is a vfc text. Texts written in 
many natural languages, with certain exceptions, tend to be vfc texts. Near-exceptions are German 
(and, presumably, other Germanic languages) in which the vfc character is somewhat diminished 
by the frequency of consonant-consonant digrams, and Spanish (and, presumably, other Romance 
languages) in which certain vowel-vowel pairs are frequent. Despite these deviations from the vfc 
rule, the second singular vectors classify correctly most of the letters which occur with a frequency 
high enough to be statistically significant. 

A computer program which uses the SVD as the starting point in an automated, heuristic 
approach to solving cryptograms is described by Schatz [7]. 


WHAT IS THE GEOMETRY OF A SURFACE? 


ROGER FENN 
School of Mathematics and Physical Sciences, University of Sussex, Falmer, Brighton BN1 9QH, England 


1. In this article we shall assume the definition of a geometry proposed by Klein in his 
Erlangen program, that is, a geometry is a group of transformations of space together with the 
propositions left invariant by this group. The space in question will be the two-dimensional plane. 
By a surface we shall mean a closed compact orientable surface of genus γ, that is, a sphere with γ 
handles attached. The fact that two such surfaces are homeomorphic if and only if their genera are 
equal was probably known to Riemann. A modern proof can be found in Massey’s book [5, 
Chapter 1]. 

The exact result which will be proved is the following: 


THEOREM 1. Let γ > 1. Then there is a group of hyperbolic translations of the hyperbolic plane 
such that the space of orbits under these translations is homeomorphic to a surface of genus γ. 

Moreover, there is a compact polygon in the plane whose translates under the group are distinct for 
distinct group elements and which form a network of nonoverlapping polygons covering the whole 
hyperbolic plane. 


Our aim is to prove the above by means as elementary as possible, in particular without using 
deep results from the theory of functions. 
In order to illustrate the general result, we consider firstly the case where γ = 1. Here the 
geometry is Euclidean and the surface is a torus. 

In the Euclidean plane let a and b be the translations given in coordinate form by a(x, y) = (x 
+ 1, y) and b(x, y) = (x, y + 1). Then a and b generate a group T called the torus group. This is 
the free Abelian group with basis {a, b}. Let $S_1$ be the quotient space under the action of T. That 
is, (x, y) and (x′, y′) are to be identified in $S_1$ if and only if (x′, y′) = g(x, y) for some g ∈ T. 
Equivalently, (x, y) and (x′, y′) are identified if and only if x − x′ and y − y′ are integers. Then 
$S_1$ is obtained from the unit square 0 ≤ x ≤ 1, 0 ≤ y ≤ 1 by identifying according to the rule 
indicated in Figure 1. 


[Figure 1: the unit square with vertices O, aO, bO, and abO.] 

FIG. 1. The fundamental region for the torus group. 


So $S_1$ is a torus. 
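
The identification under the torus group can be made concrete with exact rational arithmetic. The sketch below uses Python's `fractions` module; the function names are hypothetical, and points are represented as pairs of `Fraction` values so that the integrality test is exact.

```python
from fractions import Fraction
from math import floor


def same_orbit(p, q):
    """True when p and q differ by an element of the torus group T."""
    dx, dy = p[0] - q[0], p[1] - q[1]
    # Both coordinate differences must be integers.
    return dx == floor(dx) and dy == floor(dy)


def representative(p):
    """The point of the half-open unit square lying in the orbit of p."""
    return (p[0] - floor(p[0]), p[1] - floor(p[1]))
```

For example, (5/2, -2/3) differs from (1/2, 1/3) by the integer vector (2, -1), so the two points are identified, and (1/2, 1/3) is the representative of both in the unit square.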


In technical language the universal cover of the torus is the Euclidean plane and the group of 
covering translations corresponding to the fundamental group is generated by the Euclidean 
isometries a and b. The square in Figure 1 is called a fundamental region. The translates of the 
fundamental region by the covering translations cover the Euclidean plane in a network of 
nonoverlapping squares all of the same size: see Figure 2. 


FIG. 2. The translates of the fundamental region by the torus group. 

If γ = 0, then the surface is a sphere and the geometry is spherical or elliptic. We shall not 
consider this case here. The interested reader can read the books by Coxeter or Guggenheimer [2], 
[3]. 

The idea in the sequel, which was known to Poincaré, Klein and Fricke, is to generalize these 
notions so that the torus is replaced by a surface of genus γ > 1 and the Euclidean plane is 
replaced by the hyperbolic plane. The fundamental region will become a hyperbolic polygon with 
4γ sides and the Euclidean translations will become hyperbolic translations which cover the 
hyperbolic plane with an infinite network of copies of the fundamental polygon. 


2. A Quick Review of Hyperbolic Geometry. In this section a brief description of the 
hyperbolic plane will be presented. By and large, proofs will be omitted. There are many excellent 
books on hyperbolic geometry to which the interested reader may refer for more details. A 
suitable reference might be Coxeter [2] or the appropriate chapter of Caratheodory [1, 80-88]. 

The model used here is due to Poincaré. Let $\mathcal{H}$ be the interior of the unit disc in the complex 
plane $\mathbb{C}$. The points of $\mathcal{H}$ are the points of the hyperbolic plane. The points on the unit circle $S^1$ 
are often called the points at infinity. The hyperbolic lines, or $\mathcal{H}$-lines for short, are the arcs in $\mathcal{H}$ of 
circles orthogonal to $S^1$. Also included as $\mathcal{H}$-lines are diameters of $\mathcal{H}$. 

Two points A, B of $\mathcal{H}$ determine a unique $\mathcal{H}$-line. Write AB for the $\mathcal{H}$-arc joining them. This 
notation will also be used if one or both of A, B lie at infinity. If two $\mathcal{H}$-lines meet at A inside $\mathcal{H}$, 
their circle extensions also meet at the inverse point $1/\bar{A}$ outside $\mathcal{H}$. 

If two $\mathcal{H}$-lines meet only at infinity, they are called parallel. If two $\mathcal{H}$-lines fail to meet, they 
are sometimes called ultraparallel. 

If A, B, C, D are four points of C, their cross ratio is defined to be

\[ \delta(A, B, C, D) = \frac{(A - C)(B - D)}{(A - D)(B - C)}. \]

Let A, B lie in ℋ. The line through A and B meets the circle at infinity in two more points
C, D: see Figure 3.

FIG. 3.

The cross ratio of these points, lying on a circle, is real and positive. Define the hyperbolic
distance between A and B by

\[ d(A, B) = \log \delta(A, B, C, D). \]
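The definition of distance can be tried out numerically. The following is a minimal sketch, not part of the original article: the function names are illustrative, the points at infinity are taken in the order D, A, B, C along the line (the order in which they are listed later in this section), and the closed form 2 artanh|(A − B)/(1 − ĀB)| used for comparison is the standard disc-model formula, which the article does not state.

```python
import cmath
import math


def cross_ratio(a, b, c, d):
    """delta(A, B, C, D) = (A - C)(B - D) / ((A - D)(B - C))."""
    return (a - c) * (b - d) / ((a - d) * (b - c))


def ends_at_infinity(a, b):
    """Return (C, D), the points at infinity of the line through a and b.

    The points are ordered D, a, b, C along the line (assumed convention).
    Requires a != b, both inside the unit disc.
    """
    # Move a to the origin with the isometry Z -> (Z - a) / (1 - conj(a) Z);
    # the line through a and b then becomes a diameter.
    b0 = (b - a) / (1 - a.conjugate() * b)
    u = b0 / abs(b0)  # unit vector pointing from 0 towards the image of b

    def back(w):
        # Inverse isometry, carrying the diameter back to the original line.
        return (w + a) / (1 + a.conjugate() * w)

    return back(u), back(-u)


def hyperbolic_distance(a, b):
    """d(A, B) = log delta(A, B, C, D), with C, D as above."""
    c, d = ends_at_infinity(a, b)
    # The cross ratio is real up to rounding error, so keep the real part.
    return math.log(cross_ratio(a, b, c, d).real)


def distance_closed_form(a, b):
    """Standard closed form for the same distance (not from the article)."""
    return 2.0 * math.atanh(abs((a - b) / (1 - a.conjugate() * b)))


if __name__ == "__main__":
    a, b = 0.3 + 0.1j, -0.2 + 0.5j
    print(hyperbolic_distance(a, b), distance_closed_form(a, b))
```

Swapping C and D inverts the cross ratio and changes the sign of the logarithm, which is why the ordering of the points at infinity has to be fixed.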


The angle between two ℋ-lines is measured in this model in exactly the same way as the
ordinary Euclidean angle. It is an important fact in the history of noneuclidean geometry that the
three angles of a triangle have sum less than π. This is a consequence of the Gauss-Bonnet
theorem and the fact that the hyperbolic plane has constant negative curvature. It is also useful to
consider ideal triangles in which one or more vertices are at infinity. In this manner polygons with
angles equal to zero can be considered.


FIG. 4. The angle Π(x).

Let NM be an ℋ-line where N, M are at infinity and let A be a point of ℋ not on the line NM.
Let B be the foot of the perpendicular from A to NM. If AM is one of the ℋ-lines through A
parallel to NM and x is the distance AB, let Π(x) denote the angle BAM: see Figure 4. The
function Π(x) is called the angle of parallelism. For Euclidean geometry this would always be
π/2. Analytically Π(x) = 2 tan⁻¹(e⁻ˣ).
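As a small illustration of the formula for the angle of parallelism, the sketch below (the function name is an assumption) evaluates Π(x) and shows that it starts at the Euclidean value π/2 when x = 0 and decreases towards zero as the point A moves away from the line.

```python
import math


def angle_of_parallelism(x):
    """Pi(x) = 2 * arctan(exp(-x)) for a hyperbolic distance x >= 0."""
    return 2.0 * math.atan(math.exp(-x))


if __name__ == "__main__":
    for x in (0.0, 0.5, 1.0, 2.0, 5.0):
        print(x, angle_of_parallelism(x))
```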

Transformations of the hyperbolic plane which preserve orientation and arc length are called
(orientation preserving) isometries. They will also preserve area, angles and the sense of angles. All
such isometries have the analytical form

\[ Z \to W = e^{i\varphi}\,\frac{A - Z}{1 - \bar{A} Z}, \qquad |A| < 1, \]

and hence can be thought of as acting on the whole complex plane C. They constitute the group of
bilinear transformations

\[ W = \frac{\alpha Z + \beta}{\gamma Z + \delta}, \qquad \alpha\delta - \beta\gamma \neq 0, \]

which preserve the unit disc |Z| ≤ 1 and hence its interior ℋ. There is a unique bilinear
transformation taking the points A, B, C to P, Q, R given by

\[ \frac{(W - Q)(P - R)}{(W - R)(P - Q)} = \frac{(Z - B)(A - C)}{(Z - C)(A - B)}. \]

The analytic form expresses the fact that the cross ratio δ(Z, A, B, C) is preserved. The
uniqueness follows from the fact that the only bilinear transformation with three fixed points is
the identity.

Let A, B and A′, B′ be two pairs of points with d(A, B) = d(A′, B′). Let C, D and C′, D′ be
the corresponding points at infinity as in Figure 3. Then the unique bilinear transformation which
takes A, B, C into A′, B′, C′ preserves cross ratios and so takes D into D′. This isometry takes the
circle through D, C perpendicular to DABC into the circle through D′, C′ perpendicular to
D′A′B′C′. But both circles are the same, namely the circle at infinity. Consequently, there is a
unique isometry of ℋ taking two given points to two other given points the same distance apart.

The discriminant Δ of an isometry is given by

\[ \Delta = \frac{\sin^2(\varphi/2)}{1 - A\bar{A}}. \]

The value of Δ divides the set of isometries into three classes.

Elliptic isometries, or rotations, Δ < 1

These have two fixed points p and 1/p̄ separated by the unit circle. An elliptic isometry can be
thought of as a rotation about the fixed point lying inside ℋ through an angle equal to 2 cos⁻¹ √Δ.
The set of orbits is the pencil of circles (and one line) with degenerate circles p and 1/p̄. The one
line is the perpendicular bisector of the two fixed points: see Figure 5.

The orthogonal pencil to the orbits is permuted by the isometry.

The elliptic isometry with fixed point A and angle of rotation φ is given by

\[ \frac{W - A}{W - 1/\bar{A}} = e^{i\varphi}\,\frac{Z - A}{Z - 1/\bar{A}}. \]

FIG. 5. Orbits of an elliptic isometry. FIG. 6. Orbits of a parabolic isometry.

Parabolic isometries, Δ = 1

These have a unique fixed point lying on the unit circle. The orbit set is the pencil of circles
tangent to S¹ at the fixed point. Circles touching the circle at infinity are called horocycles. Again
the orthogonal family is permuted by the isometry: see Figure 6.

A parabolic isometry is the “limit” of an elliptic isometry as the internal fixed point tends to
infinity.

A parabolic isometry with fixed point A is given by

\[ \frac{iA}{W - A} = \frac{iA}{Z - A} - k, \]

where k is an arbitrary nonzero real number.

Hyperbolic isometries or translations, Δ > 1

This case is dual to the elliptic case. Hyperbolic isometries have two fixed points lying on the
unit circle. The orbit set is the pencil of circles (and the one straight line) passing through these
fixed points. One of these circles defines an ℋ-line called the axis of the isometry: see Figure 7.
The other circles are sometimes called hypercycles or equidistant curves. They are the set of points
“equidistant” from the axis.

FIG. 7. Orbits of a hyperbolic isometry. The axis is dotted.

A hyperbolic isometry can be thought of as a translation along the axis through a distance
2 cosh⁻¹ √Δ.

One fixed point of a hyperbolic isometry is a source and the other a sink. This allows the
following useful characterization of hyperbolic isometries: let L_0 be the portion of the y axis in ℋ
oriented upwards in the usual manner. Then L_0 divides ℋ into two regions, a right and a left. In
general any oriented hyperbolic line L divides ℋ into two regions. Let h be any isometry which takes L into
L_0 and preserves the orientation of L and L_0. Call the region divided by L right if h takes it into
the right side of L_0. Similarly for left.

Suppose now that an isometry h takes the oriented line L_1 into the oriented line L_2 preserving
their orientation. The lines L_1 and L_2 are not to meet even at infinity, i.e., they are ultraparallel.
Let H_1 be the part of ℋ to the right of L_1. Define H_2 similarly. If H_1 ⊂ H_2 or H_2 ⊂ H_1, then h is
hyperbolic. In the first case the source lies in the closure of H_1; in the second case the sink lies in the closure of H_2.

Another useful characterization is that for all other types of isometry h, there are points Z and
hZ arbitrarily close to one another.

A hyperbolic isometry with fixed points A and B is given by

\[ \frac{W - A}{W - B} = k\,\frac{Z - A}{Z - B} \]

for some real k ≠ 0.
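The classification of this section can be checked numerically. The sketch below is illustrative only: the function names and the tolerance are assumptions. It evaluates the discriminant Δ = sin²(φ/2)/(1 − AĀ) for an isometry Z → e^{iφ}(A − Z)/(1 − ĀZ), names its class, and finds the fixed points by solving the quadratic ĀZ² − (1 + e^{iφ})Z + e^{iφ}A = 0 obtained by setting W = Z.

```python
import cmath
import math


def apply_isometry(phi, A, z):
    """W = e^{i phi} (A - Z) / (1 - conj(A) Z), with |A| < 1."""
    return cmath.exp(1j * phi) * (A - z) / (1 - A.conjugate() * z)


def discriminant(phi, A):
    """Delta = sin^2(phi / 2) / (1 - |A|^2)."""
    return math.sin(phi / 2.0) ** 2 / (1.0 - abs(A) ** 2)


def classify(phi, A, tol=1e-12):
    """Return 'elliptic', 'parabolic' or 'hyperbolic' (tol is an assumption)."""
    delta = discriminant(phi, A)
    if abs(delta - 1.0) < tol:
        return "parabolic"
    return "elliptic" if delta < 1.0 else "hyperbolic"


def fixed_points(phi, A):
    """Roots of conj(A) Z^2 - (1 + e^{i phi}) Z + e^{i phi} A = 0 (needs A != 0)."""
    e = cmath.exp(1j * phi)
    a, b, c = A.conjugate(), -(1 + e), e * A
    root = cmath.sqrt(b * b - 4 * a * c)
    return (-b + root) / (2 * a), (-b - root) / (2 * a)


if __name__ == "__main__":
    A = 0.5 + 0j
    for phi in (math.pi / 3, 2 * math.pi / 3, math.pi):
        print(classify(phi, A), [abs(z) for z in fixed_points(phi, A)])
```

In the elliptic case one fixed point has modulus less than 1 and the other greater than 1; in the hyperbolic case both lie on the unit circle, in agreement with the descriptions above.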


3. The Construction of the Fundamental Polygon P. To help understand the following the
reader should consult Figure 8, which illustrates the case n = 8. Let X_1, X_2, ..., X_n be the nth
roots of unity where n = 4γ, γ > 1. Let Y_1, Y_2, ..., Y_n be points on S¹ so that Y_i is midway
between X_i and X_{i+1}. (Here i is a cyclic index taking the values i = 1, 2, ..., n, mod n.) Consider
the ℋ-lines X_iX_{i+1}. These form a regular n-gon with vertices at infinity and internal angle equal
to zero. The half spaces of the ℋ-lines Y_iY_{i+2} determine a regular compact n-gon with internal
angle equal to 2α′, say. Let the vertices of this polygon be Z′_1, ..., Z′_n. In the diagram two
adjacent vertices Z′_i are labelled B and A. Let Z_i be points on the lines OX_i, equidistant from O and
lying between Z′_i and X_i, i = 1, ..., n. Define P to be the polygon with vertices Z_1, ..., Z_n. Let 2α
be the internal angle of P. The triangle OZ_iZ_{i+1} has larger area than the triangle OAB so that
α < α′. By continuity 2α can take any value in the range 0 < 2α < 2α′.

By considering the triangle OAB, whose angles are 2π/n, α′ and α′, it can be seen that

\[ 2\alpha' + \frac{2\pi}{n} < \pi. \tag{1} \]

By considering the triangle AY_iB, whose angles are π − 2α′, 0 and π − 2α′, it can be seen that

\[ 2\pi - 4\alpha' < \pi. \tag{2} \]

The inequalities (1) and (2) imply

\[ \frac{2\pi}{4} < 2\alpha' < \pi\left(1 - \frac{2}{n}\right). \]

Since n ≥ 8, we have 2π/n < 2π/4 < 2α′, so the value 2π/n lies in the admissible range.
By continuity P may be chosen so that its internal angle 2α = 2π/n.
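The polygon whose existence is argued above can also be located explicitly. The following sketch is not taken from the article: it assumes the standard right-triangle relation of hyperbolic trigonometry, cosh R = cot(π/n) cot(α), for a regular n-gon with interior angle 2α and hyperbolic circumradius R, and the fact that a point at hyperbolic distance R from O lies at Euclidean radius tanh(R/2) in the disc. The function names are illustrative. The interior angle is then re-measured directly, by moving a vertex to the origin with an isometry and reading off the Euclidean angle there.

```python
import cmath
import math


def vertex_radius(genus):
    """Euclidean radius of the vertices Z_i of P when 2*alpha = 2*pi/n, n = 4*genus."""
    n = 4 * genus
    cosh_r = 1.0 / math.tan(math.pi / n) ** 2  # cot(pi/n) * cot(alpha), alpha = pi/n
    return math.tanh(math.acosh(cosh_r) / 2.0)


def interior_angle(genus):
    """Measure the interior angle of P at one vertex, as a numerical check."""
    n = 4 * genus
    r = vertex_radius(genus)
    z = [r * cmath.exp(2j * math.pi * k / n) for k in range(n)]
    v = z[0]

    def to_origin(w):
        # Isometry sending the vertex v to 0; lines through 0 are diameters,
        # so the hyperbolic angle at v equals the Euclidean angle at 0.
        return (w - v) / (1 - v.conjugate() * w)

    p, q = to_origin(z[1]), to_origin(z[-1])
    return abs(cmath.phase(p / q))


if __name__ == "__main__":
    for genus in (2, 3, 4):
        n = 4 * genus
        print(genus, vertex_radius(genus), interior_angle(genus), 2 * math.pi / n)
```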


4. The Hyperbolic Isometries a_i and b_i. Let the vertices of P be Z_1, Z_2, ..., Z_n in anti-clockwise
cyclic order. Let a_i be the isometry which takes Z_{4i−1}Z_{4i} to Z_{4i−2}Z_{4i−3} and let b_i take
Z_{4i−2}Z_{4i−1} to Z_{4i}Z_{4i+1}, i = 1, ..., γ. For an illustration of a_i and b_i see Figure 9.

FIG. 8. The fundamental polygon for the case n = 8. (Not drawn to scale.)
Looking at Figure 8 it can be seen that the half spaces defined by the ℋ-lines through Z_1Z_2
and through Z_3Z_4 are nested. Therefore a_1 (and similarly b_1) is hyperbolic. By symmetry all a_i
and b_i are hyperbolic. However, this will also follow later as part of a more general result.


FIG. 9. The isometries a and b.

Let

\[ c_i = a_i b_i a_i^{-1} b_i^{-1} = [a_i, b_i], \qquad i = 1, \ldots, \gamma. \]

Then c_iZ_{4i+1} = Z_{4i−3}.

Consider the group G of all isometries generated by a_1, b_1, a_2, b_2, ..., a_γ, b_γ. If g is an element
of G, let gP = g(P) be the image of P under the isometry g.

As g varies over the 4γ elements

\[ g = a_1,\ a_1 b_1,\ a_1 b_1 a_1^{-1},\ \ldots,\ c_1 c_2 \cdots c_{\gamma-1} a_\gamma b_\gamma a_\gamma^{-1},\ c_1 c_2 \cdots c_{\gamma-1} c_\gamma = 1, \]

the polygons gP share the vertex Z_1 and the corresponding angles are lined up clockwise around
Z_1: see Figure 10 for the case γ = 2. The interiors of adjacent polygons are disjoint for the
following reasons: Consider say the intersection of a_1b_1P and a_1b_1a_1⁻¹P. This is a translation of
the intersection of P and a_1⁻¹P which only consists of their common edge. Since P was chosen to
have internal angle equal to 2π/4γ, the 4γ polygons gP form a full neighbourhood of Z_1. So in
particular if g = c_1c_2···c_γ, then gP = P. A quick calculation shows that gZ_1 = Z_1. Since g is an
orientation preserving isometry, it follows that gZ_i = Z_i for all vertices Z_i of P. But the only
isometry of ℋ with more than two fixed points is the identity. So g = c_1c_2···c_γ = 1.

By a cyclic permutation of the 4γ factors in c_1c_2···c_γ the same is true for every other vertex
of P. Furthermore if gZ_i is a vertex of gP, the polygons which form a neighbourhood of Z_i are
transformed to form a neighbourhood of gZ_i.

FIG. 10. The polygons all fit round a vertex without overlapping. (For γ = 2 the eight polygons
around Z_1 are P = [a_1, b_1][a_2, b_2]P, a_1P, a_1b_1P, a_1b_1a_1⁻¹P, [a_1, b_1]P, [a_1, b_1]a_2P,
[a_1, b_1]a_2b_2P and [a_1, b_1]a_2b_2a_2⁻¹P.)


5. The Tiling of the Plane by the Polygons gP. In this section it will be shown that the
translates gP of the fundamental polygon P cover the whole plane without overlapping. This tiling
is constructed implicitly in Siegel’s book [9, Chapter 12]. See also the book by Magnus for γ = 2
[4, p. 187]. It has already been shown that the polygons fit nicely along the edges and about the
vertices. It remains to show that the polygons do not overlap globally. For the case γ = 2 see
Figure 11. The idea behind the proof is to consider the space ℋ̃ obtained from the union of the
polygons by not allowing them to overlap at a distance and then to show that ℋ̃ actually is ℋ!

FIG. 11. The network of polygons when γ = 2.


More specifically let gP̃ be a copy of the polygon gP for each g in G. Assume initially that the
gP̃ are mutually disjoint. Let X be the set of generators of G and let X⁻¹ be the set of inverses. So

\[ X = \{a_1, a_2, \ldots, a_\gamma, b_1, b_2, \ldots, b_\gamma\}, \qquad
   X^{-1} = \{a_1^{-1}, a_2^{-1}, \ldots, a_\gamma^{-1}, b_1^{-1}, b_2^{-1}, \ldots, b_\gamma^{-1}\}. \]

Then gP̃ and hP̃ are to be glued along an edge if and only if g⁻¹h ∈ X ∪ X⁻¹. In this case let e be
the edge of P in common with g⁻¹hP. Then gP and hP will have the edge ge in common: glue gP̃
and hP̃ together using their copy of this edge.

Let ℋ̃ denote the resulting space. So ℋ̃ = ∪gP̃ with the above identifications. Notice that ℋ̃
has the same local structure as ℋ. There is a map p: ℋ̃ → ℋ which takes each gP̃ identically to its copy
gP. Locally distances, angles and indeed geometry are identical and are identified by the map p.

The polygons gP cover ℋ completely without overlapping.

Firstly let us show that p is onto. Because ℋ is connected, it is only necessary to show that
∪gP is both open and closed in ℋ. It is not hard to see that ∪gP is open. This is because if the
point B lies in gP, it either lies in its interior, or in the interior of an edge of gP which is also the
edge of an adjacent hP, g⁻¹h ∈ X ∪ X⁻¹, or it is a vertex of gP in which case it is surrounded by
an open neighbourhood of translates of P. So let A be a point of ℋ which does not lie in ∪gP, but
does lie in its closure. It will be shown that points such as these do not exist.

Every neighbourhood of A must meet at least one gP, in fact an infinite number of them. Let
us suppose that this neighbourhood is the disc of radius ε and centre A where ε is to be a suitably
chosen small number.

Now there is a positive number δ such that if B lies in gP and C is a distance less than δ from
B, then either C lies in gP or C lies in hP where g⁻¹h ∈ X ∪ X⁻¹. A suitable choice for δ could be
one half the length of any side of P. But this leads to a contradiction if ε is put equal to δ because
A was supposed not to lie in ∪gP.

Therefore the closure of ∪gP is the whole of the hyperbolic plane and so the map p is onto ℋ.

It will now be shown that p has the path lifting property and hence is one-to-one. To have the
path lifting property means that if α: I → ℋ is a path, where I = [0, 1] is the closed interval
0 ≤ x ≤ 1, and Ã ∈ ℋ̃ is a point such that pÃ = α(0), then there is a unique path α̃: I → ℋ̃, the
lift of α, such that pα̃ = α and α̃(0) = Ã.

Let V be a vertex of gP̃ in ℋ̃ and let U(V) be the interior of the union of all polygons with
vertex V. Then the open sets U(V) as V varies over all vertices cover ℋ̃. Moreover pU(V) is an
exact copy of U(V) in ℋ and the union of the pU(V) covers ℋ.

If the path α lies entirely in some pU(V) where Ã ∈ U(V), then the lift α̃ is uniquely defined
by

\[ \tilde{\alpha} = (p|U(V))^{-1} \alpha. \]

If not, then by a standard compactness argument α can be broken up into a sequence of paths
each of which does lie in some pU(V). That is, there exists a subdivision

\[ 0 = t_0 < t_1 < \cdots < t_n = 1 \]

of the interval [0, 1] such that the image of

\[ \alpha_i = \alpha|[t_i, t_{i+1}] \]

lies in some pU(V), i = 0, ..., n − 1. We proceed to lift α step by step. Suppose α has been lifted
for 0 ≤ t ≤ t_i. Consider α_i: this has a unique lift α̃_i with α̃_i(t_i) = α̃_{i−1}(t_i) because α_i lies in
some pU(V). So the lift of α|[0, t_i] can be extended to α|[0, t_{i+1}]. After n steps, the whole of α
will be lifted.

To see the uniqueness of α̃, consider another lift α̃′ with α̃′(0) = α̃(0). Let

\[ \xi = \sup\{\, t \mid 0 \le t \le 1,\ \tilde{\alpha}[0, t] = \tilde{\alpha}'[0, t] \,\}. \]

If ξ = 1, we are done. If ξ < 1, then for some small ε, α(ξ − ε, ξ + ε) lies in a pU(V) and so α̃ and
α̃′ have a unique extension to [0, ξ + ε) which is a contradiction.

Actually rather more has been shown here because clearly a continuous variation in α will
induce a continuous variation in α̃.

Now suppose that p(Ã) = p(B̃) = A. Join Ã and B̃ by a path α̃. This is the unique lift of a
loop α such that α(0) = α(1) = A. Consider the 1-parameter family of loops α_t, 0 ≤ t ≤ 1, given
by

\[ \alpha_t(x) = t\,\alpha(x) + (1 - t)A, \qquad 0 \le x \le 1. \]

Here we are using the affine structure of ℋ as a convex subset of C. In this way it is possible to
shrink the loop α based at A through a family of loops based at A to the trivial loop. Consider the
lift α̃_t. This starts out as a path joining Ã to B̃ and varies continuously joining Ã to B̃ all the time
until it is a trivial path. Therefore Ã = B̃ and p is a one-to-one map. But we have already seen that
p is onto and so ℋ̃ is an exact copy of ℋ. In particular the polygons gP cover ℋ completely
without overlapping.


6. The Structure of the Group G. We have seen that G is in one-to-one correspondence with
the polygons gP and that it has the hyperbolic isometries a_1, ..., a_γ, b_1, ..., b_γ as generators, and
that these satisfy the relation

\[ [a_1, b_1][a_2, b_2] \cdots [a_\gamma, b_\gamma] = 1. \]

We shall now show that in a technical sense to be defined below, this is the only relation satisfied
by G.

Let Ĝ be the free nonabelian group on the elements â_1, b̂_1, â_2, b̂_2, ..., â_γ, b̂_γ and let φ: Ĝ → G
be the homomorphism which doffs hats, so

\[ \phi(\hat{a}_1) = a_1, \quad \phi(\hat{b}_1) = b_1, \quad \text{etc.} \]

The element

\[ \hat{c} = [\hat{a}_1, \hat{b}_1][\hat{a}_2, \hat{b}_2] \cdots [\hat{a}_\gamma, \hat{b}_\gamma] \]

lies in the kernel K of φ,

\[ K = \{\, \hat{g} \in \hat{G} \mid \phi(\hat{g}) = 1 \,\}. \]

Let ĝ be an arbitrary element of Ĝ. Then

\[ \hat{g} = \hat{x}_k \cdots \hat{x}_1, \qquad \text{where } \hat{x}_i \in \hat{X} \cup \hat{X}^{-1}, \]

\[ \hat{X} = \{\hat{a}_1, \hat{b}_1, \ldots, \hat{a}_\gamma, \hat{b}_\gamma\}, \qquad
   \hat{X}^{-1} = \{\hat{a}_1^{-1}, \hat{b}_1^{-1}, \ldots, \hat{a}_\gamma^{-1}, \hat{b}_\gamma^{-1}\}. \]

If ĝ lies in K, then there is a sequence of polygons

\[ P_1,\quad P_2 = x_1 P_1,\quad P_3 = x_2 x_1 P_1,\quad \ldots,\quad P_k = x_{k-1} \cdots x_2 x_1 P_1 \]

such that P_i and P_{i+1} have an edge in common and P_{k+1} = x_kP_k = P_1. Let N be the union of the
interiors of the P_i together with the union of the interiors of their common edges. Then ℋ − N is
divided into an unbounded outside region and a closed interior set R by the Jordan curve
theorem.

Suppose firstly that the inside R consists of just one point gZ_1. Then, perhaps after some
cancellation, ĝ is a cyclic permutation of some power of the product

\[ (\hat{g}\hat{a}_1\hat{g}^{-1}\,\hat{g}\hat{b}_1\hat{g}^{-1}) \cdots (\hat{g}\hat{b}_\gamma^{-1}\hat{g}^{-1}) = \hat{g}\hat{c}\hat{g}^{-1} \]

or

\[ (\hat{g}\hat{b}_\gamma\hat{g}^{-1}\,\hat{g}\hat{a}_\gamma\hat{g}^{-1}) \cdots (\hat{g}\hat{a}_1^{-1}\hat{g}^{-1}) = \hat{g}\hat{c}^{-1}\hat{g}^{-1} \]

depending on whether the sequence of polygons is clockwise or not.

We shall now see that the general case is a combination of this special case.

Let H be the subgroup of Ĝ consisting of all products

\[ \prod_{i=1}^{m} \hat{g}_i\, \hat{c}^{\,e_i}\, \hat{g}_i^{-1} \]

for some ĝ_i in Ĝ and exponents e_i = ±1, i = 1, ..., m. Then H is a subgroup of K and we shall
show that H actually is the whole of K. The proof will proceed by induction on ν, the total number
of vertices lying in R. If ν = 1, then R consists of a single point and this case was considered
above.

Suppose now that R contains at least one polygon. Then by reordering the sequence P_1, ..., P_k,
it may be assumed that R contains a polygon Q_1 adjacent to P_i as in Figure 12.

FIG. 12.


The sequence Q,, Q),..., Q;_; = P;, Q; = P;, corresponds to the element g = y,j. --- y, say 
of K and hence of H by the above. Because the sequence P,, P,,..., P,, Q),Q),..-, Q)_> 
contains fewer polygons in its inside, it corresponds by induction to an element r of H. We can 


calculate X, --- X, in terms of qg and r as follows: 
The sequence P,, P,,..., P,, Q,, Q2,---, Q;_-2 gives rise to the word r as follows: 
P, =k +++ &,P, 


Q, =P Py = VX. ++ KP 
QO. = iQ, = Vi Xo °° Py 
Q; = J) \Q2 = Vi-1I1V 1% oo XP, 


Qi. = Yao WY X10 KP 
Py = V3 0° WiViXq 00 XP 
So r= Py Py QR ‘Ri Xq +++ Ry. 
Using the fact that y, = %;' we see that 


A 


A —1l -; 
X00 = (V2) "Ord r- 
By induction q̂ and r̂ lie in H. Since H is a normal subgroup of Ĝ it follows that x̂_k x̂_{k−1} ··· x̂_1 also
lies in H.

If R contains no polygons, then a similar argument works by pushing in R along edges.

In group theoretic language G is generated by a_1, b_1, a_2, b_2, ..., a_γ, b_γ and has one relation

\[ c = [a_1, b_1][a_2, b_2] \cdots [a_\gamma, b_\gamma] = 1. \]


7. The Orbit Space S_γ. By analogy with the torus considered in the introduction, let S_γ be the
hyperbolic plane divided out by the action of the group G. Then S_γ is obtained from the polygon
P by identifying edges e_i = Z_iZ_{i+1} with e_j = Z_jZ_{j+1} if there is a group element transporting e_i
into e_j. Now to any edge e there is a unique disjoint edge f and a unique generator or its inverse in
X ∪ X⁻¹ which maps e to f. Conversely associated with each generator, there is a unique pair of
edges which are identified by this generator. The result of identifying P by these generators is the
orientable surface of genus γ. So it only remains to show that these are the only identifications.


Let e and f be two edges of P and g an element of G which takes e into f. Let xP be the
polygon having the edge f in common with P where x ∈ X ∪ X⁻¹. Then gP is either P or xP. But
we have seen that there is a 1-1 correspondence between the polygons gP and the group elements
g. Hence either g = 1 or g = x. So the only identifications are the ones above. These identifications
define the orientable surface of genus γ as in Massey’s book [5].


8. All Elements of G − {1} Are Hyperbolic. Suppose that g is a nontrivial element of G which
is not hyperbolic. Then there would be points Z, gZ in ℋ arbitrarily close to one another.
However, there is a positive minimum for the distances between any two points in P which are to
be identified, and this is therefore a minimum for all identified pairs in ℋ. Therefore all elements
of G − {1} are hyperbolic.
A NECESSARY AND SUFFICIENT CONDITION FOR THE PRIMALITY OF
FERMAT NUMBERS


RICHARD MCINTOSH
Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada, T2N 1N4


A Fermat number is an integer of the form

\[ F(n) = 2^{2^n} + 1, \qquad n \ge 0. \]

If F(n) is prime, it is called a Fermat prime. The purpose of this paper is to give a necessary and
sufficient condition for Fermat primes.

We will need the tangent numbers T(n) and the Bernoulli numbers B(n). They are defined to
be the coefficients in the following power series:

\[ \tan z = \sum_{n=0}^{\infty} T(n)\,\frac{z^n}{n!}, \qquad
   \frac{z}{e^z - 1} = \sum_{n=0}^{\infty} B(n)\,\frac{z^n}{n!}. \]


THEOREM. F(n) is prime if and only if F(n) does not divide T(F(n) − 2).


Richard McIntosh: I am an undergraduate student. I discovered the theorem when I was working on a summer
research project at the University of Calgary in 1980.

Proof. The definition of tan z implies

\[
\tan z = \frac{e^{iz} - e^{-iz}}{i\,(e^{iz} + e^{-iz})}
       = \frac{1}{i}\left(1 - \frac{2}{e^{2iz} + 1}\right)
       = \frac{1}{i}\left(1 - \frac{2}{e^{2iz} - 1} + \frac{4}{e^{4iz} - 1}\right)
       = \frac{1}{z}\left\{-iz + \sum_{n=0}^{\infty}\left[(2iz)^n - (4iz)^n\right]\frac{B(n)}{n!}\right\},
\]

and by equating coefficients we obtain the identity

\[ T(2n - 1) = 2^{2n-1}\,(2^{2n} - 1)\,|B(2n)|/n, \qquad n \ge 1. \tag{*} \]

Let N(2n) and D(2n) denote the numerator and denominator of |B(2n)|, respectively, with
(N(2n), D(2n)) = 1. From the von Staudt-Clausen theorem [2]

\[ D(2n) = \prod_{\substack{p \text{ prime};\\ (p-1) \mid 2n}} p. \]
It follows from (*) that

\[ T(F(n) - 2) = \frac{2^{F(n)-2}\,(2^{F(n)-1} - 1)\,N(F(n) - 1)}{2^{2^n - 1}\,D(F(n) - 1)}, \]

\[ D(F(n) - 1) = \prod_{\substack{p \text{ prime};\\ (p-1) \mid 2^{2^n}}} p
   = 2\,\bar{F}(n)\,\bar{F}(n-1) \cdots \bar{F}(0), \]

where

\[ \bar{F}(k) = \begin{cases} F(k) & \text{if } F(k) \text{ is prime}, \\ 1 & \text{if } F(k) \text{ is composite}. \end{cases} \]

Thus

\[ T(F(n) - 2) = \frac{F(2^n - 1)\,F(2^n - 2) \cdots F(0)\; 2^{F(n) - 2^n - 2}\; N(F(n) - 1)}{\bar{F}(n)\,\bar{F}(n-1) \cdots \bar{F}(0)}. \]

Since (D(F(n) − 1), N(F(n) − 1)) = 1 and F(n) is odd, it follows that

\[ \bigl(F(n),\ 2^{F(n) - 2^n - 2}\, N(F(n) - 1)\bigr) = 1. \]

F(k + 1) = 2 + F(k)F(k − 1) ··· F(0) for k ≥ 0 implies (F(n), F(k)) = 1 for all k ≠ n.
Hence F(n) is prime if and only if F(n) does not divide T(F(n) − 2), which completes the proof.
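The identity (*) and the prime direction of the theorem can be checked by machine for small n. The sketch below is only an illustration and not part of the paper: the function names are assumptions, the tangent numbers are generated from the relation d/dz tan z = 1 + tan² z, and the Bernoulli numbers from the usual recurrence with B(1) = −1/2. Only n = 0, 1, 2, 3 are tested, since the next cases would need T(65535) and, for the first composite Fermat number F(5), T(2³² − 1), which are far beyond this brute-force approach.

```python
from fractions import Fraction
from math import comb


def tangent_numbers(max_n):
    """Return [T(0), ..., T(max_n)] with tan z = sum T(n) z^n / n!.

    The n-th derivative of tan z is a polynomial P_n in t = tan z with
    P_0 = t and P_{n+1} = P_n'(t) * (1 + t^2); then T(n) = P_n(0).
    """
    coeffs = [0, 1]  # P_0(t) = t, stored lowest degree first
    result = [coeffs[0]]
    for _ in range(max_n):
        deriv = [k * coeffs[k] for k in range(1, len(coeffs))]
        nxt = [0] * (len(deriv) + 2)
        for k, c in enumerate(deriv):
            nxt[k] += c        # contribution of P_n'(t) * 1
            nxt[k + 2] += c    # contribution of P_n'(t) * t^2
        coeffs = nxt
        result.append(coeffs[0])
    return result


def bernoulli_numbers(max_n):
    """Return [B(0), ..., B(max_n)] with z / (e^z - 1) = sum B(n) z^n / n!."""
    B = [Fraction(1)]
    for m in range(1, max_n + 1):
        s = sum(comb(m + 1, k) * B[k] for k in range(m))
        B.append(-s / (m + 1))
    return B


def fermat(n):
    return 2 ** (2 ** n) + 1


def identity_holds(n, T, B):
    """Check (*): T(2n - 1) = 2^(2n-1) (2^(2n) - 1) |B(2n)| / n."""
    return T[2 * n - 1] == 2 ** (2 * n - 1) * (2 ** (2 * n) - 1) * abs(B[2 * n]) / n


def passes_criterion(n, T):
    """True when F(n) does not divide T(F(n) - 2)."""
    F = fermat(n)
    return T[F - 2] % F != 0


if __name__ == "__main__":
    T = tangent_numbers(255)
    B = bernoulli_numbers(16)
    print(all(identity_holds(n, T, B) for n in range(1, 9)))
    print([(fermat(n), passes_criterion(n, T)) for n in range(4)])
```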
MISCELLANEA


92. The greatest enemy to true arithmetic work is found in so-called practical or illustrative
problems, which are freely given to our pupils, of a degree of difficulty and complexity altogether
unsuited to their age and mental development...


—F. A. Walker, 1899 (quoted in R. E. Moritz, Miscellanea Mathematica, New York, 1914, p.
81).


What ever happened to proofs by transfinite induction? See p. 145. 
showing cute examples of how to mislead others with the misuse of statistics. The CUPM panel’s
recommendation that this course be taught by an experienced senior faculty member is very 
important. Furthermore, I recommend that this person have experience in teaching statistics. 

Taking a broader view, one sees that those at the forefront of statistics are beginning to discuss 
“Statistics as a Discipline.” In fact, this was the title of the main address presented by Paul 
Minton at the second annual Conference for Texas Statisticians held at the Baylor University 
(April, 1982). The main point of Professor Minton’s talk concerned the idea of offering 
undergraduate degrees in Statistics. 

There are two points I want to make. First, it is conceivable that a statistics course “for coping
with life” could cover most of the topics in the list given by the CUPM panel. Second, topic 14 in
a list of topics for this statistics course could be “mathematics and its dangers.” Need I say more?

Since it would be cowardly to avoid a recommendation for a replacement for topic 14, I 
suggest: 


14. The Use and Interpretation of Statistics in Everyday Life. 


Danny W. Turner 
Department of Mathematics 
Baylor University 

Waco, Texas 76798 


MISCELLANEA 


With Apologies to John Milton 


94. John Milton wrote a famous sonnet on being blind. This modern version looks at a 
different kind of blindness. 


When I consider how my grants are spent, 

Ere half my days in these dark halls and wide; 
My teaching talent, which I’d love to hide, 
Lodged with me useless, though my soul more bent 
To serve therewith my Provost, and present 

My teaching ratings, lest he returning chide; 

“Do Deans exact much teaching, grants denied?” 
I fondly ask: the Provost, to prevent 

That murmur, soon replies, “Deans do not need 
Either that teaching or those grants. Who best 
Does his committee work, serves best. His state 
Is Deanly: thousands at his bidding speed 

And fly to Washington, disdaining rest; 

They also serve who only loaf and wait.” 


— Edwin Hewitt 


ANSWER TO “PHOTO” ON PAGE 100 


The subject is Max Zorn; the picture was taken in the summer of 1981. He is alive and well and 
very much a part of the mathematical community in Bloomington, Indiana. 
Mathematical Sciences, State University of New York, Binghamton, NY 13901


PATRICIA A. WOODWORTH 
Department of Mathematics, Ithaca College, Ithaca, NY 14850 


1. Introduction. A cursory view of employment advertisements shows that database management
is an area of intense current interest. Informally, a database is just a collection of programs
and data sets which store and retrieve large amounts of information in an orderly manner.
Databases are widely used in industry and government, and are increasingly useful for research
assistance. For example, a popular system called DIALOG, operated by Lockheed Missile and
Space Company, Inc., offers researchers telephone access, via computer terminals, to over
100 separate databases containing abstracts, key words, cross-indices, etc. One of these is
SCISEARCH, constructed by the Institute for Scientific Information, which essentially contains
the Science Citation Index in a computer-accessible form. SCISEARCH contains about 3 million
entries, and can easily answer queries such as “List all papers written in English which reference
either B. Sz.-Nagy’s paper Sur les contractions de l’espace de Hilbert, Acta. Sci. Math. (Szeged), 15
(1963) 82-85, or J. J. Schaffer’s paper On unitary dilations of contractions, Proc. Amer. Math. Soc.,
6 (1955) 322.” The actual request sent to SCISEARCH could be


SELECT CR = NAGY BS, 1963, V15, P82
SELECT CR = SCHAFFER JJ, 1955, V6, P322
COMBINE 1 OR 2
LIMIT 3/ENG


(This form is not the most desirable, but is easily learned.) DIALOG will also contain the
American Mathematical Society’s MATHFILE database, built from the reviews appearing in
Mathematical Reviews [30].

Besides being useful research tools, databases are an interesting source of mathematical
problems. These include statistical and cryptographic considerations of security [28, Chap. 9],
questions of computational complexity [4], [28, Chap. 6], and logical and algebraic questions of
how to store and access the information. It is this last area which is the focus of this paper. There
are several competing schemes for database organization, one of which is called the relational
form. This is the form which has been most intensively studied, and was advanced by Codd [9] as
being sufficiently powerful to model real-world situations while being conceptually and
computationally simple enough to manipulate. It is particularly susceptible to mathematical analysis,
which may account for its popularity in the research community. We feel that an expository
account of some of the issues leading to, and arising from, the so-called normal forms for
relational databases may be of interest to mathematicians, all of whom have encountered relations


Quentin F. Stout: My 1977 Ph.D. was in operator theory and functional analysis, directed by John B. Conway at 
Indiana University. Since 1976 I have been an Assistant Professor at State University of New York at Binghamton. I 
have long had an interest in both computer science and mathematics, and I especially enjoy problems that draw
from both disciplines. My current research is primarily in computer science, particularly massively parallel 
computation. 




Patricia A. Woodworth: I received my Ph.D. from the State University of New York at Binghamton in 1977 
under David Klarner. My dissertation dealt with the combinatorial problem of plane tree enumeration. Since then, I 
have been an Assistant Professor in the Mathematics Department at Wilkes College (one year) and Ithaca College. 
Recently I have become increasingly involved in computer science education and research. 


C#    CN               SEC   CR   PN        P#
111   Calculus I        1     4   Jones     125
111   Calculus I        2     4   Smith     382
111   Calculus I        3     4   Thomas    418
111   Calculus I        4     4   Jones     125
111   Calculus I        5     4   Johnson   432
112   Calculus II       1     4   Smith     153
112   Calculus II       2     4   Thomas    418
243   Linear Algebra    1     3   Jones     125
243   Linear Algebra    2     3   Smith     382
243   Linear Algebra    3     3   Johnson   432
318   Analysis          1     3   Smith     153

TOTAL-OFFERINGS

FIG. 1


C#    CN               SEC   CR   PN        P#
111   Calculus I        3     4   Thomas    418
112   Calculus II       2     4   Thomas    418

TOTAL-OFFERINGS_P


CN
Calculus I
Calculus II

TOTAL-OFFERINGS_P [CN]

FIG. 2
in several courses but few of whom have used them in nontrivial applications. This paper concerns
only a small, though significant, portion of database design. Readers who wish to obtain a broader
view are encouraged to consult such texts as Cardano [8], Date [14], Martin [24] or Ullman [28].

As an example, suppose DB University wishes to computerize its information concerning
courses. The information they wish to keep includes course name (CN), course number (C#),
section (SEC), credit (CR), professor name (PN), and professor number (P#). CN, C#, SEC,
CR, PN, P# are called attributes, which are just sets of all possible values. For example, P# may
denote the set of all 3 digit numbers. At any instant the data is a relation on
{CN, C#, SEC, CR, PN, P#}, i.e., a subset of CN × C# × SEC × CR × PN × P#. The role of
time is quite important, since the relation can change from instant to instant. However, there are
some features, beyond the attribute set itself, that remain constant. For example, we will always
assume that P# uniquely determines PN, that C# uniquely determines CN and CR, that CN
uniquely determines C#, and that C# and SEC together uniquely determine P#. A possible
instance of this relation, which we will call TOTAL-OFFERINGS, is given in Fig. 1. The tabular
form of Fig. 1 is particularly useful for answering queries such as “What are the names of the
courses taught by professor number 418?” Such queries can be answered using the relational
operators selection and projection. We use the shorthand notation “P# = 418” to denote the
predicate “The P#th component of X equals 418” for X ∈ CN × C# × SEC × CR × PN ×
P#. If we let P denote this predicate, then TOTAL-OFFERINGS_P denotes the subrelation
consisting of all elements X of TOTAL-OFFERINGS such that P(X) is true. We then project
TOTAL-OFFERINGS_P onto the attribute CN, denoted by TOTAL-OFFERINGS_P [CN], to
obtain our answer. This projection is just the standard projection of a subset of a crossproduct
onto one of its components. See Fig. 2.
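The query just described can be sketched in a few lines of code. This is an illustration only, not how any particular database system works: the relation is stored as a Python set of tuples transcribed from Fig. 1, and the helper names `select` and `project` are assumptions chosen to mirror the two relational operators.

```python
# Attribute names, in the column order used in Fig. 1.
ATTRS = ("C#", "CN", "SEC", "CR", "PN", "P#")

# The instance of TOTAL-OFFERINGS shown in Fig. 1.
TOTAL_OFFERINGS = {
    (111, "Calculus I", 1, 4, "Jones", 125),
    (111, "Calculus I", 2, 4, "Smith", 382),
    (111, "Calculus I", 3, 4, "Thomas", 418),
    (111, "Calculus I", 4, 4, "Jones", 125),
    (111, "Calculus I", 5, 4, "Johnson", 432),
    (112, "Calculus II", 1, 4, "Smith", 153),
    (112, "Calculus II", 2, 4, "Thomas", 418),
    (243, "Linear Algebra", 1, 3, "Jones", 125),
    (243, "Linear Algebra", 2, 3, "Smith", 382),
    (243, "Linear Algebra", 3, 3, "Johnson", 432),
    (318, "Analysis", 1, 3, "Smith", 153),
}


def select(relation, attrs, predicate):
    """Selection: keep the elements X of the relation for which P(X) is true."""
    return {row for row in relation if predicate(dict(zip(attrs, row)))}


def project(relation, attrs, onto):
    """Projection of the relation onto the listed attributes (duplicates merge)."""
    positions = [attrs.index(a) for a in onto]
    return {tuple(row[i] for i in positions) for row in relation}


if __name__ == "__main__":
    # "What are the names of the courses taught by professor number 418?"
    taught_by_418 = select(TOTAL_OFFERINGS, ATTRS, lambda x: x["P#"] == 418)
    print(sorted(project(taught_by_418, ATTRS, ["CN"])))
```

Because a relation is a set, projecting onto CN automatically merges repeated course names, which is the behaviour the figure shows.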

Despite its utility, TOTAL-OFFERINGS has some undesirable defects. For example, five 
different elements of the relation contain the information that the course numbered 111 is named 
CALCULUS I and carries 4 credits. There are several problems here. In addition to being an 
inefficient use of space, the relation is more susceptible to consistency errors. That is, if one inserts 
a new section of Calculus I and enters 3 as the number of credits, then the new entry will be 
inconsistent with the previous ones. Cross checking can be done to prevent this but it is 
time-consuming and usually not done. We can solve both problems by decomposing the 
TOTAL-OFFERINGS relation. We introduce a relation COURSES, where COURSES = 
TOTAL-OFFERINGS [C#, CN, CR], and a temporary relation R1, where R1 = 
TOTAL-OFFERINGS [C#, SEC, PN, P#]. These are given in Fig. 3. By storing COURSES and R1 
instead of TOTAL-OFFERINGS, we have reduced the total space needed to store the 
information, eliminated some redundancy, and reduced the possibility of certain logical errors. Now to 
introduce a sixth section of CALCULUS I we only need to modify R1. We can proceed further 
noting that we have a similar situation with respect to the professors. We decompose R1 into 
PROFESSORS = R1[P#, PN] and OFFERINGS = R1[C#, SEC, P#], illustrated in Fig. 4. The 
properties of projections insure that PROFESSORS = TOTAL-OFFERINGS [P#,PN] and 
OFFERINGS = TOTAL-OFFERINGS [C#, SEC, P#]. 

Each of COURSES, PROFESSORS, and OFFERINGS is a clearly defined logical unit to 
which we have assigned a natural name, and so they provide a reasonable way to compress the 
information contained in TOTAL-OFFERINGS. The collection {COURSES, PROFESSORS, 
OFFERINGS} is called a decomposition of TOTAL-OFFERINGS, with decomposition being an 
informal term meaning that the original relation has been replaced by a collection of smaller ones. 
Further, we claim that the three subrelations in the decomposition contain all the information of 
the original. To define precisely what this means we add the notion of join, the third fundamental 
operation on relations. If R1 and R2 are relations with attribute sets @₁ and @₂, then the join of 
R1 and R2, denoted R1 * R2, has attribute set @ = @₁ ∪ @₂ and is given by 


R1 * R2 = { t ∈ ×@ : ∃ t₁ ∈ R1, t₂ ∈ R2 such that t[@₁] = t₁ and t[@₂] = t₂ } 

C#    CN               CR 
111   Calculus I       4 
112   Calculus II      4 
243   Linear Algebra   3 
318   Analysis         3 


COURSES = TOTAL-OFFERINGS [C#, CN, CR] 


C#    SEC   PN        P# 
111   1     Jones     125 
111   2     Smith     382 
111   3     Thomas    418 
111   4     Jones     125 
111   5     Johnson   432 
112   1     Smith     153 
112   2     Thomas    418 
243   1     Jones     125 
243   2     Smith     382 
243   3     Johnson   432 
318   1     Smith     153 


R1 = TOTAL-OFFERINGS [C#, SEC, PN, P#] 


FIG. 3 


where we use t[@₁] to mean the projection of t onto @₁. It is easy to show that the join is 
commutative, associative, and, if @₁ ∩ @₂ = ∅, is just the cross product. Our claim that the 
decomposition contains all of the information of TOTAL-OFFERINGS is the claim that 
OFFERINGS * COURSES * PROFESSORS = TOTAL-OFFERINGS. {OFFERINGS, 
COURSES, PROFESSORS} is said to be a lossless decomposition of TOTAL-OFFERINGS, where 
in general if R is a relation with attribute set @ and @ = @₁ ∪ @₂ ∪ ··· ∪ @ₙ, then {R[@₁], ..., 
R[@ₙ]} is a lossless decomposition of R if and only if R = R[@₁] * ··· * R[@ₙ]. A decomposition 
which is not lossless is said to be lossy. What is lost in a lossy decomposition is the information 
that certain combinations are impossible, since we always have R[@₁] * ··· * R[@ₙ] ⊇ R. For 
example, TOTAL-OFFERINGS [C#, PN] * TOTAL-OFFERINGS [PN, P#] has such entries as 
(112, SMITH, 382). With OFFERINGS * COURSES * PROFESSORS this cannot happen, for we 
assume that P# uniquely determines PN and C# uniquely determines CN. (The next section 

P#    PN 
125   Jones 
382   Smith 
418   Thomas 
432   Johnson 
153   Smith 


PROFESSORS = R1 [P#, PN] 


C#    SEC   P# 
111   1     125 
111   2     382 
111   3     418 
111   4     125 
111   5     432 
112   1     153 
112   2     418 
243   1     125 
243   2     382 
243   3     432 
318   1     153 


OFFERINGS = R1 [C#, SEC, P#] 


FIG. 4 


contains a proof that this decomposition is lossless.) OFFERINGS contains all of the significant 
information about the correspondence between courses, sections, and professors, with the other 
two relations expanding some details. 
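
The contrast between a lossless and a lossy decomposition can be checked mechanically. The following is a minimal sketch, assuming a tuple is encoded as a frozenset of (attribute, value) pairs; the helper names as_relation, project and join are ours. The rows are those listed for R1 in Fig. 3.

```python
# Rows are (C#, SEC, PN, P#) as listed for R1 in Fig. 3.
R1_ATTRS = ("C#", "SEC", "PN", "P#")
R1_ROWS = [
    (111, 1, "Jones", 125), (111, 2, "Smith", 382), (111, 3, "Thomas", 418),
    (111, 4, "Jones", 125), (111, 5, "Johnson", 432), (112, 1, "Smith", 153),
    (112, 2, "Thomas", 418), (243, 1, "Jones", 125), (243, 2, "Smith", 382),
    (243, 3, "Johnson", 432), (318, 1, "Smith", 153),
]


def as_relation(attrs, rows):
    """Encode each tuple as a frozenset of (attribute, value) pairs."""
    return {frozenset(zip(attrs, row)) for row in rows}


def project(relation, attrs):
    """Restrict every tuple to the given attributes."""
    attrs = set(attrs)
    return {frozenset((a, v) for a, v in t if a in attrs) for t in relation}


def join(r, s):
    """Natural join: merge every pair of tuples that agree on shared attributes."""
    out = set()
    for t in r:
        for u in s:
            merged = dict(t)
            if all(merged.get(a, v) == v for a, v in u):
                merged.update(u)
                out.add(frozenset(merged.items()))
    return out


R1 = as_relation(R1_ATTRS, R1_ROWS)

# Lossless: PROFESSORS * OFFERINGS rebuilds R1 exactly.
professors = project(R1, ["P#", "PN"])
offerings = project(R1, ["C#", "SEC", "P#"])
lossless = join(professors, offerings) == R1

# Lossy: joining on PN instead creates a tuple that was never in the data.
by_name = join(project(R1, ["C#", "PN"]), project(R1, ["PN", "P#"]))
spurious = frozenset({("C#", 112), ("PN", "Smith"), ("P#", 382)})
print(lossless, spurious in by_name, spurious in project(R1, ["C#", "PN", "P#"]))
```

The spurious tuple arises because two different professors share the name Smith, so PN does not determine P#.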

Lossless decompositions can give space efficiency and reduce insertion anomalies, as we have 
seen, and can reduce certain other anomalies. For example, suppose that next semester analysis is 
not being taught. Then that semester TOTAL-OFFERINGS would contain no information about 
analysis. In the decomposed relations, OFFERINGS will also not include any information 
concerning analysis, but in COURSES we can keep the information that analysis is course 
number 318. It is still true that the join of the subrelations will be the entire relation, but 
now the subrelations can contain additional information, and COURSES ⊇ 
TOTAL-OFFERINGS [C#, CN, CR]. The decomposed version allows us to avoid a deletion anomaly 
whereby a deletion of some information (analysis is not being taught) requires us to delete other 
information (analysis is course 318). However, this possibility of having the whole be less than the 
sum of its parts causes some dissension, with several researchers insisting (perhaps only implicitly) 
that at all times each subrelation is a projection of some, perhaps never actually constructed, 
universal relation. This is usually called the universal relation assumption [5], an assumption we 
shall not make. 

Lossless decompositions can also help avoid update anomalies, in which a single change forces 
other changes. For example, if professor number 125 changes his or her name to ROBERTS, then 
that would necessitate changing 3 entries in TOTAL-OFFERINGS. Changing only one entry 
would introduce an inconsistency. In the decomposed version this would not happen since only 
one entry of PROFESSORS would be affected. 

The universal relation is quite powerful, but the problem of insertion, deletion, or update 
anomalies makes it unsuitable, pushing us to decompose into more fundamental units from which 
we can reconstruct the universal relation by the join operator. Codd [11] stated that the two most 
important reasons for using decompositions are: 


(1) To reduce the need for restructuring the collection of relations as new types of data are 
introduced; 
(2) To reduce the incidence of undesirable insertion, update and deletion anomalies. 


We shall see that the normal forms essentially accomplish this, although other considerations will 
introduce conflicting goals. 


2. Second and Third Normal Form. The normal forms, introduced by Codd [10], formalize the 
conditions a relation must satisfy in order to avoid the redundancies and anomalies described in 
the introduction. In this section, we will define the first three normal forms and the process of 
decomposing a relation into normal forms. In the relation TOTAL-OFFERINGS (see Fig. 1) a 
redundancy resulted because the value of the attribute CR depended on the value of the attribute 
C#. For example, if the value of C# is 112, then, at this point in time, the value of CR is 4. For a 
given value of C# the value of CR may change over time but, at any instant, if the value of C# is 
known, then there is only one possible value of CR. 

The dependency of CR on C# is called a functional dependency. In general, assume R is a 
relation with attribute set @ and assume X and Y are subsets of @. In R, Y is functionally 
dependent on X (X functionally determines Y), written as X → Y, if r₁, r₂ ∈ R and r₁[X] = r₂[X] 
implies that r₁[Y] = r₂[Y]. X ↔ Y means X → Y and Y → X. As specified in the introduction, 
the attributes of the relation TOTAL-OFFERINGS satisfy the following functional dependencies: 


C# ↔ CN        C#, SEC → P# 
C# → CR        P# → PN 


It is important to realize that a functional dependency cannot be determined from any 
particular instance of a relation. For example, if a particular instance of TOTAL-OFFERINGS 
contained only one section of each course, then it would appear that C# → SEC. Since this is not 
true for all instances of TOTAL-OFFERINGS, this is an invalid conclusion. Therefore the 
functional dependencies must be specified by a person familiar with the situation being modeled. 
An added complication is that these dependencies can also be changed over time. For example, a 
curriculum committee may decide that the credit for a course may be determined by the course 
number and the level of the student taking the course. This complication will be forbidden here. 
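
A short sketch makes the point concrete: an instance can refute a functional dependency but can never establish one. The function name holds and the small instances below are hypothetical, chosen only to mirror the one-section-per-course scenario.

```python
def holds(relation, lhs, rhs):
    """Return True if the instance contains no counterexample to lhs -> rhs."""
    seen = {}
    for row in relation:
        x = tuple(row[a] for a in lhs)
        y = tuple(row[a] for a in rhs)
        # setdefault records the first right side seen for each left side.
        if seen.setdefault(x, y) != y:
            return False
    return True


# Hypothetical instance with exactly one section per course.
one_section_each = [
    {"C#": 111, "SEC": 1, "CR": 4},
    {"C#": 112, "SEC": 1, "CR": 4},
    {"C#": 243, "SEC": 1, "CR": 3},
]
# Adding a second section of course 111 exposes the false dependency.
two_sections = one_section_each + [{"C#": 111, "SEC": 2, "CR": 4}]

print(holds(one_section_each, ["C#"], ["SEC"]))  # consistent with C# -> SEC, by accident
print(holds(two_sections, ["C#"], ["SEC"]))      # refuted by the new section
print(holds(two_sections, ["C#"], ["CR"]))       # still consistent with C# -> CR
```

A result of True only means that this instance is consistent with the dependency; whether the dependency is intended must still come from someone who knows the situation being modeled.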

It can easily be shown that functional dependencies satisfy the following properties: 


FD1. Projectivity: If Y ⊆ X, then X → Y. 
e.g., {C#, SEC} → C# 

FD2. Additivity: If X → Y and X → Z, then X → Y ∪ Z. 
e.g., C# → CN and C# → CR ⇒ C# → {CN, CR} 

FD3. Transitivity: If X → Y and Y → Z, then X → Z. 
e.g., {C#, SEC} → P# and P# → PN ⇒ {C#, SEC} → PN 

FD4. Augmentation: If X → Y, then X ∪ Z → Y. 
e.g., C# → CR ⇒ {C#, SEC} → CR 

FD5. Decomposition: If X → Y and Z ⊆ Y, then X → Z. 
e.g., C# → {CN, CR} ⇒ C# → CN 


Rule FD3 corresponds to the composition of functions. FD1 implies that X → X for any set X of 
attributes. 

A key for a relation R is a subset X of the attributes @ of R that satisfies the following two 
properties: 


K1: X → A for all A ∈ @. 
K2: No proper subset of X has property K1. 


There exists at least one key for any relation since the set of all attributes vacuously satisfies K1, 
and hence at least one of its subsets satisfies both K1 and K2. By K1, the values of the attributes 
in a key uniquely determine the values of all other attributes in the relation. The attributes 
{C#, SEC} are a key for the relation TOTAL-OFFERINGS since CN, CR, P#, PN are all 
functionally dependent on {C#, SEC} while PN and P# are not functionally dependent on either C# or 
SEC separately. Since CN → C#, it is easy to see that {CN, SEC} is also a key for 
TOTAL-OFFERINGS. 

An attribute is prime if it is a member of some key. Otherwise it is nonprime. In TOTAL- 
OFFERINGS, C#,CN, SEC are prime while CR, P#,PN are nonprime. 
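
The keys and prime attributes just named can be recovered by brute force from the functional dependencies. The sketch below is an illustration under our own encoding (a dependency is a pair of Python sets); the function names closure and keys are assumptions, and the search is exponential, so it is only suitable for small attribute sets.

```python
from itertools import combinations

# The dependencies assumed for TOTAL-OFFERINGS, as (left side, right side) pairs.
FDS = [
    ({"C#"}, {"CN"}), ({"CN"}, {"C#"}), ({"C#"}, {"CR"}),
    ({"C#", "SEC"}, {"P#"}), ({"P#"}, {"PN"}),
]
ATTRS = {"C#", "CN", "SEC", "CR", "P#", "PN"}


def closure(attrs, fds):
    """All attributes functionally determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return result


def keys(attrs, fds):
    """Attribute sets satisfying K1 and K2, found smallest first."""
    found = []
    for size in range(1, len(attrs) + 1):
        for cand in combinations(sorted(attrs), size):
            cand = set(cand)
            if any(k <= cand for k in found):
                continue  # a smaller key sits inside cand, so K2 fails
            if closure(cand, fds) == attrs:
                found.append(cand)
    return found


all_keys = keys(ATTRS, FDS)
prime = set().union(*all_keys)
print(sorted(sorted(k) for k in all_keys))
print(sorted(prime), sorted(ATTRS - prime))
```

For these dependencies the search reports the two keys {C#, SEC} and {CN, SEC}, in agreement with the text.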

It is now possible to define Codd’s normal forms. (As defined here, a relation is automatically 
in first normal form. A relation is in first normal form (1NF) if none of the attributes have 
elements which are themselves sets.) 

Assume R is a relation, X is a subset of the attributes of R and A is a nonprime attribute which 
is not in X. 

2NF: R is in second normal form if X → A implies that X is not properly contained in any key 
of R. 

3NF: R is in third normal form if X → A implies that X contains some key of R. 


If X contains a key, then it is not possible for X to be contained in a key (condition K2), and 
therefore 3NF implies 2NF. TOTAL-OFFERINGS is neither a 3NF nor a 2NF relation since 
C# → CR and C# is contained in the key {C#, SEC}. 

The goal is to decompose a relation R into a collection of 3NF relations R₁, ..., Rₙ which 
satisfies the lossless decomposition property (i.e., R₁ * ··· * Rₙ = R). The desirability of a 
lossless decomposition was indicated in the introduction. The following theorem can be used to 
achieve this goal. 


THEOREM 1. Assume R is a relation with attributes @. Let X, Y ⊆ @ and let Z = @ − (X ∪ Y). 
If X → Y, then the 2 projections R₁ = R[X ∪ Y] and R₂ = R[X ∪ Z] form a lossless 
decomposition of R. 

Proof. 

R₁ * R₂ = {(x, y, z) : x ∈ X, y ∈ Y, z ∈ Z, (x, y) ∈ R₁ and (x, z) ∈ R₂}. 

Clearly R ⊆ R₁ * R₂. Assume (x, y, z) ∈ R₁ * R₂. Since (x, z) ∈ R₂ = R[X ∪ Z], there exists a 
y′ ∈ Y such that (x, y′, z) ∈ R. This implies (x, y′) ∈ R₁ = R[X ∪ Y]. But (x, y) ∈ R₁, (x, y′) 
∈ R₁, X → Y implies that y = y′. Therefore (x, y, z) = (x, y′, z) ∈ R. So R₁ * R₂ ⊆ R. □ 

Note that this theorem provides a technique to generate a lossless decomposition but does not 
immediately guarantee a decomposition into 3NF relations. 


To see how this theorem can be used to generate a 3NF decomposition, consider again the 
relation TOTAL-OFFERINGS. This relation is not in 3NF. One problem, as determined earlier, 
is due to the “bad” dependency C# → CR where C# does not contain a key (CR is a nonprime 
attribute). Theorem 1 guarantees that the decomposition of TOTAL-OFFERINGS into the two 
projections O₁ = TOTAL-OFFERINGS [C#, CR] and O₂ = TOTAL-OFFERINGS 
[C#, CN, SEC, P#, PN] is lossless. In fact, O₁ is now a 3NF relation. But O₂ is not since 
P# → PN and P# does not contain a key. ({C#, SEC} and {CN, SEC} are still keys for O₂.) Then 
applying Theorem 1 to the relation O₂ using the “bad” dependency P# → PN yields the lossless 
decomposition of O₂ into O₃ = O₂[P#, PN] and O₄ = O₂[C#, CN, SEC, P#] (see Fig. 5). This time, 
both relations O₃ and O₄ are 3NF since no nonprime elements are dependent on any set that does 
not contain a key. The three relations O₁, O₃, O₄ form a lossless 3NF decomposition of 
TOTAL-OFFERINGS. 

The following theorem guarantees that this technique will always yield a desired decomposition 
for any relation. 


THEOREM 2. If a relation R is not in 3NF, then it is possible to decompose it losslessly into a 
collection R₁, ..., Rₙ of 3NF relations. 


Proof. Assume R is a relation with attributes @ which is not in 3NF. Then there is a “bad” 
functional dependency X → A where A ∉ X, A is nonprime and X does not contain a key. By 
Theorem 1, the projections R₁ = R[X ∪ A] and R₂ = R[@ − A] form a lossless decomposition. If 
R₁ and R₂ are in 3NF, the decomposition is complete. Otherwise R₁ and/or R₂ are not in 3NF. 
The technique just described can be applied to the resulting relations that are not 3NF. Each 
application yields a decomposition into two relations, each of which will contain fewer attributes 
than the original. Since there are a finite number of attributes, this process must terminate. □ 

In the relation O₄ = TOTAL-OFFERINGS [C#, CN, SEC, P#], C# → CN and C# is not a 
key. But CN is prime (CN ∈ {CN, SEC}, which is a key), so this dependency does not violate 3NF. 
However it still causes the redundancy and anomaly problems mentioned earlier. This can be 
countered by eliminating the nonprime requirement and thus strengthening the definition of 3NF. 

Assume R is a relation, X is a subset of the attributes of R and A ∉ X. 


BCNF: R is in Boyce-Codd Normal Form if X → A implies that X contains a key. 


Then O₄ is in 3NF but not BCNF. However it can be decomposed into O₅ = O₄[C#, CN] and 
O₆ = O₄[C#, SEC, P#]. By Theorem 1, O₅ and O₆ form a lossless decomposition. It is also easy to 
check that O₅ and O₆ are in BCNF (see Figure 6). Thus the relations O₁, O₃, O₅ and O₆ form a 
lossless BCNF decomposition of TOTAL-OFFERINGS. Clearly, Theorem 2 can be modified to 
yield the following theorem. 


THEOREM 3. If a relation is not in BCNF, then it is possible to decompose it losslessly into a 
collection of BCNF relations. □ 


The decomposition technique described here is not without problems. In particular, it does not 
generate a unique decomposition nor will it necessarily generate a decomposition with a minimal 
number of relations. Consider the BCNF decomposition of TOTAL-OFFERINGS generated above (Fig. 
6). Since C# → CN and C# → CR, then C# → {CN, CR}. If this dependency had been used 
first, the BCNF decomposition would have been COURSES, PROFESSORS and OFFERINGS 
as in the introduction (see Figs. 3 & 4). This decomposition has only 3 BCNF relations and would 
be preferable since it would require less space. There are algorithms to insure that a minimal 
number of 3NF relations are generated [7]. These algorithms involve more theoretical aspects of 
the set of functional dependencies, some of which are described in Part 4. But there is no known 
algorithm to use one minimal 3NF decomposition to help generate the remaining minimal 
decompositions. 
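
The effect of splitting on the combined dependency C# → {CN, CR} can be sketched as follows. This is an illustration rather than the procedure of Theorem 3 verbatim: as an assumption of ours, it always splits on the full set of attributes determined by a "bad" left side, and it works on attribute sets only, not on stored tuples. The names find_violation and bcnf_decompose are hypothetical, and the search is exponential in the number of attributes.

```python
from itertools import combinations

FDS = [
    ({"C#"}, {"CN"}), ({"CN"}, {"C#"}), ({"C#"}, {"CR"}),
    ({"C#", "SEC"}, {"P#"}), ({"P#"}, {"PN"}),
]


def closure(attrs, fds):
    """All attributes functionally determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return result


def find_violation(schema, fds):
    """Return (X, Y) where X -> Y is nontrivial in schema and X contains no key, else None."""
    for size in range(1, len(schema)):
        for x in combinations(sorted(schema), size):
            x = set(x)
            determined = closure(x, fds) & schema
            if x < determined < schema:
                return x, determined - x
    return None


def bcnf_decompose(schema, fds):
    """Split on a "bad" dependency X -> Y into X ∪ Y and schema − Y, then recurse."""
    bad = find_violation(schema, fds)
    if bad is None:
        return [schema]
    x, y = bad
    return bcnf_decompose(x | y, fds) + bcnf_decompose(schema - y, fds)


parts = bcnf_decompose({"C#", "CN", "SEC", "CR", "P#", "PN"}, FDS)
print(sorted(sorted(p) for p in parts))
```

With this choice of split the sketch arrives at the attribute sets of COURSES, PROFESSORS and OFFERINGS. A different search order could produce a different, equally valid BCNF decomposition.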

In addition, the decomposition may not always yield “natural” relations. Consider the relation 
R with attributes C# (course number), SEC (section), P# (professor number) and O# (the 
unique office number for a professor), where we assume professors do not share offices. Then 

C#    CN 
111   Calculus I 
112   Calculus II 
243   Linear Algebra 
318   Analysis 


O₅ = O₄ [C#, CN] 


C#    SEC   P# 
111   1     125 
111   2     382 
111   3     418 
111   4     125 
111   5     432 
112   1     153 
112   2     418 
243   1     125 
243   2     382 
243   3     432 
318   1     153 


O₆ = O₄ [C#, SEC, P#] 


FIG. 6 


{C#, SEC} → P# while P# → O# and O# → P#. Both of these last two dependencies are 
“bad.” Apply the algorithm using O# → P# to generate R[O#, C#, SEC] and R[O#, P#]. 
This is a BCNF decomposition. (Note that {C#, SEC} → O# by the transitive property.) But it is 
not likely that anyone would have a use for a relation that stored the course number, section and 
office number of the professor teaching the course. General decomposition algorithms do not 
provide a means to avoid such unnatural decompositions. 

Theorems 2 and 3 show that a relation can always be decomposed into either a 3NF or a 
BCNF collection. But a BCNF decomposition may cause another problem. Consider the relation 
SCHEDULE with attributes Cl (class, meaning course number and section), S# (student 
number), and T (time) (Fig. 7). At any time, a student has only one class scheduled but a class can 
have more than one student. In addition, a class is scheduled for only one time. The functional 
dependencies for SCHEDULE are {T, S#} → Cl and Cl → T. Thus {T, S#} is a key for the 
relation. Since T is prime, the dependency Cl → T does not violate 3NF. But it does violate 
BCNF. The decomposition algorithm can be applied to SCHEDULE to yield the 2 BCNF 

Cl      S#      T 
111-1   12654   1:00 
111-1   12785   1:00 
111-2   13865   2:00 
111-2   17642   2:00 
111-2   18725   2:00 
243-1   12654   3:00 


SCHEDULE 


FIG. 7 


relations S₁ = SCHEDULE [Cl, T] and S₂ = SCHEDULE [Cl, S#]. This decomposition is 
lossless. But the dependency {T, S#} → Cl in SCHEDULE is not a dependency in either S₁ or 
S₂. Then times could be assigned to classes and inserted in S₁ without violating any dependencies 
in S₁ while students are being assigned to classes and inserted in S₂, without violating any 
dependencies in S₂. This could result in scheduling conflicts (e.g., a student being assigned to two 
classes which meet at the same time). This problem could be handled by adding interrelational 
dependencies to the database but this is very awkward and is not done in practice. It is apparent 
that the restricted goal of avoiding redundancies and anomalies in a lossless decomposition does 
not eliminate all the problems involved with a database. The issue of preserving dependencies will 
be considered in Section 4. 


3. Fourth Normal Form. When some authors examined the semantics of relational databases it 
became clear that there was often structure beyond that implied by the functional dependencies. 
For example, suppose we expand the database by adding both the student number (S#) and the 
possibility of team-teaching. One possible instance is given by the relation REGISTRATION in 
Fig. 8. Our concept of key is challenged, for now the only nonprime attributes are PN and CR. 
We still have the functional dependencies P# → PN, CN ↔ C#, C# → CR, and their 
implications, but we can no longer assert {C#, SEC} → P# or {C#, SEC} → S#. Nonetheless, a class 
determines its professors and students. For such situations Delobel [16], Fagin [17], and Zaniolo 


C#    CN         SEC   CR   PN      P#    S# 
318   Analysis   1     3    Smith   153   17825 
318   Analysis   1     3    Jones   125   17825 
318   Analysis   1     3    Smith   153   16245 
318   Analysis   1     3    Jones   125   16245 
318   Analysis   1     3    Smith   153   24692 
318   Analysis   1     3    Jones   125   24692 


REGISTRATION 


FIG. 8 
[29] introduced the notion of multivalued dependencies: if a relation R has attribute set @, and if 
X, Y ⊆ @, then X multidetermines Y, denoted by X →→ Y, if whenever r₁, r₂ ∈ R and r₁[X] = 
r₂[X], then there is an r₃ ∈ R such that r₃[X ∪ Y] = r₁[X ∪ Y] and r₃[@ − Y] = r₂[@ − Y]. The 
definition is symmetric in Y and @ − Y, so X →→ Y if and only if X →→ @ − Y. In 
REGISTRATION, {C#, CN, CR, SEC} →→ {PN, P#} and {C#, CN, CR, SEC} →→ S#, for if in a given 
section of a given course professor A has student 1 and professor B has student 2, then in that 
same section of that course professor B has student 1 and professor A has student 2. 

In such situations one should project onto smaller relations to separate independent facets of 
the relation. The justification for this is contained in the following proposition. The proof is 
straightforward and will be omitted. 


PROPOSITION 1. Let R be a relation with attributes @, and let X, Y ⊆ @. Then X →→ Y if and 
only if R = R[X ∪ Y] * R[@ − Y]. □ 
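
Proposition 1 gives a direct way to test a multivalued dependency on an instance. The sketch below applies it to the projection of REGISTRATION (Fig. 8) onto C#, SEC, P#, S#. The tuple encoding and the names as_relation, project, join and multidetermines are our own assumptions, and the test assumes X and Y are disjoint.

```python
ATTRS = ("C#", "SEC", "P#", "S#")
# REGISTRATION restricted to four attributes, as (C#, SEC, P#, S#) rows.
ROWS = {(318, 1, p, s) for p in (153, 125) for s in (17825, 16245, 24692)}


def as_relation(attrs, rows):
    """Encode each tuple as a frozenset of (attribute, value) pairs."""
    return {frozenset(zip(attrs, row)) for row in rows}


def project(relation, attrs):
    """Restrict every tuple to the given attributes."""
    attrs = set(attrs)
    return {frozenset((a, v) for a, v in t if a in attrs) for t in relation}


def join(r, s):
    """Natural join: merge every pair of tuples that agree on shared attributes."""
    out = set()
    for t in r:
        for u in s:
            merged = dict(t)
            if all(merged.get(a, v) == v for a, v in u):
                merged.update(u)
                out.add(frozenset(merged.items()))
    return out


def multidetermines(relation, all_attrs, x, y):
    """Test X ->-> Y via Proposition 1 (X and Y assumed disjoint)."""
    x, y = set(x), set(y)
    left = project(relation, x | y)
    right = project(relation, set(all_attrs) - y)
    return join(left, right) == relation


full = as_relation(ATTRS, ROWS)
# Remove one professor/student pairing: the two facets are no longer independent.
partial = {t for t in full if dict(t)["P#"] != 125 or dict(t)["S#"] != 24692}

print(multidetermines(full, ATTRS, ["C#", "SEC"], ["P#"]))
print(multidetermines(partial, ATTRS, ["C#", "SEC"], ["P#"]))
```

Deleting a single tuple is enough to break the dependency, because the join of the two projections then produces a tuple that is missing from the instance.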


As with functional dependencies, multivalued dependencies have several properties. Among 
these are: 


MD1: X →→ Y iff X →→ @ − Y iff X →→ Y − X. 

MD2: If X →→ Y and Y →→ Z, then X →→ Z − Y. 

MD3: If X →→ Y and V ⊆ W, then {W, X} →→ {V, Y}. 

MD4: If X → Y, then X →→ Y. 

MD5: If X →→ Z and Y → Z′, where Z′ ⊆ Z and Y ∩ Z = ∅, then X → Z′. 


Proposition 1 shows multivalued dependencies can be viewed either as semantic constructs or 
as the extrema of the theory of lossless binary joins. This latter view in particular suggests another 
normal form, first defined by Fagin [17]: 


4NF: A relation R is in fourth normal form if whenever X →→ Y holds in R, then X → A 
holds for each attribute A of R. 


There are two immediate questions: can every relation be decomposed into a lossless family of 
relations, each of which is in 4NF, and what is the relationship of 4NF to the other normal forms? 
The former question is handled in the same manner as used to prove that every relation has a 3NF 
decomposition, using any “bad” dependency to create two subrelations. To answer the latter 
question we assert that 4NF implies BCNF. To see this, suppose R is in 4NF and suppose R has a 
functional dependency X → Y. By property MD4, X →→ Y, so by the definition of 4NF, if A is 
any attribute of R, then X → A. Hence R is in BCNF. 

Multivalued dependencies are deceptively similar to functional dependencies, so we should 
point out differences. For example, there is no decomposition rule, e.g., in REGISTRATION we 
know {C#, CN, CR, SEC} →→ {P#, PN}, but it is not true that {C#, CN, CR, SEC} →→ PN. 
There is a weak decomposition given in the following proposition. 


PROPOSITION 2. Let R be a relation with attributes @, and suppose X →→ Y in R. Let Y′ ⊆ Y 
and Z ⊆ @ − (X ∪ Y). Then X →→ Y′ in R′ = R[X ∪ Y′ ∪ Z]. 


Proof. Let r₁, r₂ ∈ R′ be such that r₁[X] = r₂[X], and let s₁, s₂ ∈ R be such that sᵢ[X ∪ Y′ ∪ 
Z] = rᵢ for i = 1, 2. Then there is an s₃ ∈ R such that s₃[Y] = s₁[Y] and s₃[@ − Y] = s₂[@ − Y]. 
If r₃ = s₃[X ∪ Y′ ∪ Z], then r₃[Y′] = r₁[Y′], r₃[X ∪ Z] = r₂[X ∪ Z]. □ 


The loss of decomposition (property FD5) leads to difficulties because we must be more careful 
when trying to reduce a set of multivalued dependencies to a simpler equivalent set. More 
troublesome problems occur when we try to combine projections and dependencies. For 
functional dependencies, X → Y in R[X ∪ Y] if and only if X → Y in R, but X →→ Y is always true 
in R[X ∪ Y]. Even worse, Fagin [17, Table V] gives an example of a relation S with attributes 
W, X, Y, Z such that X →→ Y in S[X ∪ Y ∪ Z], but there are no nontrivial multivalued 
dependencies in S. In this case S is said to have an embedded multivalued dependency, denoted 
X →→ Y | Z. The proof of the existence of a 4NF decomposition for every relation is still valid in 
the presence of embedded multivalued dependencies, but their presence complicates the selection 
of a good decomposition. They also lead to problems in axiomatizing dependencies. Fortunately, 
Fagin asserts that such embedded dependencies rarely occur in practice [17]. 

As an example of a 4NF decomposition, we will decompose the relation REGISTRATION in 
Fig. 8. We first remove the simple dependencies C# → CR, C# ↔ CN, and P# → PN by 
forming the relations COURSES, with attributes {C#, CN, CR}, PROFESSORS, with attributes 
{P#, PN}, and a temporary relation R1, with attributes {C#, SEC, P#, S#}. Notice that 
REGISTRATION = COURSES * R1 * PROFESSORS, and both COURSES and PROFESSORS are 
in 4NF. To decompose R1 further, we note we have {C#, SEC} →→ {P#, PN} in 
REGISTRATION, so by Proposition 2 we have {C#, SEC} →→ P# in R1. We form the relations 
TEACHING-ASSIGNMENTS, with attributes {C#, SEC, P#}, and COURSE-WORK, with attributes 
{C#, SEC, S#}. Each of these subrelations is in 4NF, and their join is R1, so we have produced 
the desired decomposition. Figure 9 shows the resulting values. 


C#    SEC   P# 
318   1     153 
318   1     125 


TEACHING-ASSIGNMENTS = REGISTRATION [C#, SEC, P#] 


C#    SEC   S# 
318   1     17825 
318   1     16245 
318   1     24692 


COURSE-WORK = REGISTRATION [C#, SEC, S#] 


FIG. 9 


In our example the 4NF version is superior to the less decomposed versions. However, since 
4NF implies BCNF, there are examples, mentioned earlier, where the 4NF may force decomposi- 
tions which do not preserve functional dependencies. To avoid such problems, there is also a weak 
fourth normal form (W4NF): a relation R with attributes @ is in W4NF if it is in 3NF and 
whenever X →→ Y, then @ = X ∪ Y. Clearly 4NF implies W4NF implies 3NF. As before, a 
simple cardinality argument shows that every relation has a W4NF decomposition. 4NF is more 
widely discussed than W4NF primarily because it was introduced first, but some of the early 
semantic arguments may be viewed as arguing more for W4NF than for 4NF. 

One can go beyond multivalued dependencies and define join dependency [1], [26]. Given a 
relation R with attributes @ and a collection {Xᵢ}, i = 1, ..., n, where each Xᵢ ⊆ @, then R jointly depends on 
the {Xᵢ} if R = R[X₁] * ··· * R[Xₙ]. Join dependencies seem to have arisen purely from 
theoretical considerations, pushing the theory of lossless joins to their limits. Fagin [19] uses them 
to define a join normal form (JNF) in which there are no nontrivial join dependencies. However, 
we do not pursue this because in realistic examples join dependencies never seem to arise except 
when they are implied by simpler dependencies. Further dependencies, again more motivated by 
theory than by practice, are discussed in [22]. Finally, Fagin [20] has recently introduced a further 
normalization, domain-key normal form (DKNF), which is motivated by the observation that all 
current database systems routinely will check that each data value entered is in the correct domain 
and that all keys are distinct. While it is interesting, we do not pursue this normal form primarily 
because we are not convinced that database design should be reduced to the lowest common 
denominator of database systems, especially at so early a stage in the history of databases. 


4. Dependencies. This section looks a little more deeply into the theory of dependencies. 
Assume F is the given set of functional dependencies on a set of attributes @. The closure of F, 
denoted F*, is the smallest superset of F which is closed under axioms FD1-FD3 (projectivity, 
additivity and transitivity; see Part 2). That is, any instance of a relation R with attributes @ that 
satisfies the functional dependencies in F also satisfies all the dependencies in F*. Armstrong [2] 
proved that these axioms were complete in the sense that there exists an instance of a relation R 
which has attributes @ and which satisfies the functional dependencies in F* and no others. 
(Armstrong’s proof actually involved a set of axioms equivalent to FD1-FD3.) Note that F* can 
be quite large even if F is small. For example, if A → B₁, A → B₂, ..., A → Bₙ ∈ F, then 
A → Y ∈ F* for Y any subset of {B₁, ..., Bₙ}. Since {B₁, ..., Bₙ} has 2ⁿ subsets, computing the 
closure can be a time-consuming process. However, there exist efficient algorithms to determine 
whether a particular functional dependency is in F* [3]. 

A set of dependencies G is a cover for F if G* = F*. G is a minimal cover if the following 
properties hold: 


C1. The right side of every dependency in G consists of a single attribute. 

C2. G has no redundant dependencies, where a dependency X → A is redundant if G* = (G − 
{X → A})*. 

C3. The left side of every dependency in G has no redundant attributes, where Z is a 
redundant attribute of Z ∪ X → A if X → A and Z ∩ X = ∅. 


The existence of a cover satisfying C1 for any set of functional dependencies F is an easy 
consequence of the additive (FD2) and decomposition (FD5) axioms. To generate a cover 
satisfying C2, take a cover of F which satisfies C1 and consider each dependency in some order. If 
a dependency is redundant, remove it from the set. To also satisfy C3, check each dependency and 
each attribute in its left side in some order. If an attribute is redundant, then remove it from the 
dependency. This will yield a cover satisfying C3, and thus every set of dependencies has a 
minimal cover. A minimal cover is not necessarily unique since considering the dependencies in a 
different order may yield different minimal covers. 
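
The construction of a minimal cover can be sketched as follows. This is an illustration under our own assumptions: dependencies are pairs of Python sets, the names closure and minimal_cover are hypothetical, and left sides are trimmed (C3) before redundant dependencies are dropped (C2), a common ordering that differs from the order in which the steps are listed above. The input adds two deliberately superfluous dependencies to those of TOTAL-OFFERINGS.

```python
def closure(attrs, fds):
    """All attributes functionally determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return result


def as_sets(deps):
    """Convert (frozenset, attribute) pairs to the (set, set) form closure expects."""
    return [(set(lhs), {a}) for lhs, a in deps]


def minimal_cover(fds):
    # C1: split every right side into single attributes.
    cover = [(frozenset(lhs), a) for lhs, rhs in fds for a in sorted(rhs)]
    # C3: drop left-side attributes that the rest of the left side makes unnecessary.
    for i, (lhs, a) in enumerate(cover):
        for z in sorted(lhs):
            smaller = cover[i][0] - {z}
            if smaller and a in closure(smaller, as_sets(cover)):
                cover[i] = (smaller, a)
    cover = list(dict.fromkeys(cover))  # remove exact duplicates, keep order
    # C2: drop each dependency that the remaining ones already imply.
    for dep in list(cover):
        rest = [d for d in cover if d != dep]
        if dep[1] in closure(dep[0], as_sets(rest)):
            cover = rest
    return cover


FDS = [
    ({"C#"}, {"CN", "CR"}), ({"CN"}, {"C#"}),
    ({"C#", "SEC"}, {"P#", "PN"}), ({"P#"}, {"PN"}),
    ({"C#", "CN"}, {"CR"}),  # CN is a redundant left-side attribute here
]
result = minimal_cover(FDS)
for lhs, a in result:
    print(sorted(lhs), "->", a)
```

On this input the sketch keeps CN → CR and discards C# → CR; processing the dependencies in another order would keep C# → CR instead, which illustrates why a minimal cover need not be unique.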

The theory of functional dependencies (closure and covers) has been related to other mathe- 
matical areas. Delobel and Casey [15] developed an equivalence between functional dependencies 
and Boolean functions such that axioms applied to one are equivalent to analogous axioms 
applied to the other. Fagin [18] developed an equivalence between functional dependencies and 
implication statements of propositional logic. In particular, he showed that the closure of a set of 
functional dependencies is equivalent to the closure under analogous operations of the equivalent 
set of implication statements. Readers wishing to pursue such connections are encouraged to read 
[21]. 

It is now possible to formalize what it means to preserve dependencies. Assume Rᵢ is a 
projection of the relation R. The set Fᵢ of projected dependencies for Rᵢ is the set of all 
dependencies, X → Y, in F* such that all the attributes in X ∪ Y are attributes of Rᵢ. The 
decomposition of R into the relations R₁, ..., Rₙ is dependency preserving if F* 
= (F₁ ∪ ··· ∪ Fₙ)* where Fᵢ is the set of projected dependencies for Rᵢ, i = 1, ..., n. 
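
This definition can be tested without writing out F* or the projected sets explicitly. The sketch below is one standard way to do so, stated under our own assumptions: it grows the set of attributes reachable from a left side using only what each schema can see, and the names preserved and dependency_preserving are hypothetical. It is applied to the SCHEDULE relation of Fig. 7.

```python
def closure(attrs, fds):
    """All attributes functionally determined by attrs under fds."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return result


def preserved(lhs, rhs, schemas, fds):
    """Is lhs -> rhs implied by the dependencies projected onto the schemas?"""
    reach = set(lhs)
    changed = True
    while changed:
        changed = False
        for schema in schemas:
            # Attributes of this schema determined by what is already reached inside it.
            gained = closure(reach & schema, fds) & schema
            if not gained <= reach:
                reach |= gained
                changed = True
    return rhs <= reach


def dependency_preserving(schemas, fds):
    """True if every given dependency survives the decomposition."""
    return all(preserved(lhs, rhs, schemas, fds) for lhs, rhs in fds)


SCHEDULE_FDS = [({"T", "S#"}, {"Cl"}), ({"Cl"}, {"T"})]
split = [{"Cl", "T"}, {"Cl", "S#"}]  # the schemas of S1 and S2

print(dependency_preserving(split, SCHEDULE_FDS))
print(dependency_preserving([{"Cl", "S#", "T"}], SCHEDULE_FDS))
```

The first call reports that the split into S₁ and S₂ loses {T, S#} → Cl, while leaving SCHEDULE whole trivially preserves both dependencies.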

As the relation SCHEDULE (Fig. 7) shows, it is not always possible to find a lossless BCNF 
decomposition which also preserves dependencies. It is possible to determine whether such a 
decomposition exists. Form all possible dependency preserving decompositions and then check 
each one to see if it is BCNF. However, this algorithm requires so much time it is unusable in 
practice, and so far no suitable algorithm has been developed. In fact, [4] showed that this 
problem is NP-hard, which means it is not likely a good algorithm can be found. 

On the other hand, there always exists a lossless 3NF decomposition which preserves depend- 
encies. This is accomplished by the following algorithm: 


Decomposition Algorithm. Assume R is a relation and F is a minimal cover of the functional 
dependencies for R. 


(1) If an attribute does not appear on the left or right side of any dependency in F, then it can 
form a relation on its own and be deleted from R. 

(2) Partition F into groups such that all dependencies in a group have the same set of 
attributes on the left. 

(3) Combine the dependencies in each group into a single dependency using the additive axiom 
(FD2). 

(4) For each of the resulting dependencies, X → Y, form the projection R[X ∪ Y]. 


The resulting decomposition is clearly dependency preserving. Since a minimal cover has no 
redundant attributes on the left side of any dependency, the set of attributes X is a key for 
R[X ∪ Y]. Thus it is in 3NF. As it stands, such a decomposition may be lossy. But if the 
projection R[X], where X is a key for R, is included in the decomposition, then it is lossless. This 
can be checked by applying the lossless join algorithm of Aho, Beeri, and Ullman [1]. 

The relation TOTAL-OFFERINGS (Fig. 1) can be decomposed using this algorithm. The 
initial set F (see Part 2) of functional dependencies is in fact minimal. The decomposition would 
consist of the following projections: TOTAL-OFFERINGS [C#, CN, CR], TOTAL-OFFERINGS 
[CN, C#], TOTAL-OFFERINGS [P#, PN], TOTAL-OFFERINGS [C#, SEC, P#], and 
TOTAL-OFFERINGS [C#, SEC], since C#, SEC is a key for TOTAL-OFFERINGS. Since this 
set F has other minimal covers, the decomposition is not unique. Nor is it minimal. But the 
collection can be reduced by eliminating projections whose attributes are contained in another 
projection. In this example, the remaining projections would be TOTAL-OFFERINGS 
[C#, CN, CR], TOTAL-OFFERINGS [P#, PN], and TOTAL-OFFERINGS [C#, SEC, P#]. In general, 
the decomposition algorithm yields neither unique nor minimal decompositions. 
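
Steps (2) to (4) of the decomposition algorithm, together with the lossless join test, can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the names `synthesize_3nf` and `is_lossless` are hypothetical, step (1) is omitted, the key is supplied by hand, and the set F is inferred from the projections listed above, since Part 2 is not reproduced here. The lossless test is a simple tableau version of the algorithm of [1], written from its usual textbook description.

```python
def synthesize_3nf(fds, key):
    """Steps (2)-(4) of the decomposition algorithm, plus the key projection.

    fds is a list of (left, right) pairs of attribute sets, assumed to be a
    minimal cover; key is a key of the whole relation, supplied by the caller.
    """
    groups = {}
    for left, right in fds:
        # Steps (2) and (3): merge dependencies sharing the same left side.
        groups.setdefault(frozenset(left), set()).update(right)
    # Step (4): one projection R[X u Y] per merged dependency X -> Y.
    schemas = [left | frozenset(right) for left, right in groups.items()]
    schemas.append(frozenset(key))  # the extra projection on a key of R
    schemas = list(dict.fromkeys(schemas))  # drop exact duplicates, keep order
    # Reduce: discard projections properly contained in another projection.
    return [s for s in schemas if not any(s < t for t in schemas)]


def is_lossless(attributes, schemas, fds):
    """Tableau test: True if joining the projections recovers the relation."""
    # Row i holds "a" in the columns of schema i and a unique symbol elsewhere.
    rows = [{A: "a" if A in s else ("b", i, A) for A in attributes}
            for i, s in enumerate(schemas)]
    changed = True
    while changed:
        changed = False
        for left, right in fds:
            for r1 in rows:
                for r2 in rows:
                    if r1 is r2 or any(r1[A] != r2[A] for A in left):
                        continue
                    for B in right:
                        if r1[B] == r2[B]:
                            continue
                        # Equate the two symbols, preferring "a".
                        keep = "a" if "a" in (r1[B], r2[B]) else r1[B]
                        drop = r2[B] if r1[B] == keep else r1[B]
                        for r in rows:
                            if r[B] == drop:
                                r[B] = keep
                        changed = True
    return any(all(r[A] == "a" for A in attributes) for r in rows)


# Dependencies of TOTAL-OFFERINGS as inferred from the projections in the text.
F = [({"C#"}, {"CN", "CR"}), ({"CN"}, {"C#"}),
     ({"P#"}, {"PN"}), ({"C#", "SEC"}, {"P#"})]
ATTRIBUTES = {"C#", "CN", "CR", "P#", "PN", "SEC"}
projections = synthesize_3nf(F, key={"C#", "SEC"})
```

With this F the sketch returns the three projections on [C#, CN, CR], [P#, PN], and [C#, SEC, P#], and the tableau test reports that decomposition as lossless.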

Applying this algorithm to the relation SCHEDULE [C1, S#, T] yields the projections 
SCHEDULE [C1, T] and SCHEDULE [C1, S#, T]. After simplifying, the only remaining relation 
is the original one itself, which is in 3NF but not BCNF. 

So far, none of this theory takes into account other possible connections between attributes. 
For example, suppose X and Y are attributes of a relation R for which $r_1, r_2 \in R$, $r_1 \neq r_2$, 
$r_1[X] = r_2[X]$ implies $r_1[Y] \neq r_2[Y]$. To exploit this information, you want the decomposition to 
keep X and Y together. Bernstein [7] suggested that in this situation we introduce a dummy 
attribute $\theta$ and include the functional dependency $X, Y \to \theta$. After the decomposition algorithm 
has been applied, the dummy attribute is dropped. (Ling, Tompa, and Kameda [23] also study this 
problem.) Apparently such connections rarely occur in practice, or if they do occur, they are 
ignored. 

The concept of preserving multivalued dependencies has been studied, but not as extensively. 
Let D be the given set of functional and multivalued dependencies for a relation R. The closure of 
D, denoted by $D^+$, is the smallest superset of D which is closed under axioms FD1-FD3 and 
MD1-MD5. Beeri, Fagin and Howard [6] showed that these axioms are complete. The computa- 
tion of $D^+$ may again require an amount of time which is an exponential function of the number 
of attributes and given dependencies. However, there is an efficient algorithm to determine 
whether a particular multivalued dependency is in $D^+$ [3]. As shown earlier, there always exists a 
decomposition of a relation into 4NF. However, since 4NF implies BCNF, it is not always 
possible to generate a 4NF decomposition that preserves functional dependencies. The existence 
of embedded dependencies also complicates the decomposition process. 


5. Conclusion. In this paper we have roughly followed the historical development of the design 
of relational databases, which essentially started with the early papers of Codd [9], [10], [11]. Codd 
[11] listed the advantages of the relational database as being: 


1) a high degree of data independence; 

2) a community view of the data of spartan simplicity so that a wide variety of users can 
interact with a common model; 

3) a simplification of the job of the database administrator; 

4) a theoretical foundation for database management; 

5) a merging of fact retrieval and file management fields; 

6) a lifting of database application programming to a level where sets are treated as operands. 


There then followed a period of discovering various decompositions and normal forms [17], [19], 
[20], [22]. Normal forms are motivated by the desire to compress the amount of data being stored, 
without losing the ability to regenerate all the original data. Unfortunately, as demonstrated in 
Part 2, this can run counter to attempts to preserve other information, such as functional 
dependencies, and if applied blindly may lead to relations being stored which have no natural 
interpretation. The role of the semantics of the database, i.e., what the data and relations mean, as 
opposed to only their syntax, i.e., the ways they can be combined, is becoming increasingly 
important. We expect that considerable effort will be spent trying to combine the space 
efficiencies of the normal forms with the demands that the relations be meaningful and that no 
information be lost. One proposal concerning semantics is the notion of a well-defined relation 
offered by Smith and Smith [27]. While we cannot easily give a precise logical definition of well- 
defined, the major requirement of such a relation is that one be able to assign to it and all of its 
attributes names which are natural. The relation in Part 2 with attributes course number, section, 
and office number of the professor teaching the course is not well-defined since there is no natural 
name for it. Other semantic concerns are discussed by Codd [12]. 

There are a large number of closely related, mathematically oriented problems which we have 
only hinted at. For example, in Part 4 we mentioned that some decomposition questions have 
been shown to be NP-hard. One such question is the existence of a BCNF decomposition that 
preserves dependencies [4]. This is not necessarily as bad as it seems, since in practice a person can 
usually decide this problem quite quickly. But as the determination of such decompositions is 
increasingly being performed by computers, the result does indicate that attention needs to be 
focused on finding computationally efficient heuristics to help find such a decomposition. In particular, some 
effort has been spent to find subclasses of relations for which a desired decomposition can be 
found quickly. 

One closely related area which was not touched in this paper is the problem of actually 
accessing the data stored in a relational database. For example, to answer the query “What 
courses are taught by both Professor Jones and Professor Thomas?”, how should one combine the 
relations in Fig. 4 to answer this question most efficiently? Of course, the first problem is to 
translate English queries into mathematical statements, but once this has been done, one sees that 
a given problem can be solved in many ways, some of which are far more efficient than others. A 
great deal of work has been done on this problem (see, e.g., Ullman, Chap. 6 [28]), but these 
considerations go beyond the aims of this paper. 

One final, important issue concerns the practicality of using relations as the foundation of an 
actual database management system (DBMS). In his Turing Award acceptance lecture, delivered 
in November 1981 [13], Codd stated: 

There has been no shortage of skepticism concerning the practicality of the relational approach 
to database management... . Instead of welcoming a theoretical foundation as providing 
soundness, the attitude seems to be: if it’s theoretical, it cannot be practical. The absence of a 
theoretical foundation for almost all nonrelational DBMS is the prime cause of their 
ungepotchket quality. (This is a Yiddish word, one of whose meanings is patched up.) 


Despite early resistance, relational DBMS seem to be gaining acceptance. Two commercially 
available systems are Tandem Computer Corporation’s ENCOMPASS and IBM’s System R, and 
early reports on their performance are positive [13]. 


UNSOLVED PROBLEMS 


EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions ordinarily 
encountered in undergraduate mathematics. Each problem should be accompanied by relevant references (if any 
are known to the author) and by a brief description of known partial results. Manuscripts should be sent to 
Richard Guy, Department of Mathematics and Statistics, The University of Calgary, Calgary, Alberta, Canada 
T2N 1N4. 


A MISCELLANY OF ERDŐS PROBLEMS 


Paul Erdős continues to pour out problems. Here are several that he has handed to me in 
recent months. 


If $1 \le a_1 < a_2 < \cdots < a_k \le n$, how large can k be if no $a_j - a_i$ divides $a_j$ for $1 \le i < j \le k$? 


If $k > \lceil n/2 \rceil$, the least integer not less than n/2, then $a_{i+1} - a_i = 1$ for some i. On the other 
hand, by taking $a_i = 2i - 1$, we see that this is best possible. If, however, we ignore the divisor 
$a_j - a_i = 1$, then we can take the odd powers of 2 along with the odd numbers, i.e., 
$k \ge \lfloor n/2 \rfloor + \lfloor (\mathrm{lb}\, n)/2 \rfloor$, where lb is the binary (base 2) logarithm. More generally we can ask: 


Is it true that if $k > (\tfrac{1}{2} + \varepsilon)n$, then there is a “large” difference $a_j - a_i$ which divides $a_j$? 
To make this more precise, assume 
(1) $(a_j - a_i) \nmid a_j$ whenever $a_j - a_i \ge t$ 
and write max k = F(n; t) for all sequences satisfying (1). Is it true for every t that 
$\lim_{n \to \infty} F(n; t)/n = 1/2$? 
We have seen that $F(n; 1) = \lceil n/2 \rceil$ and that $F(n; 2) \ge n/2 + c \ln n$. Can you show 
$F(n; 2) < (1 + \varepsilon)n/2$? What about larger values of t? 
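
For small n the quantities above can be explored by machine. The following Python sketch reads condition (1) as "no difference $a_j - a_i \ge t$ divides $a_j$"; the function names `satisfies`, `odd_construction`, and `max_size` are hypothetical, and the exhaustive search is only feasible for very small n.

```python
from itertools import combinations


def satisfies(seq, t):
    """True if no difference a_j - a_i >= t (with a_i < a_j) divides a_j."""
    s = sorted(seq)
    return all(aj % (aj - ai) != 0
               for i, ai in enumerate(s)
               for aj in s[i + 1:]
               if aj - ai >= t)


def odd_construction(n):
    """The odd numbers up to n together with the odd powers of 2 up to n."""
    odds = set(range(1, n + 1, 2))
    odd_powers = {2 ** e for e in range(1, n.bit_length(), 2) if 2 ** e <= n}
    return sorted(odds | odd_powers)


def max_size(n, t):
    """F(n; t) by exhaustive search over subsets of {1, ..., n}."""
    for k in range(n, 0, -1):
        if any(satisfies(c, t) for c in combinations(range(1, n + 1), k)):
            return k
    return 0
```

For instance, `satisfies(odd_construction(100), 2)` checks the construction with odd powers of 2, and `max_size(n, 1)` can be compared with $\lceil n/2 \rceil$ for the first few n.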


Erdős writes that in a recent letter to him, I. Ruzsa proved that $F(n; t) < n/2 + c_t\, n/\ln n$. The 
lower bound hasn’t been improved. 


A Problem of Erdős and Marcel Erné 


Let G(n) be a graph on n vertices. Denote by f(G(n)) the number of complete graphs 
contained in G(n), and by F(n) the number of possible values of f(G(n)). For example F(1) = 1 
and, for n = 2, f(G) = 2 or 3, so F(2) = 2. For n = 3, f(G) is 3, 4, 5 or 7, so F(3) = 4. If my 
doodlings are correct, F(4) = 8 and F(5) = 16. [Another example of the strong law of small 
numbers?] But Erdős says that $F(n) = o(2^n)$ is easy to show and that we almost certainly can 
prove that 

$$\lim_{n \to \infty} F(n)^{1/n} = 2.$$ 
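
The small values of F(n) quoted above can be checked by brute force. The sketch below is an illustration only: it enumerates every labelled graph on n vertices, so it is practical for n up to about 5 or 6, and the names `count_complete_subgraphs` and `num_values` are hypothetical.

```python
from itertools import combinations


def count_complete_subgraphs(n, edges):
    """f(G): the number of nonempty vertex sets that induce a complete graph."""
    edge_set = set(edges)
    count = 0
    for size in range(1, n + 1):
        for verts in combinations(range(n), size):
            # verts is increasing, so each pair (u, v) has u < v as in edge_set
            if all(pair in edge_set for pair in combinations(verts, 2)):
                count += 1
    return count


def num_values(n):
    """F(n): how many distinct values f(G) takes over all graphs on n vertices."""
    all_pairs = list(combinations(range(n), 2))
    values = set()
    for mask in range(1 << len(all_pairs)):
        edges = [p for i, p in enumerate(all_pairs) if mask >> i & 1]
        values.add(count_complete_subgraphs(n, edges))
    return len(values)
```

Calling `num_values(5)` gives an independent check of the doodled value F(5) = 16.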



Similar questions can be asked about cliques (maximal complete subgraphs) and other types of 
subgraph (e.g., stars, circuits, trees, bipartite graphs). 


Joel Spencer improved on earlier results of Moon and Moser and of Erdős and showed that 
there is a graph G(n) which contains $n - \mathrm{lb}\, n + O(1)$ cliques of distinct size and it is easy to see 
that this result is best possible. What is the corresponding result for hypergraphs (graphs in which 
the “edges” are subsets of the vertices whose cardinality may be greater than 2)? If L(n) is the 
number of times we have to iterate the logarithm to reach a number less than e, then Erdős has 
shown that there is a 3-graph on n vertices (triple system on n elements) which contains $n - L(n)$ 
cliques of distinct size (a clique is here a maximal set of vertices all of whose subsets of size 3 are 
edges). Can it be shown that there is no such 3-graph (triple system) with $n - c$ cliques of distinct 
size, for some constant c? 


A Problem of Erdős and William T. Trotter 


Let S be a set of cardinality n and $\{A_i\}$ be a family of subsets of S with no $A_i$ containing any 
other. Suppose that for each t, if there is an $A_i$ of size t, then there are at least r different $A_i$ of 
that size; i.e., any size which occurs must occur at least r times. What is the largest number of 
subsets of different size? 

If r = 1 and n > 3, there are always $n - 2$ sets of distinct size, but you can’t find $n - 1$. If 
r > 1 and $n > n_0(r)$, there are always $n - 3$ sets of distinct size, but there aren’t $n - 2$. We have 
no satisfactory estimate of $n_0(r)$. 

The content of the previous two paragraphs will be a small subset of a forthcoming paper of 
Erdős, Szemerédi and Trotter. 


A Problem of Erdős and David E. Daykin 


Let $F = \{A_1, A_2, \ldots, A_m\}$ be a family of m subsets of a set S of size 2n. Consider the graph 
G(F) whose vertices are the $A_i$, two vertices being joined just if the corresponding $A_i$ are 
comparable, i.e., just if one contains the other. An old conjecture of Erdős is that if $m = (2 - \varepsilon)2^n$ 
and $n > n_0$, then G(F) has fewer than $2^{2n}$ edges. It is easy to see that this is false if $\varepsilon = 0$: split S 
into two parts, B and C, each of size n, and for $A_i$, $1 \le i \le 2^n$, take all the subsets of B, and for 
$A_j$, $2^n < j \le 2^{n+1}$, take the union of B with every subset of C. Then $A_i \subseteq A_j$ whenever 
$1 \le i \le 2^n < j \le 2^{n+1}$, giving $2^{2n}$ edges in G(F). 

Daykin and P. Frankl proved that if G(F) has $(1 + o(1))\binom{m}{2}$ edges (i.e., if the graph is 
complete apart from $o(m^2)$ edges), then $m^{1/n} \to 1$ as $n \to \infty$. Daykin and Erdős conjecture that 
if G(F) has $cm^2$ edges for some c > 0 and independent of n, then $m < (2 + o(1))^n$. Perhaps there 
is a C, depending on c, so that $m < C \cdot 2^n$ for sufficiently large n. If true, this conjecture is best 
possible. Also perhaps to every $\varepsilon > 0$ there corresponds an $\eta$ such that if G(F) has more than 
$m^{2-\eta}$ edges, then $m < (2 + \varepsilon)^n$. Perhaps an Erdős-Stone type result holds: if G(F) has more than 


(1 + 2) 


(1 — | edges, then m < 270 +%))/('+)_ Many further related questions can be 
asked. 


Three Games Played by Coloring Edges of Graphs 


The first game is played on the complete graph, $K_n$, on n vertices, each pair of vertices being 
joined by an edge. Left starts by coloring an edge Lilac, and Right replies by coloring another 
edge Red. They continue to color new edges alternately until all the $\binom{n}{2}$ edges are colored. Left 
wins if he can produce a larger Lilac clique (complete subgraph, $K_r$) than any of Right’s Red 
cliques. If Right produces a clique as large or larger than any of Left’s, then she wins. Erdős 
conjectures that, for n > 2, the second player, Right, wins. 


In the second game, Right colors two edges after each time Left colors one. For n > 3, can 
Right always produce a larger clique? She can for n = 4 or 5. 


In the third game they color alternately, as in the first game, but Left wins if he can produce a 
vertex with larger Lilac valence than any Red valence, i.e., if he can make more Lilac edges at a 
vertex than Right can make Red edges at any vertex. If the maximum Lilac and Red valences are 
equal, Right wins. Who does win? 


A Conjecture of Marian Deaconescu 


Let $\Pi = p_1 p_2 \cdots p_n$ be the product of the first n primes. Then 
is $\Pi + p$ prime for at least one prime p, $p_n < p < \Pi$? 


Erdős agrees with this conjecture and expects that the least such p is much smaller than $\Pi$; in fact 
that $p < n^c$ for some constant c. Deaconescu has verified the conjecture for n < 1000. 
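
The conjecture is easy to test for the first few n. The following Python sketch, with the hypothetical names `is_prime` and `least_p`, uses plain trial division and is meant only for very small n; it is not the method used for the verification mentioned above, which is not described in the source.

```python
def is_prime(m):
    """Trial-division primality test, adequate for small m."""
    if m < 2:
        return False
    i = 2
    while i * i <= m:
        if m % i == 0:
            return False
        i += 1
    return True


def least_p(n):
    """Least prime p with p_n < p < P and P + p prime, where P = p_1 p_2 ... p_n.

    Returns None if no such p exists (as happens for n = 1, where the
    interval between p_1 = 2 and P = 2 is empty).
    """
    primes = []
    candidate = 2
    while len(primes) < n:
        if is_prime(candidate):
            primes.append(candidate)
        candidate += 1
    product = 1
    for q in primes:
        product *= q
    for p in range(primes[-1] + 1, product):
        if is_prime(p) and is_prime(product + p):
            return p
    return None
```

For n = 4, for example, the product is 210; p = 11 fails because 221 = 13 · 17, and p = 13 succeeds because 223 is prime.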


A PENTAD OF POINTED PROBLEMS 


Quite often the MONTHLY gets problems which are too unsolved to go into the elementary or 
advanced sections, even with stars on, while they may be too brief to stand alone in this section. 
Here are five examples. 


Some Problems in Geometric Probability 
Paul R. Chernoff, University of California, Berkeley, CA 94720 


Choose n points $P_1, P_2, \ldots, P_n$ at random from the unit disk in $\mathbb{R}^2$. What is the probability that 
the polygon $P_1 P_2 \cdots P_n P_1$ is convex? What is the probability that it’s a Jordan curve? If the points 
are chosen from the unit ball in $\mathbb{R}^3$, what’s the probability that the polygon is knotted? 

I have no idea how to solve these. I believe that Dvoretzky has done some possibly related 
work on Brownian motion trajectories. I think that the first question may be easy, but that the last 
one is hard. 
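
The first question can at least be explored numerically. The Python sketch below estimates the probability by simulation; it assumes that "at random" means independent and uniformly distributed in the disk, and the names `is_convex_polygon` and `estimate_convex_probability` are hypothetical. The polygon is counted as convex when, for every edge, all the remaining vertices lie strictly on one side of the line through that edge.

```python
import random


def cross(o, a, b):
    """Twice the signed area of the triangle o, a, b."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def is_convex_polygon(pts):
    """True if the closed polygon through pts, in the given order, is convex."""
    n = len(pts)
    for i in range(n):
        j = (i + 1) % n
        sides = [cross(pts[i], pts[j], p)
                 for k, p in enumerate(pts) if k != i and k != j]
        if not (all(s > 0 for s in sides) or all(s < 0 for s in sides)):
            return False
    return True


def random_point_in_disk(rng):
    """Uniform point in the unit disk, by rejection from the enclosing square."""
    while True:
        x, y = rng.uniform(-1, 1), rng.uniform(-1, 1)
        if x * x + y * y <= 1:
            return (x, y)


def estimate_convex_probability(n, trials=20000, seed=0):
    """Monte Carlo estimate of the probability that P_1 P_2 ... P_n P_1 is convex."""
    rng = random.Random(seed)
    hits = sum(
        is_convex_polygon([random_point_in_disk(rng) for _ in range(n)])
        for _ in range(trials)
    )
    return hits / trials
```

Such a simulation only suggests numerical values; it says nothing about a closed form, and the questions about Jordan curves and knots would need different tests.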


Find an Invertible Linear Combination 


Robert Hartwig, North Carolina State University at Raleigh, NC 27650 


If $Q_1, Q_2, Q_3, Q_4$ are square matrices of order n with elements in a field F and the matrix 

$$\begin{pmatrix} Q_1 & Q_2 \\ Q_3 & Q_4 \end{pmatrix}$$ 

of order 2n is invertible, are there scalars $a_1, a_2, a_3, a_4$ in F so that 
$U = a_1 Q_1 + a_2 Q_2 + a_3 Q_3 + a_4 Q_4$ is invertible? 

What about the same problem with “field” replaced by “division ring” or “the ring of 
integers”? A necessary condition for matrices over a commutative ring R with unity is that the 
splitting condition holds; that 

$$\begin{pmatrix} A & 0 \\ 0 & A \end{pmatrix} \text{ similar to } \begin{pmatrix} B & 0 \\ 0 & B \end{pmatrix}$$ 

implies A similar to B. For, if A, B are n by n matrices, then 

$$\begin{pmatrix} A & 0 \\ 0 & A \end{pmatrix} \begin{pmatrix} Q_1 & Q_2 \\ Q_3 & Q_4 \end{pmatrix} = \begin{pmatrix} Q_1 & Q_2 \\ Q_3 & Q_4 \end{pmatrix} \begin{pmatrix} B & 0 \\ 0 & B \end{pmatrix}$$ 

implies $AQ_i = Q_i B$ for each i, and hence since R is commutative, AU = UB so that an affirmative 
answer to the question implies the splitting condition. 


Pub, 1982, 224 pp, $4 (P). [ISBN: 0-486-24260-9] An unabridged and corrected republication of the 
1969 work Second Miscellany of Puzzles--Mathematical and Otherwise. Sixty-five well-illustrated, 
witty, original challenges, mostly geometrical, plus fifty "tricky or overliteral...more or less 
from everyday life" short puzzles. A truly different kind of collection. Watch out for the unex- 
pected. Highly recommended. The low price is unbeatable. JK 


General, T(13: 1), S*, L*. Basic Algebra and Geometry for Scientists and Engineers. A.J. Ellis. 
Wiley, 1982, xi + 187 pp, $24.95. [ISBN: 0-471-10174-5] Title misleads. Should read "...for stu- 
dents in the sciences and engineering." Linear systems, determinants and matrices. Vectors in two 
and three dimensions, including complex numbers. Conics and quadric surfaces. Polynomials and qua- 
dratic forms in two and three variables. Straightforward. Concise. Clear. Worked-out examples. 
Applications. Exercises. Answers to many. Useful for students about to take--or currently 
taking--a calculus course. Handy reference. JK 


General, S(18), P, L. A Panorama of Pure Mathematics As Seen by N. Bourbaki. Jean Dieudonné. 
Trans: I.G. Macdonald. Academic Pr, 1982, x + 289 pp, $29.50. [ISBN: 0-12-215560-2] A survey of 
those areas of mathematics which have been exposed in the 30+ years of the Bourbaki Seminar. 
Included are topology (algebraic, differential, but not general), geometry (differential, analytic, 
algebraic), algebra (homological, number theory, group theory, commutative, but not general), 
analysis (harmonic, ordinary differential equations, partial differential equations, ergodic theory, 
but not integration and classical analysis). A remarkable book, one which very few mathematicians 
could have written. SG 


General, L. Mathematics for the General Reader. E.C. Titchmarsh. Dover, 1981, 197 pp, $3 (P). 
[ISBN: 0-486-24172-6] Lucid introduction for laymen to the ideas of mathematics, from counting to 
calculus. Unabridged republication of the 1959 Doubleday edition. GHM 


General, P. Emil Artin: Collected Papers. Ed: Serge Lang, John T. Tate. Springer-Verlag, 1965, xvi 
+ 560 pp, $30. [ISBN: 0-387-90686-x] All the papers (but not the books or lecture notes), each in 
the language in which it was published. JD-B 


General, T(13: 1), S, L. A Number for Your Thoughts. Stephen P. Richards (Box 501, New Providence, 
NJ 07974), 1982, 207 pp, $7.95 (P). [ISBN: 0-9608224-0-2] Twenty-one easy to read, informative arti- 
cles about numbers (primes, rationals, irrationals, complex, transfinite) and number curiosities, 
properties, and conjectures. These expository essays require no mathematical background, and could 
become the focus for a continuing education course in mathematics. No exercises. LCL 


General, P. The Study of Time IV. Ed: J.T. Fraser, N. Lawrence, D. Park. Springer-Verlag, 1981, 
xxv + 286 pp, $39. [ISBN: 0-387-90594-4] Papers presented at the fourth (1979) conference of the 
International Society for the Study of Time. Concludes with a major bibliography on the literature 
of time, 1900-1980, and a cumulative index of the proceedings of all previous conferences. LAS 


General, P. Transactions of the Moscow Mathematical Society, 1982, Issue 1. Ed: Ben Silver. AMS, 
1982, v + 298 pp, $91 (P). Translation of Volume 41 (1980). 


Elementary, T(13). Intermediate Algebra. John G. Michaels, Norman J. Bloch. McGraw-Hill, 1982, 
xiii + 417 pp, $15.95. [ISBN: 0-07-041820-9] Includes graphing of conic sections and rational 
functions, introduces logarithms. No synthetic division, binomial theorem, or probability. AWR 


Precalculus, T*(13: 1). Precalculus Mathematics, A Short Course. Russell E. Thompson. U Pr of 
America, 1982, viii + 189 pp, $10 (P). [ISBN: 0-819-2634-9] Designed to provide the essentials of 
pre-calculus in one semester. Brief. Unpretentious. Without frills. Surprisingly thorough. 
Prepared from typewritten pages. Plastic binder. Exercises. Answers. Appendices. Four-place 
tables of common logs and trigonometric functions. JK 


Education, S, P, L**. Mindstorms: Children, Computers, and Powerful Ideas. Seymour Papert. Basic 
Books, 1980, viii + 230 pp, $6.68 (P). [ISBN: 0-465-04629-0] A celebration of dynamic approaches to 
geometry via LOGO and Turtle Geometry--Papert’s computer tools designed for kids. Papert’s thesis 
is that young children can explore subtle geometric, algorithmic and computational ideas if only 
they are placed in an environment with appropriate, enjoyable, playful tools. LAS 


Education, P. So You’re a Mathematics Supervisor. Ross Taylor. NCTM, 1982, 20 pp, $2.50 (P). 
[ISBN: 0-87353-199-X] Helpful hints for management in any field. While many suggestions appear to 
be common knowledge (e.g., let a secretary screen your calls), the book provides reinforcement for 
experienced supervisors and a glimpse of necessary supervisory techniques for beginners. MW 


Education, P. Mathematical Problem Solving: Issues in Research. Ed: Frank K. Lester, Joe Garofalo. 
Franklin Inst Pr, 1982, xii + 139 pp, $14.50 (P). [ISBN: 0-89168-049-7] Revisions of papers 
presented at a 1981 conference by researchers active in the field of mathematical problem solving. 
Consistent theme is the need to draw upon the work of cognitive psychologists. Much attention to 
stages of problem solving, content and organization of knowledge, and information processing 
approaches. Information on recent activities and ideas for new research. MW 


Education, P. Computers in the Classroom. Ed: Henry S. Kepner, Jr. NEA, 1982, 158 pp, $7.95 (P). 
[ISBN: 0-8106-1825-7] An introductory survey--already dated--of the ways computers are used in the 
classroom, ranging from elementary classes and computer literacy to interesting applications in high 
school mathematics. LAS 


History, S(14-18), P, L**. Aristarchus of Samos: The Ancient Copernicus. Sir Thomas Heath. Dover 
Pub, 1981, viii + 425 pp, $7 (P). [ISBN: 0-486-24188-2] Republication of Heath’s classic work of 
1913. History of Greek astronomy from Thales through Aristarchus, originator of the heliocentric 
hypothesis. Includes translation and discussion of Aristarchus’ only extant work "On the Sizes and 
Distances of the Sun and Moon." JRG 

History, S(16-17), P. Histoire et Préhistoire de L’Analyse des Données. J.-P. Benzécri. Dunod, 
1982, vii + 159 pp, 55 FF (P). [ISBN: 2-04-015467-1] A brief history of statistics by a statisti- 
cian, requiring of the reader considerable knowledge of the field. JD-B 


History, P, L. Writings of Charles S. Peirce: A Chronological Edition, Volume I, 1857-1866. Charles 
S. Peirce. Indiana U Pr, 1982, xxxv + 698 pp, $32.50. [ISBN: 0-253-37201-1] Meticulously documented 
selection from the manuscripts of this major American philosopher and scientist (1839-1914). A 
twenty volume series is projected, drawing from Peirce’s 80,000 pages of unpublished manuscripts as 
well as the 12,000 already in print elsewhere. This first volume gives unique insight into the 
early development of Peirce’s ideas on logic and philosophy of science. GHM 


History, P, L. Charles Babbage: Pioneer of the Computer. Anthony Hyman. Princeton U Pr, 1982, xv + 
287 pp, $25. [ISBN: 0-691-08303-7] This biography contains a remarkable and fascinating compilation 
of information, episodes and anecdotes from the life (1791-1871) of this leading (but unsuccessful) 
British advocate of the systematic use of science in industry. Babbage is now most famous as the 
unheeded inventor of the difference and analytical engines, precursors of today’s computers. GHM 


History, P, L. The Evolution of Dynamics: Vibration Theory from 1687 to 1742. John T. Cannon, 
Sigalia Dostrovsky. Stud. in History of Math. & Physical Sci., V. 6. Springer-Verlag, 1981, ix + 
184 pp, $39. [ISBN: 0-387-90626-6] A detailed guide to the major works in vibration theory, from 
Newton to Euler and the Bernoullis, which gave rise to the subject of dynamics of systems with many 
degrees of freedom. GHM 


History, P, L. The Prehistory of the Theory of Distributions. Jesper Lützen. Stud. in the History 
of Math. & Physical Sci., V. 7. Springer-Verlag, 1982, viii + 232 pp, $42. [ISBN: 0-387-90647-9] A 
concise, well-documented survey of several streams of mathematical thought that were eventually sub- 
sumed (in 1950) in Schwartz’s theory of distributions: duality in functional analysis, generalized 
differentiation, Sobolev’s functionals, the generalized Fourier transform, delta-functions, and de 
Rham’s currents. LAS 


History, S(18), P, L*. 100 Years of Mathematics: A Personal Viewpoint. George Temple. Springer- 
Verlag, 1981, xvi + 316 pp, $32. [ISBN: 0-387-91192-8] A synopsis of key elements in mathematical 
research, from the mid-nineteenth to the mid-twentieth century, touching on the work of over 300 
mathematicians in as many pages. Offers an intellectual scaffolding for the algebra-analysis- 
geometry paradigm of graduate school mathematics, more akin to an encyclopedic dictionary than to a 
historical treatise. Contains over 600 references to original papers. LAS 


Foundations, P. Regressive Sets and the Theory of Isols. Thomas G. McLaughlin. Lect. Notes in Pure 
& Appl. Math., V. 66. Dekker, 1982, vi + 371 pp, $49.50 (P). [ISBN: 0-8247-1337-0] An esoteric 
blend of recursion theory, set theory and algebra applied to isols, a type of degree of isolated 
sets. Addressed to specialists only. The first unified presentation of the subject. GHM 


Foundations, P, L. Ordered Sets. Ed: Ivan Rival. D Reidel Pub, 1982, xviii + 966 pp, $99. [ISBN: 
90-277-1396-0] Proceedings of the 1981 NATO Symposium on Ordered Sets. Twenty-three survey articles 
touching upon all areas of current research on ordered sets, from set theory and recursion, to lat- 
tice theory and enumeration problems, to applications in computer and social sciences. Lists nearly 
200 unsolved research problems. GHM 


Foundations, P, L. Three Concepts of Time. Kenneth G. Denbigh. Springer-Verlag, 1981, viii + 180 
pp, $19.80 (P). [ISBN: 0-387-10757-6] Philosophical analysis of time as a composite notion having 
three quite different sources: the time of consciousness having a definite "present;" the direction- 
less, presentless time of physics; and the irreversible, presentless time of thermodynamics and evo- 
lutionary sciences. Seems well written and accessible to nonspecialists. GHM 


Combinatorics, P. Combinatorial and Geometric Structures and their Applications. Ed: A. Barlotti. 
Math. Stud., V. 63. Elsevier North-Holland, 1982, viii + 292 pp, $44.25 (P). [ISBN: 0-444-86384-2] 
A collection of twenty-two papers on finite geometric structures. Some articles are expository; 
most are research reports. A high price for what amounts to a journal issue. SS 


Discrete Mathematics, T(13-14: 1), L. Understanding Finite Mathematics. James Radlow. Prindle, 
Weber & Schmidt, 1982, xi + 624 pp. [ISBN: 0-87150-328-X] The author has "tried to be informal and 
unintimidating." He is, but the price is a sometimes painfully slow pace. Examples: matrices first 
appear on page 112; linear programming on page 212; conditional probability on page 392. Usual 
applications to Markov chains, game theory, finance. Exercises, answers, tables. JS 


Number Theory, T(18), P. Introduction to Cyclotomic Fields. Lawrence C. Washington. Grad. Texts in 
Math., V. 83. Springer-Verlag, 1982, xi + 389 pp, $39. [ISBN: 0-387-90622-3] A beautiful, up-to- 
date survey of the subject; ideal for graduate students, experts in number theory, and everyone in 
between. Highlights include p-adic L-functions, Stickelberger’s theorems, measures and distribu- 
tions, and Iwasawa theory. SG 


Number Theory, S*(15-17), P*, L*. Unsolved Problems in Number Theory. Richard K. Guy. Unsolved 
Prob. in Intuitive Math., V. 1. Springer-Verlag, 1981, xviii + 161 pp, $18. [ISBN: 0-387-90593-6] 
First of a series of volumes presenting conjectures and unsolved problems (with extensive refer- 
ences) intended for students, teachers and serious amateur math buffs. This volume covers primes, 
divisibility, sequences, Diophantine equations, and additive number theory. Problems are introduced 
with definitions, examples and full references. Presumes a good first course in number theory for 
basic definitions and theorems. LAS 


Number Theory, P. Arithmetic on Modular Curves. Glenn Stevens. Progress in Math., V. 20. Bir- 
khäuser Boston, 1982, xvii + 214 pp, $15. [ISBN: 3-7643-3088-0] An exposition of the author’s 
research on the connection between arithmetic properties of certain modular curves and special 
values of associated L-functions. SG 


Linear Algebra, T(15-17: 1), L. Advanced Matrix Theory for Scientists and Engineers. Assem S. Deif. 
Halsted Pr, 1982, x + 241 pp, $34.95 (P). [ISBN: 0-470-27316-1] Covers standard topics in applied 
linear algebra, with an additional chapter on perturbation theory. Reasonable number of examples 
and many exercises, few of which are routine. Primarily intended for engineers and other practical 
users. JRG 


Algebra, S(18), P. Products of Reflections in U(p,q). Dragomir Z. Djoković, Jerry G. Malzan. 
Memoirs No. 259. AMS, 1982, vi + 82 pp, $5.20 (P). This monograph deals with products of reflec- 
tions in the unitary group U(p,q). For each A ∈ U(p,q) with det(A) = ±1 the authors have determined 
the minimum number of positive reflections needed to represent A. CEC 


Algebra, T**(14-15), L. Introduction to Algebra. A.I. Kostrikin. Trans: Neal Koblitz. Springer- 
Verlag, 1982, xiii + 575 pp, $28 (P). [ISBN: 0-387-90711-4] A Soviet abstract algebra text. The 
first part contains six chapters: Sources of Algebra; Vector Spaces; Determinants; Algebraic Struc- 
tures (groups, rings, fields); Complex Numbers and Polynomials; Roots of Polynomials. The second 
part delves more deeply into these topics with chapters on groups, representation theory and fields, 
rings and modules. Contains interesting applications, historical notes, and many perceptive com- 
ments. The organization is somewhat unusual, but the exposition is firm, concrete, and clear. This 
is an interesting algebra book, worth using as a text. SG 


Algebra, P. Geometry of Coxeter Groups. Howard Hiller. Research Notes in Math., No. 54. Pitman 
Pub, 1982, 213 pp, $19.95 (P). [ISBN: 0-273-08517-4] These notes cover the geometry of flag mani- 
folds from the algebraic and combinatorial point of view. The principal topics are the invariant 
theory of Coxeter groups and the combinatorics of Bruhat order, with the former constituting the 
main theme in studying a variety of topics that bring together geometry, group theory and combina- 
torics. SS 


Algebra, T(18), S, P. Theory of Group Representations. M.A. Naimark, A.I. Stern. Trans: Elizabeth 
& Edwin Hewitt. Grund. der math. Wissenschaften, B. 246. Springer-Verlag, 1982, ix + 568 pp, $59. 
[ISBN: 0-387-90602-9] The first two chapters, with ample examples and exercises, are devoted to the 
algebraic aspects of the theory. Later chapters are concerned with topological groups, Lie groups 
and Lie algebras, and their representations. Throughout, the theory is limited to finite- 
dimensional representations. LCL 


Algebra, P. The Cohomology of Chevalley Groups of Exceptional Lie Type. Samuel N. Kleinerman. 
Memoirs No. 268. AMS, 1982, viii + 82 pp, $5 (P). 


Algebra, P. Finite Groups III. B. Huppert, N. Blackburn. Grund. der math. Wissenschaften, B. 243. 
Springer-Verlag, 1982, ix + 454 pp, $59. [ISBN: 0-387-10633-2] The last of a three-volume series. 
Contains three long chapters on local finite group theory, Zassenhaus groups, and multiply transi- 
tive permutation groups. SG 


Algebra, P. C*-bundles and Compact Transformation Groups. Bruce D. Evans. Memoirs No. 269. AMS,
1982, vii + 63 pp, $4 (P). [ISBN: 0-8218-2269-1] Let G be a compact Lie group which acts smoothly
on a C^∞ manifold X. The author develops a geometric theory for the structure of the associated C*-
algebra, C*(G,X). LCL


Calculus, T(15-16: 1), S*, L. A Brief on Tensor Analysis. James G. Simmonds. Undergrad. Texts in 
Math. Springer-Verlag, 1982, xi + 92 pp, $18.80. [ISBN: 0-387-90639-8] From two-dimensional vectors 
to Christoffel symbols, covariant derivatives and the divergence theorem--all in under 100 pages! A 
spare, informal text, with numerous exercises. Excellent supplement to advanced mathematics and 
physics courses; also suitable as a seminar text. LAS 


Calculus, T(13). Calculus with Applications. Daniel I. Auvil. Addison-Wesley, 1982, xii + 558 pp, 
$23.95. [ISBN: 0-201-10063-0] Should be called "Calculus for the Social Sciences" or "Calculus 
Without Trigonometry" (except for the last chapter). One of the less superficial books of this 
genre. AWR 


Calculus, S(14-15), L. Mathematics 4 Checkbook. J.O. Bird, A.J.C. May. Butterworths, 1981, viii +
216 pp, $22.95; $9.95 (P). [ISBN: 0-408-00660-9; 0-408-00612-9] A collection of 300 worked problems
and 500 further problems with answers, covering review of hyperbolic functions, calculus through
elementary differential equations, Fourier series, Laplace transform, and statistics (correlation 
and regression). LCL 


Real Analysis, S(18), P. Hardy Spaces on Homogeneous Groups. G.B. Folland, E.M. Stein. Math. 
Notes, V. 28. Princeton U Pr, 1982, xii + 284 pp, $13 (P). The classical Hardy spaces, conceived in 
the context of complex analysis, developed as important tools of Fourier analysis. Modern H^p theory
has moved further in the direction of real analysis, via harmonic Hardy spaces, the duality of H^1
and BMO, and certain decomposition theorems for H^p functions. This monograph develops real variable
H^p theory on homogeneous groups. PZ


Real Analysis, T(17-18: 1, 2), S, P. The Theory of Generalised Functions. D.S. Jones. Cambridge U 
Pr, 1982, xiii + 539 pp, $59.50. [ISBN: 0-521-23723-8] A thorough, readable introduction with exer- 
cises and worked examples to the Temple-Lighthill theory of distributions and their applications. 
"Generalized functions" are continuous operators of the Schwartz class; "weak functions" are distri- 
butions. Starts from advanced calculus--necessary convergence and Lebesgue integration theory is 
developed, though quickly, in the text. One-variable theory precedes that in R^n. PZ


Real Analysis, T*(15-16: 1, 2), S*, L. Introductory Problem Courses in Analysis and Topology. 
Edwin E. Moise. Universitext. Springer-Verlag, 1982, 94 pp, $12 (P). [ISBN: 0-387-90701-7] 
Short, self-contained, yellow-brick road, to "baby" real-variable theory and elementary set- 
theoretic topology. Each chapter contains relevant definitions and theorems which the student is 
asked to prove. In the problem sets, the student is asked to resolve the status of the "proposi- 
tions" which follow--providing proofs or counterexamples. This research-like approach should prove 
to be very attractive, enjoyable, and rewarding to everyone concerned. LCL 


Complex Analysis, S(17-18), P. Factorization Theory of Meromorphic Functions and Related Topics. 
Ed: Chung-Chun Yang. Lect. Notes in Pure & Appl. Math., V. 78. Dekker, 1982, ix + 194 pp, $27.50 
(P). [ISBN: 0-8247-1834-8] Fourteen papers on questions in factorization of meromorphic functions on 
the plane. Factorization (in the sense of composition) for polynomials and entire functions is well 
understood by comparison with the meromorphic theory which, the editor asserts, is in its infancy. 
The final paper, by the editor, is a survey of progress in the area. PZ 


Complex Analysis, T(18: 1, 2), S, P. Bounded Analytic Functions. John G. Garnett. Pure & Appl. 
Math. Academic Pr, 1981, xvi + 467 pp, $59. [ISBN: 0-12-276150-2] Introduces and surveys results 
and, especially, techniques of bounded analytic functions on the unit disk. Assuming graduate com- 
plex and real analysis, the treatment is self-contained. Topics include BMO, interpolating 
sequences, and the corona construction. Much of the last five chapters is recent work, not else-
where collected. With exercises, could be an intermediate graduate text. (Extended Review, August-
September 1982.) PZ 


Complex Analysis, P. Moduli of Families of Curves and Quadratic Differentials. G.V. Kuz’mina. 
Proc. of Steklov Inst. of Math., No. 139. AMS, 1982, vii + 231 pp, $76 (P). [ISBN: 0-8218-3040-6] A 
thorough, self-contained exposition of the application of the method of extremal motions to 
geometric problems in function theory. Studies questions about minimal capacity and maximal diame-
ter. LAS


Differential Equations, T(17-18: 1), S, P. Pseudo-Differential Operators. Hitoshi Kumano-go. MIT
Pr, 1981, xviii + 455 pp, $60. [ISBN: 0-262-11080-6] Because of a wide range of applications, 
pseudo-differential operators have been shown to be a fruitful object of study. The approach is 
based on elementary calculus and Fourier transforms with the hope (probably vain) of becoming acces- 
sible to undergraduates. Three brisk chapters of theory lead to six chapters on significant appli- 
cations. Formal in style, no exercises. Appendices, bibliographies, index. JS 


Differential Equations, P. Evolution Equations and Their Applications. Ed: F. Kappel, W. Schap-
pacher. Research Notes in Math., No. 68. Pitman Pub, 1982, xviii + 313 pp, $23.95 (P). [ISBN: 0-
273-08567-0] Proceedings of the conference on differential equations and applications at the
Volksbildungsheim Schlosz Retzhof, June 2-6, 1981. Topics covered include recent developments in 
the theory of semigroups and their applications to partial differential equations, integro- 
differential equations and delay equations. Applications to epidemics, population dynamics and con- 
trol theory. Twenty papers in all. JK 


Differential Equations, T(16-17: 1), P*. Hyperbolic Boundary Value Problems. Reiko Sakamoto. 
Transl: Katsumi Miyahara. Cambridge U Pr, 1982, ix + 210 pp, $34.50. [ISBN: 0-521-23568-5] On the 
existence and uniqueness of solutions of hyperbolic initial value problems in partial differential 
equations of higher orders. Prerequisites include some knowledge of complex variables and Lebesgue 
integration. Otherwise, self-contained. Carefully written and translated with several helpful 
translator footnotes. JK 


Differential Equations, T(17), S, P. Solution of Differential Equations by Means of One-Parameter 
Groups. J.M. Hill. Research Notes in Math., No. 63. Pitman Pub, 1982, 161 pp, $18.95 (P). [ISBN: 
0-85896-893-2] Advanced teaching material. Intended for applied mathematicians and engineers con- 
cerned with obtaining solutions of differential equations. Some new material on group approach to 
linear equations. Numerous examples, exercises and problems. JK 


Differential Equations, P. Applications of Centre Manifold Theory. Jack Carr. Appl. Math. Sci., 
No. 35. Springer-Verlag, 1981, 142 pp, $14 (P). [ISBN: 0-387-90577-4] A centre manifold is a type 
of invariant manifold for a system of differential equations. Applications to bifurcation theory. 
JG 


Differential Equations, P. Homology and Dynamical Systems. John M. Franks. CBMS Reg. Conf. Ser. in 
Math., No. 49. AMS, 1982, viii + 120 pp, $14 (P). [ISBN: 0-8218-1700-0] Notes of a 1980 conference. 
The basic theme is the connection between the dynamical configuration and the homological structure 
of a system. SG 


Differential Equations, T(17-18), P, L. Partial Differential Equations, Fourth Edition. Fritz John. 
Appl. Math. Sci., V. 1. Springer-Verlag, 1982, x + 249 pp, $18. [ISBN: 0-387-90609-6] A few new 
proofs and examples; more problems, and corrections from previous editions. (Second Edition, TR, 
August-September 1975; Third Edition, TR, April 1979.) LCL 


Differential Equations, P. A Method of Generalized Characteristics. Marc A. Berger, Alan D. Sloan. 
Memoirs No. 266. AMS, 1982, v + 37 pp, $4 (P). The classical and Brownian methods of characteris-
tics are generalized to analyze evolution equations of arbitrary order. The method produces expli-
cit solutions involving ordinary integration or recursion; the formal rules of manipulation are
accessible to anyone with an advanced calculus background. LCL


Differential Equations, P. Singular Systems of Differential Equations II. S.L. Campbell. Research 
Notes in Math., No. 61. Pitman Pub, 1982, 234 pp, $19.95 (P). [ISBN: 0-273-08516-6] A continuation 
of the first volume (see TR, February 1982). In this book, more emphasis is placed on analytic and 
numerical procedures. LCL 


Differential Equations, T(13-14), S. Modelling with Differential Equations. D.N. Burghes, M.S. Bor- 
rie. Math. & its Appl. Halsted Pr, 1981, 172 pp, $24.95 (P). [ISBN: 0-470-27101-9] A series of 
elementary (and generally well-known) applications of elementary differential equations: carbon dat- 
ing, inhibited growth, art forgeries, pollution, oscillations, planetary motion, chemical kinetics, 
competing species, arms races, epidemics, and much more. Both the mathematics and the models are 
presented in simplified form, suitable for beginners. (TR, Hardcover, March 1982.) LAS 


Differential Equations, P. Lecture Notes in Mathematics-942: Theory and Applications of Singular 
Perturbations. Ed: W. Eckhaus, E.M. de Jager. Springer-Verlag, 1982, v + 363 pp, $18.50 (P).
[ISBN: 0-387-11584-6] Proceedings of an August 1981 Oberwolfach conference. IAS 


Numerical Analysis, L. Methods of Numerical Mathematics, Second Edition. G.I. Marchuk. Trans: 
Arthur A. Brown. Appl. of Math., No. 2. Springer-Verlag, 1982, xiii + 510 pp, $69. [ISBN: 0-387- 
90614-2] This work is primarily a survey of modern numerical methods having widespread application. 
It is a significant revision of the First Edition (TR, January 1977), and incorporates new material 
throughout. AO 


Numerical Analysis, P. Boundary Element Methods. Ed: C.A. Brebbia. Springer-Verlag, 1981, xxiv + 
622 pp, $59. [ISBN: 0-387-10816-5] Proceedings of an international seminar held at Irvine, Califor-
nia during July 1981. LAS


Numerical Analysis, S(17-18), P. Lecture Notes in Mathematics-916: Zweidimensionale, interpo-
lierende Lg-Splines und ihre Anwendungen. Karl-Ulrich Grusa. Springer-Verlag, 1982, vii + 238 pp,
$11.10 (P). [ISBN: 0-387-11213-8] An account, for specialists, of the theory and some applications 
of two-dimensional Lg-Splines. JD-B 


Numerical Analysis, S(17-18), P. Numerical Methods of Approximation Theory, V. 6. Ed: L. Collatz,
G. Meinardus, H. Werner. ISNM 59. Birkhauser Boston, 1982, 265 pp, $29.95. [ISBN: 3-7643-1304-8]
Papers presented at an Oberwolfach workshop, January 1981. JD-B 


Numerical Analysis, S(17-18), P. Numerical Treatment of Free Boundary Value Problems. Ed: J.
Albrecht, L. Collatz, K.-H. Hoffmann. ISNM 58. Birkhauser Boston, 1982, viii + 349 pp, $35. [ISBN: 
3-7643-1277-7] Papers presented at an Oberwolfach workshop in November 1980. JD-B 


Numerical Analysis, P. Proceedings of the 1982 Army Numerical Analysis and Computers Conference. US 
Army Research Office (P.O. Box 12211, Research Triangle Park, NC), 1982, xiv + 606 pp, (P). Papers 
from a conference at Vicksburg, Mississippi, February 3-4, 1982. LAS 


Functional Analysis, P. Lecture Notes in Mathematics-923: Functional Analysis in Markov Processes.
Ed: M. Fukushima. Springer-Verlag, 1982, 307 pp, $18 (P). [ISBN: 0-387-11484-X] This volume con-
tains 15 articles based on talks given during the summer of 1981 in Katata and Kyoto, Japan. AO


Functional Analysis, P. L^p boundedness of Fourier Integral Operators. R. Michael Beals. Memoirs
No. 264. AMS, 1982, viii + 57 pp, $4 (P). Results concerning the L^p boundedness of a class of
Fourier integral operators of interest in the theory of hyperbolic partial differential equations. 
AO 


Functional Analysis, P. Factorization of Matrix Functions and Singular Integral Operators. K. Clan- 
cey, I. Gohberg. Operator Theory, V. 3. Birkhauser Boston, 1981, x + 234 pp, $17.55. [ISBN: 3- 
7643-1297-1] Selection of material emphasizes connections between the factorization of non-singular 
matrix functions along closed contours and the solution of systems of singular integral equations. 
Generalized factorization and connections with the theory of singular integral operators are also
discussed. AWR


Functional Analysis, S(18), P. Riesz and Fredholm Theory in Banach Algebras. B.A. Barnes, et al.
Res. Notes in Math., V. 67. Pitman Pub, 1982, 123 pp, $16.95 (P). [ISBN: 0-273-08563-8] Extends 
the theory of Riesz and Fredholm linear operators on Banach spaces to the setting of Banach and C* 
algebras. Apart from an initial chapter on basic results in Banach spaces, most proofs are given in 
full. Many results are new. For the specialist. PZ 


Functional Analysis, T*(16-18: 1, 2), S*, P. Theorems and Problems in Functional Analysis. A.A.
Kirillov, A.D. Gvishiani. Trans: Harold H. McFaden. Problem Books in Math. Springer-Verlag, 1982, 
ix + 347 pp, $38. [ISBN: 0-387-90638-X] The theory covered (in only 135 pages) is about that of a 
typical one-year graduate functional analysis course with measure theory. Over 800 conveniently 
organized problems follow, ranging from entirely straightforward to very difficult. 23 “essential” 
exercises are identified in each subsection. Part III contains hints for every exercise. Inviting 
and invaluable graduate text or companion text. PZ 


Functional Analysis, S(18), P. Lecture Notes in Mathematics-934: Quadrature Domains. Makoto Sakai. 
Springer-Verlag, 1982, 133 pp, $8.80 (P). [ISBN: 0-387-11562-5] Discussion of existence and unique-
ness for positive measures on complex quadrature domains for various (analytic, harmonic, subhar-
monic) classes of functions. Applications to Dirichlet integrals, Gaussian curvature, Hele-Shaw
flows. Bibliography, index. JS 


Functional Analysis, T(17-18: 2), S, P. Functional Analysis, Second Edition. L.V. Kantorovich, G.P.
Akilov. Transl: Howard L. Silcock. Pergamon Pr, 1982, xiv + 589 pp, $25 (P); $100. [ISBN: 0-08- 
026486-7; 0-08-023036-9] Second edition of the 1959 original. An encyclopedic treatment of the 
basics of functional analysis, and, notably, certain applications to real analysis--integral opera-
tors, functional equations, the fixed-point principle, etc. Level appropriate to a graduate text,
though no exercises. Certainly a valuable reference work. PZ 


Functional Analysis, T*(16-17: 1, 2), S*, L. Linear Operator Theory in Engineering and Science.
Arch W. Naylor, George R. Sell. Appl. Math. Sci., V. 40. Springer-Verlag, 1982, xv + 624 pp, $28. 
[ISBN: 0-387-90748-3] Functional analysis in form suitable for engineers, scientists and applied 
mathematicians. In definition-theorem-proof format, but very readable with much attention to
motivation. Numerous illustrative examples. Bountiful supply of problems which extend the text. 
Appendices on integration and measure theory and on probability spaces and stochastic processes. 
Flexible to accommodate different audiences. Originally published in 1971; a welcome new edition.
JK 


Analysis, T(16-17), S, P, L. Calculus of Variations and Partial Differential Equations of the First
Order, Second Edition. C. Carathéodory. Transl: Robert B. Dean. Chelsea Pub, 1982, xix + 402 pp,
$25. A new edition of a great classic, now combined into one volume with known misprints corrected. 
(First Edition, Volume I, TR, April 1967; Volume II, TR, April 1968.) AWR 


Analysis, T(18: 1), S, P. Hankel Operators on Hilbert Space. S.C. Power. Research Notes in Math., 
No. 64. Pitman Pub, 1982, 87 pp, $13.95 (P). [ISBN: 0-273-08518-2] An account of modern aspects of
the operator and spectral theory of Hankel operators, with particular attention to those arising
from recent progress in function theory on the unit circle. Includes detailed proofs of earlier 
results and recent research. Problems and an extensive bibliography. CEC 


Analysis, P. Ergodic Theory and Dynamical Systems II: Proceedings Special Year, Maryland 1979-80. 
Ed: A. Katok. Progress in Math., V. 21. Birkhauser Boston, 1982, xi + 210 pp, $15. [ISBN: 3-7643-
3096-1] Seven survey lectures, complementing Part I (Progress in Math., V. 10; TR, May 1982), con-
cluding the proceedings of the 1979-80 special year at Maryland. LAS


Algebraic Geometry, P. Automorphic Forms, Representation Theory and Arithmetic. S. Gelbart, et al. 
Tata Inst. of Fund. Res. Stud. in Math., No. 10. Springer-Verlag, 1981, 355 pp, $17.40 (P). [ISBN: 
0-387-10697-9] Proceedings of the international colloquium held at the Tata Institute of Fundamental 
Research, Bombay, India, January 8-15, 1979. JAS 


Algebraic Geometry, T(16-17: 2), S. Ebene Algebraische Kurven. E. Brieskorn, H. Knörrer. Bir-
khauser Boston, 1981, xi + 964 pp, $24.95. [ISBN: 3-7643-3030-9] A rich and leisurely introduction
to the theory of algebraic curves in the complex projective plane. Many sketches, historical refer- 
ences, specific examples. Bibliography, but no exercises. JD-B 


Algebraic Geometry, P. Lecture Notes in Mathematics-930: Théorie de Dieudonné Cristalline II. P.
Berthelot, L. Breen, W. Messing. Springer-Verlag, 1982, xi + 261 pp, $14.90 (P). [ISBN: 0-387- 
11556-0] Generalizes "Dieudonné theory" to the case of p-divisible groups and arbitrary schemes of
positive characteristic. SG


Algebraic Geometry, T(18), P. Lecture Notes in Mathematics-935: Simple Morphisms in Algebraic
Geometry. Richard Sot. Springer-Verlag, 1982, iv + 146 pp, $8.50 (P). [ISBN: 0-387-11564-1] An
introductory text which assumes only basic topology, algebra, and commutative algebra. Describes 
various criteria for an algebra to be simple over its base field. Concludes with a discussion of 
simple morphisms of preschemes. A clearly written text. SG 


Algebraic Geometry, P. Lecture Notes in Mathematics-917: Brauer Groups in Ring Theory and Algebraic 
Geometry. Ed: F. van Oystaeyen, A. Verschoren. Springer-Verlag, 1982, viii + 300 pp, $18 (P). 
[ISBN: 0-387-11216-2] The proceedings of a conference held at the University of Antwerp in August
1981. LCL


Algebraic Geometry, P. Lecture Notes in Mathematics-946: Classes Unipotentes et Sous-groupes de 
Borel. Nicolas Spaltenstein. Springer-Verlag, 1982, ix + 259 pp, $14 (P). [ISBN: 0-387-11585-4] A 
study of the fixed points of the action of an affine algebraic group on the variety of its Borel 
subgroups. SG 

Differential Geometry, P. Global Differential Geometry of Surfaces. A. Svec. Kluwer Boston, 1981, 
153 pp, $28.50. [ISBN: 90-277-1295-6] An exposition of the author’s research on global properties of 
surfaces; the principal results describe sufficient conditions for a surface to be a sphere or a
plane. SG


Differential Geometry, T(18), S, P. Cusps of Gauss Mappings. Thomas Banchoff, Terence Gaffney, 
Clint McCrory. Pitman Pub, 1982, 88 pp, $12.95 (P). [ISBN: 0-273-08536-0] A self-contained, read-
able presentation of the authors’ research on geometric characterizations of cusps of Gauss mappings
of surfaces in R^3. Contains many pictures, examples, and references. SG


Geometry. The Geometric Vein: The Coxeter Festschrift. Ed: Chandler Davis, Branko Grünbaum, F.A.
Sherk. Springer-Verlag, 1981, viii + 598 pp, $48. [ISBN: 0-387-90587-1] A listing of Coxeter’s pub-
lications followed by a collection of papers from the May 1979 Coxeter symposium (held at the
University of Toronto). JNC 


Geometry, T(16-17: 1), S, P. Symmetrien von Ornamenten und Kristallen. Michael Klemm. Springer-
Verlag, 1982, vii + 214 pp, $16 (P). [ISBN: 0-387-11644-3] A thorough introduction to mathematical
crystallography via the symmetry groups of ornaments and crystals. LAS


Algebraic Topology, T(17-18: 1, 2). Differential Forms in Algebraic Topology. Raoul Bott, Loring W. 
Tu. Grad. Texts in Math., No. 82. Springer-Verlag, 1982, xiv + 331 pp, $29.80. [ISBN: 0-387-90613- 
4] A serious introduction to algebraic topology using differential forms and the De Rham theory as 
the principal tool. The implied limitation to spaces which are real manifolds does not appear to 
limit seriously the breadth of material covered. Good index and exercises. JAS 


Algebraic Topology, P. Induction Theorems for Groups of Homotopy Manifold Structures. Andrew J.
Nicas. Memoirs No. 267. AMS, 1982, vi + 108 pp, $6 (P). A study of group structure in and exten- 
sion of the surgery exact sequence of compact oriented manifolds with boundary. JG 


Algebraic Topology, T(17-18). Eléments D’Analyse, Tome IX. J. Dieudonné. Gauthier-Villars, 1982, 
xviii + 380 pp, 320 FF (P). [ISBN: 2-04-011499-8] An introduction to algebraic and differential 
topology. Special topics include Chern and Stiefel-Whitney classes, Hodge theory, the Atiyah-Bott- 
Lefschetz formula, cohomology of Lie groups and Grassmannians. Many exercises. SG 


Topology, P. Chain Conditions in Topology. W.W. Comfort, S. Negrepontis. Tracts in Math., No. 79. 
Cambridge U Pr, 1982, xiii + 300 pp, $39.50. [ISBN: 0-521-23487-5] A collection of new results by
the authors concerning various considerations related to the cardinality of the open sets of a topo-
logical space. The work involves recent modifications of combinatorial tools developed in the
1960’s by Erdős and Rado. JAS


Topology, P. Embedding Coverings Into Bundles with Applications. P.F. Duvall, L.S. Husch. Memoirs 
No. 263. AMS, 1982, iv + 53 pp, $4 (P). A solution to the problem of when a finite regular cover- 
ing of a closed n-manifold is homotopic to an embedding in an n-plane bundle over the manifold. 
This solution is then applied to a study of embedding n-manifold-like continua (up to shape) into 
2n-space. JAS 


Topology, S(17), P, L. Geometric Theory of Dynamical Systems, An Introduction. Jacob Palis, Jr., 
Welington de Melo. Transl: A.K. Manning. Springer-Verlag, 1982, xii + 198 pp, $28. [ISBN: 0-387- 
90668-1] An introduction to the structural stability and genericity of vector fields. Assumes back-
ground in differential equations, differentiable manifolds, linear algebra, and Banach spaces. JG


Topology, P, L. Quasi-Uniform Spaces. Peter Fletcher, William F. Lindgren. Pure & Appl. Math., V. 
77. Dekker, 1982, viii + 216 pp, $29.75 (P). [ISBN: 0-8247-1839-9] Compilation and exposition of 
the widely scattered literature on quasi-uniformities and quasi-proximities, intended to encourage 
the use of those structures in general topology. Includes historical notes, lengthy bibliography, 
index, and 22 unsolved problems. In short, a valuable resource. GHM 


Operations Research, T(16-18: 1), L. Multiobjective Decision Analysis with Engineering and Business 
Applications. Ambrose Goicoechea, Don R. Hansen, Lucien Duckstein. Wiley, 1982, xvii + 519 pp, 
$34.95. [ISBN: 0-471-06401-7] An introduction to multi-objective optimization problems. Considers 
both continuous and discrete problems and presents a number of algorithms for each. Provides For- 
tran computer programs for two of the methods in an appendix. AO 


Operations Research, S(15-16), L. Network Optimisation Practice: A Computational Guide. David K.
Smith. Halsted Pr, 1982, 237 pp, $59.95. [ISBN: 0-470-27347-X] An elementary introduction to 
several standard network optimization problems and algorithms for their solution. Listings of Basic 
and Pascal implementations of many of these algorithms are included. AO 


Optimization, P. Modern Applied Mathematics: Optimization and Operations Research. Ed: Bernhard 
Korte. North-Holland, 1982, ix + 693 pp, $130.25. [ISBN: 0-444-86134-3] A collection of 17 arti-
cles based on talks presented during a 1979 summer school organized by the University of Bonn. The
articles provide state-of-the-art surveys of a number of areas of optimization theory and operations 
research. AO 


Optimization, T*(16-17: 1, 2), L*. Optimization Techniques: An Introduction. L.R. Foulds. Under-
grad. Texts in Math. Springer-Verlag, 1981, xi + 502 pp, $36. [ISBN: 0-387-90586-3] This textbook
provides an introduction to some of the most important optimization techniques at an undergraduate 
level. The topics covered include linear programming, integer programming, network analysis,
dynamic programming, classical optimization, and nonlinear programming. AO 


Optimization, P. Mathematical Techniques of Optimization, Control and Decision. Ed: J.P. Aubin, A. 
Bensoussan, I. Ekeland. Birkhauser Boston, 1981, viii + 212 pp, $24.95. [ISBN: 3-7643-3032-5] 
Report from the Research Center in the Mathematics of Decision Making. Collection of essays survey- 
ing methods and applications in systems theory and decision making. JRG 


Optimization, T(16-17: 2), P. Control, Identification, and Input Optimization. Robert Kalaba, Karl 
Spingarn. Math. Concepts & Methods in Sci. & Eng., V. 25. Plenum Pr, 1982, xi + 431 pp, $39.50. 
[ISBN: 0-306-40847-3] Major divisions of the book are: I. Optimal control and methods for numerical 
solutions; II. System identification; III. Optimal inputs for system identification. This attrac-
tive book is written to be accessible to students with no previous knowledge of optimal control
theory or numerical methods, but it will be a useful reference to those in control engineering. AWR 


Optimization, T(17: 2), P. Optimization Over Time: Dynamic Programming and Stochastic Control, V. 
1. Peter Whittle. Wiley, 1982, xi + 317 pp, $46.95. [ISBN: 0-471-10120-6] The goal is to optimize 
decisions to be made in running a dynamic system, a subject variously called dynamic programming or 
stochastic control. Of interest to people in operations research and control engineering. AWR 


Probability, P. Lecture Notes in Mathematics-920: Séminaire de Probabilités XVI, 1980/81. Ed: J. 
Azéma, M. Yor. Springer-Verlag, 1982, v + 622 pp, $31.10 (P). [ISBN: 0-387-11485-8] This volume,
together with the supplement (Lecture Notes 921) on stochastic differential geometry, forms the
proceedings of the 1980-1981 seminar at the Université de Strasbourg. JAS


Probability, S(15-17), L. The Correspondence Between A.A. Markov and A.A. Chuprov on the Theory of 
Probability and Mathematical Statistics. Ed: Kh. O. Ondar. Transl: Charles and Margaret Stein.
Springer-Verlag, 1981, xvii + 181 pp, $32. [ISBN: 0-387-90585-5] Translation of a 1977 Russian book
containing much of the correspondence between these two from 1910-1917, together with editorial com- 
ments. Provides an interesting insight into Markov’s personality, as well as valuable historical 
material. RSK 


Probability, P. Lecture Notes in Mathematics-929: Ecole d’Eté de Probabilités de Saint-Flour X-
1980. J.M. Bismut, L. Gross, K. Krickeberg. Springer-Verlag, 1982, x + 313 pp, $17.40 (P). [ISBN:
0-387-11547-1] Three courses from the 1980 Saint-Flour summer school in probability theory:
mechanics and stochastic optimization, equilibrium thermodynamics, and punctual processes. LAS


Statistics, S(16-17), P. Computing in Statistical Science through APL. Francis John Anscombe. Ser.
in Stat. Springer-Verlag, 1981, xv + 426 pp, $24.80. [ISBN: 0-387-90549-9] The stated purpose of 
this book is to interest statisticians in the use of the programming language APL. The first third 
of the book presents a description of APL while the remainder presents several examples of statisti- 
cal analysis using APL. AO 


Statistics, T(16-18: 1, 2), S, P, L. The Sequential Statistical Analysis of Hypothesis Testing, 
Point and Interval Estimation, and Decision Theory. Z. Govindarajulu. Math. & Management Sci. Ser., 
V. 5. American Sci Pr, 1981, ix + 680 pp, $48.50 (P). [ISBN: 0-935950-02-8] A comprehensive treat-
ment of sequential testing and estimation. Presupposes courses in probability, statistics, and
advanced calculus. FLW 


Statistics, T(16-18: 1, 2), S, P, L. Concepts of Nonparametric Theory. John W. Pratt, Jean D. Gib- 
bons. Ser. in Stat. Springer-Verlag, 1981, xvi + 462 pp, $38. [ISBN: 0-387-90582-0] Could serve as 
a text for a first or second course on nonparametric tests. Most of the material presupposes only a 
calculus-based probability and statistics course. Includes "considerable philosophical and methodo- 
logical discussion." FLW 


Statistics, S(15-17), P*, L. Circular Statistics in Biology. Edward Batschelet. Math. in Biology.
Academic Pr, 1981, xvi + 371 pp, $69.50. [ISBN: 0-12-081050-6] Deals with analysis of data points 
distributed on a circle (e.g., directions in a plane, or in some cases other cyclical phenomena). 
Part I provides a thorough coverage of the descriptive and inferential tools of this often neglected 
area of statistics, while Part II contains the mathematics required. Good set of references. RSK 


Statistics, S*(15-17), L. The Making of Statisticians. Ed: J. Gani. Springer-Verlag, 1982, viii + 
263 pp, $19.80. [ISBN: 0-387-90684-3] Interesting collection of 16 autobiographical essays by 
pioneers (born 1897-1917) in the area of modern statistics. Includes sketches of their lives,
explanations of how they became interested in probability and statistics, accounts of their major 
contributions, and some predictions about the future. RSK 

Statistics, P. Lecture Notes in Statistics-14: GLIM 82: Proceedings of the International Conference 
on Generalised Linear Models. Ed: Robert Gilchrist. Springer-Verlag, 1982, 188 pp, $12.50 (P). 
[ISBN: 0-387-90777-7] 


Statistics, T(15-16: 1, 2), S. Introduction to Probability Theory and Statistical Inference, Third 
Edition. Harold J. Larson. Wiley, 1982, xi + 637 pp, $27.95. [ISBN: 0-471-05909-9] Presupposes one 
year of calculus. This edition has new material on descriptive statistics, the bivariate normal 
distribution, the F-distribution, and nonparametric procedures. (TR, First Edition, October 1969; 
Extended Review, November 1972; TR, Second Edition, December 1974.) FLW 


Statistics, S(16-18), P. Lecture Notes in Statistics-9: Statistical Properties of the Generalized
Inverse Gaussian Distribution. Bent Jørgensen. Springer-Verlag, 1982, 188 pp, $12 (P). [ISBN: 0-
387-90665-7]


Statistics, T(13: 1), S, P, L. The Randomized Clinical Trial and Therapeutic Decisions. Ed: Niels
Tygstrup, John M. Lachin, Erik Juhl. Statistics, V. 43. Dekker, 1982, xviii + 296 pp, $39.75. 
[ISBN: 0-8247-1856-9] Could serve as a text for doctors or potential doctors. Contributions by 
several M.D.’s and biostatisticians to a general treatment of randomized clinical trials. No exer-
cise sets. FLW


Statistics, S(16-18), P. Connectivity in Multi-factor Designs: A Combinatorial Approach. Lothar
Butz. Res. & Educ. in Math., V. 3. Heldermann Verlag, 1982, vii + 190 pp, 32 DM (P). [ISBN: 3- 
88538-203-2] Uses graph theoretic techniques to deal with experimental design problems. FLW 


Statistics, T(13: 1), S, L. Statistics Without Tears: A Primer for Non-mathematicians. Derek Rown-
tree. Charles Scribner’s, 1981, 199 pp, $6.95 (P). [ISBN: 0-684-17502-9] Introduces basic statisti-
cal concepts while expecting very little in the way of calculations to be done by the reader. No
exercise sets, but questions for the reader occur in the body of the text. FLW


Statistics, S(16-18), P. Lecture Notes in Statistics-11: Random Coefficient Autoregressive Models: 
An Introduction. Des F. Nicholls, Barry G. Quinn. Springer-Verlag, 1982, v + 154 pp, $12 (P). 
[ISBN: 0-387-90766-1] Autoregressive linear models are those whose coefficients vary with time, 
adjusting automatically to changing conditions. This monograph develops the "quite tractable" 
theory of autoregressive models with random coefficients. LAS 


Statistics, T(16-18: 1, 2), S, P. Estimation of Dependences Based on Empirical Data. Vladimir Vap- 
nik. Transl: Samuel Kotz. Ser. in Stat. Springer-Verlag, 1982, xvi + 399 pp, $56. [ISBN: 0-387- 
90733-5] A study of the results of adopting the principle: "select, from an admissible set of func- 
tions, a function which fulfills a definite relationship between a quantity characterizing the qual- 
ity of the approximation and a quantity characterizing the “complexity” of the approximating func- 
tion." FLW 


Statistics, S*(14-18), P, L*. Encyclopedia of Statistical Sciences, Volume 2: Classification to Eye 
Estimate. Ed: Samuel Kotz. Wiley, 1982, viii + 613 pp, $75. [ISBN: 0-471-05547-6] Second volume of 
a planned eight-volume set. (See TR, October 1982, of Series and Volume 1.) RSK 


Computer Programming, T(13: 1). Problem Solving and Structured Programming in WATFIV. Frank L. 
Friedman, Elliot B. Koffman. Addison-Wesley, 1982, xvi + 539 pp, $16.95 (P). [ISBN: 0-201-10482-2] 
WATFIV is a dialect of Fortran. This textbook is an introduction to programming using the WATFIV 
language. It is based on the second edition of Problem Solving and Structured Programming in For- 
tran by the same authors (TR, December 1981). AO 


Computer Programming, S(15-17), P, L. ADA, An Introduction: Ada Reference Manual (July 1980). Henry 
Ledgard. Springer-Verlag, 1981, vii + 373 pp, $16.80 (P). [ISBN: 0-387-90568-5] This volume con- 
sists of two parts. The first part is an introduction to the programming language Ada written for 
readers with experience in high-level language programming. Because there are no exercises, it is 
not suited for use as a textbook. The second part is a reproduction of the November 1980 printing 
of the Ada Reference Manual which defines the language. AO 


Computer Programming, S(16-17), P. Programming in Ada. J.G.P. Barnes. Addison-Wesley, 1982, x + 
340 pp, $17.95 (P). [ISBN: 0-201-13792-5] This textbook on the programming language Ada was written 
by one of the members of the Ada Design Team. It provides an introduction to nearly all aspects of 
the language and even incorporates some of the very recent changes to the language definition. AO 


Computer Programming, T(13: 1). BASIC for Students: With Applications. Michael Trombetta. 
Addison-Wesley, 1981, xi + 291 pp, $9.95 (P). [ISBN: 0-201-07611-X] Suitable for a semester course. 
Clearly stated objectives. Uses flow charts. Subroutines under-emphasized. RBK 


Computer Programming, T(13), S. Some Common Pascal Programs. Ed: Lon Poole. Osborne/McGraw-Hill, 
1982, ix + 235 pp, $14.99 (P). [ISBN: 0-931988-73-X] Lists 76 complete Pascal programs. For each 
one the book provides a short introduction, the program listing and sample output. Programs are of 
a difficulty appropriate to a first course in programming. MS 


Computer Programming, S(16), P. Synthetische Programmierung auf dem HP-41C/CV. W.C. Wickes. 
Heldermann Verlag, 1982, x + 137 pp, (P). [ISBN: 3-88538-800-6] German translation of the fifth 
edition of Synthetic Programming on the HP-41C/CV. JD-B 


Computer Programming, S(13), P. Problems for Computer Solution, Student Edition. Stephen J. 
Rogowski. Creative Computing Pr, 1979, v + 104 pp, $4.95 (P). [ISBN: 0-916688-13-5]; Problems for 
Computer Solution, Teacher Edition. 1980, iv + 181 pp, $9.95 (P). [ISBN: 0-916688-14-3] Ninety-plus 
problems arranged by mathematical topic, from arithmetic through calculus. Teacher’s edition repro- 
duces student page and includes sample printout, Basic listing and analysis. Good source although a 
few problems are more easily done without computer; Basic examples do not encourage structured 
approach. MW 


Data Structures, T*(15-16: 1), L. Data Structures and Algorithms. Alfred V. Aho, John E. Hopcroft, 
Jeffrey D. Ullman. Addison-Wesley, 1983, xi + 427 pp, $26.95. [ISBN: 0-201-00023-7] This text is 
based on the material in the first six chapters of the well-known textbook The Design and Analysis 
of Computer Algorithms (TR, November 1975) by the same authors plus some additional material. It 
is designed to be used in a first course on data structures and algorithms. AO 


Software Systems, S(16-17), P, L. Engineering a Compiler: VAX-11 Code Generation and Optimization. 
Patricia Anklam, et al. Digital Equip, 1982, xv + 269 pp, $24. [ISBN: 0-932376-19-3] This work 
relates the experiences of the authors while producing a commercial PL/I compiler. It emphasizes 
the software engineering aspects of compiler construction, particularly that of the code generation 
phase. AO 


Computer Science, T(16-18: 1, 2), S, P, L. Fundamentals of Interactive Computer Graphics. James D. 
Foley, Andries van Dam. Addison-Wesley, 1982, xx + 664 pp, $34.95. [ISBN: 0-201-14468-9] A major 
four-part treatment covering both hardware and software: basics; mathematics, data structures, and 
display architecture; modern raster technology; making realistic synthetic photographs. Assumes 
only introductory material in programming, computer architecture, and data structures. JAS 


Computer Science, T?(13-14: 2, 3), P?. Mathematics for Computing. G.P. McKeown, V.J. Rayward-Smith. 
Halsted Pr, 1982, xi + 428 pp, $24.95 (P). [ISBN: 0-470-27268-6] Potpourri of topics in discrete and 
continuous mathematics (including calculus, probability and abstract algebra) with emphasis on algo- 
rithms for computer implementation. Coverage seems too succinct for textbook use, too unreliable 
for reference (e.g., "Theorem 3.2.3. A necessary and sufficient condition for convergence of an 
alternating series $\{a_1 - a_2 + a_3 - \cdots\}$ is that $\{a_n\}$ is monotonically decreasing." There a "proof" is 
even sketched, though both implications are false). GHM 


Computer Science, P. Lecture Notes in Computer Science-140: Automata, Languages and Programming. 
Ed: M. Nielsen, E.M. Schmidt. Springer-Verlag, 1982, vii + 614 pp, $27.60 (P). [ISBN: 0-387-11576- 
5] The proceedings of the Ninth International Colloquium on automata, languages and programming held 
at Aarhus University, from July 12-16, 1982. CEC 


Computer Science, T(14: 1), L. Computer Architecture and Organization with Examples Using the 
PDP-11. Theodore H. Meyer, Jr. Dilithium Pr, 1982, x + 333 pp, $16.95 (P). [ISBN: 0-918398-55-X] 
Reviews the principles of computer organization using the PDP-11 as its model. Covers information 
representation, and the major components of a computer system--memory, I/O processor, bus structure. 
About one-third of the book is devoted to MACRO-11 assembly language programming. MS 


Computer Science, T*(17-18: 2), P. Principles of Database Systems, Second Edition. Jeffrey D. 
Ullman. Computer Sci Pr, 1982, vii + 484 pp, $24.95. [ISBN: 0-914894-36-6] Covers the complete field 
of the design and analysis of data base systems. Reviews the three major models--network, hierarch- 
ical, and relational--but concentrates most heavily on the relational model. Has an extensive 
mathematical treatment of many aspects of data base design including query optimization, distributed 
data base systems and design theory. Advanced book for the serious student. (TR, First Edition, 
August-September 1980.) MS 


Control Theory, T*(17: 2), S, P*. Interconnected Dynamical Systems. Raymond A. DeCarlo, Richard 
Saeks. Electrical Engin. & Electronics, No. 10. Dekker, 1981, vi + 512 pp, $57.50. [ISBN: 0-8247- 
6639-3] Beginning graduate level textbook. Provides analytical tools for analysis and design of 
interconnected networks and systems. Discusses computational problems encountered in implementing 
theory. Well-illustrated. Problems. No answers. Numerous references plus extensive bibliography. 
JK 


Systems Theory, P. Lecture Notes in Control and Information Sciences-38: System Modeling and 
Optimization. Ed: R.F. Drenick, F. Kozin. Springer-Verlag, 1982, xi + 893 pp, $52.40 (P). [ISBN: 
0-387-11691-5] Proceedings of the 10th IFIP (International Federation of Information Processing) 
Conference, New York City, September 1981: five plenary addresses, five topical addresses, and 
scores of contributed papers on a very wide variety of modelling situations. LAS 


Applications, S(15-16), L*. The Application of Mathematics in Industry. Ed: Robert S. Anderssen, 
Frank R. de Hoog. Kluwer Boston, 1982, xiv + 202 pp, $32.50. [ISBN: 90-247-2590-9] The proceedings 
of a one-day seminar held at the Australian National University in 1980, presenting a wide variety 
of industrial applications of mathematics. AO 


Applications (Artificial Intelligence), T*(16-17: 1), S*, P. Progress in Pattern Recognition, 
Volume 1. Ed: Laveen N. Kanal, Azriel Rosenfeld. Elsevier North-Holland, 1981, vii + 391 pp, 
$53.25. [ISBN: 0-444-86325-7] A collection of 10 papers written especially for this proposed 
two-volume series in pattern recognition. The papers are survey or tutorial in nature describing recent 
progress in the area of data representation, algorithms for scene processing, and imaging software. 
For the most part the authors are all very well known people in the field and the book summarizes 
well the state of the art of pattern recognition as of 1980-81. MS 


Applications (Biology), S(16-18), P. Lecture Notes in Biomathematics-45: Competition and 
Cooperation in Neural Nets. Ed: S. Amari, M.A. Arbib. Springer-Verlag, 1982, xiv + 441 pp, $26 (P). 
[ISBN: 0-387-11574-9] The last decade has seen an explosion of experimental research on the brain, 
yet the theory remains highly fragmented and uneven. These expository reports constitute the 
proceedings of a major conference to catalyze better integration of theory and experiment. Partici- 
pants included brain surgeons, neurophysiologists, mathematicians, computer scientists, physicists. 
One cannot help but be impressed with the richness, the excitement, and the importance of this area 
of mathematical modeling. LCL 


Applications (Biology), P. Lecture Notes in Biomathematics-44: Recognition of Pattern and Form. Ed: 
Duane G. Albrecht. Springer-Verlag, 1982, 225 pp, $15 (P). [ISBN: 0-387-11206-5] Proceedings of a 
conference held during March 1979 at the University of Texas, Austin. LAS 


Applications (Cryptography), P, L*. Secure Communications and Asymmetric Cryptosystems. Ed: Gus- 
tavus J. Simmons. AAAS Selected Symposia Ser., No. 69. Westview Pr, 1982, x + 338 pp, $30. [ISBN: 
0-86531-338-5] Papers presented at the 1980 AAAS annual meeting surveying the then-current state of 
"public key" cryptosystems that depend on an asymmetry determined by a computationally hard 
(NP-complete) problem. Includes papers on policy and applications, as well as details of various 
coding systems. LAS 


Applications (Economics), P. Dynamic Analysis of Open Economies. Masanao Aoki. Academic Pr, 1981, 
xxvi + 341 pp, $39.50. [ISBN: 0-12-058940-0] An analysis of how economies with different structures 
are affected by common disturbances (e.g., inflation, exchange rates, internal and external bal- 
ances, etc.), and how these disturbances spread throughout the world. The emphasis on dynamic 
analysis (employing variational and perturbation techniques) distinguishes this book from others 
with similar objectives. LCL 


Applications (Engineering), T(16: 1), S, P*. Approximate Methods in Engineering Design. T.T. 
Furman. Math. in Sci. & Eng., V. 155. Academic Pr, 1981, viii + 388 pp, $59. [ISBN: 0-12-269960-2] 
Techniques and methods, along with case studies from mechanical engineering, in support of the phi- 
losophy "...approximate methods of calculations with a reasonable knowledge of the degree of accu- 
racy involved can, in general, be much more valuable than time-consuming precise and rigorous 
mathematical analyses." Appropriate error and statistical analyses. JK 


Applications (Engineering), P. Dynamics of Manipulation Robots: Theory and Application. M. 
Vukobratović, V. Potkonjak. Scientific Fund. of Robotics, No. 1. Springer-Verlag, 1982, xiii + 303 
pp, $37.50. [ISBN: 0-387-11628-1] Computer methods for forming mathematical models of the dynamics 
of robots, with applications. Requires a knowledge of linear and nonlinear systems theory and a 
strong background in mechanism dynamics. LCL 


Applications (Engineering), P. Control of Manipulation Robots: Theory and Application. M. 
Vukobratović, D. Stokić. Scientific Fund. of Robotics, No. 2. Springer-Verlag, 1982, xiii + 363 
pp, $45. [ISBN: 0-387-11629-X] A continuation of the first volume in this series (see preceding 
review); this volume focuses on the control of robots. A two-level, suboptimal, decentralized pro- 
cedure is described for implementing global control. Several illustrations to six-degree-of-freedom 
robots. LCL 


Applications (Linguistics), P. A Grammar of English on Mathematical Principles. Zellig Harris. 
Wiley, 1982, xvi + 429 pp, $43.50. [ISBN: 0-471-02958-0] A new theory of English grammar based on a 
partial order between operator and argument words. "The partial order relation defines a subset of 
English sentences, which are not always colloquially customary, but in which all information possi- 
ble to convey in the language can be expressed." LAS 


Applications (Physics), P. Functional Integration: Theory and Applications. Ed: Jean-Pierre 
Antoine, Enrique Tirapegui. Plenum Pr, 1980, x + 355 pp, $42.50. [ISBN: 0-306-40573-3] The proceed- 
ings of an international workshop held in Louvain-la-Neuve in 1979. The papers present attempts to 
provide a rigorous mathematical foundation for the Feynman path integral as well as a number of 
applications in physics. AO 


Applications (Physics), P. Lecture Notes in Mathematics-905: Differential Geometric Methods in 
Mathematical Physics. Ed: H.-D. Doebner, S.I. Andersson, H.R. Petry. Springer-Verlag, 1982, vi + 
309 pp, $18 (P). [ISBN: 0-387-11197-2] Proceedings of an international conference held at the Techn- 
ical University of Clausthal, West Germany, July 23-25, 1980. JAS 


Applications (Physics), P. Stochasticity and Partial Order: Doubly Stochastic Maps and Unitary Mix- 
ing. Peter M. Alberti, Armin Uhlmann. Math. & Its Appl., V. 9. D Reidel Pub, 1982, 123 pp, 
$28.50. [ISBN: 90-277-1350-2] The classical theory of majorization (an important partial order rela- 
tion on n-tuples of positive numbers with the same sum) is generalized to state spaces of W*- 
algebras, with applications to linear system theory, thermodynamics and stochastics. LCL 


Applications (Physics), P. Point Group Symmetry Applications: Methods and Tables. Philip H. 
Butler. Plenum Pr, 1981, ix + 567 pp, $55. [ISBN: 0-366-40523-7] A useful reference for graduate 
studies in molecular chemistry and quantum mechanics. Starting with the assumption that the reader 
knows little or no group theory but is familiar with elementary quantum mechanics, the author 
proceeds to show how to understand and make best use of the tables which follow (which include all 
3jm and 6j symbols for all point group schemes). LCL 


Applications (Physics), S(18), P. Inverse Scattering Papers: 1955-1962. Irvin Kay, Harry W. Moses. 
Lie Groups: History, Frontiers & Appl., V. XII. Math Sci Pr, 1982, 305 pp, $30. [ISBN: 
0-915692-32-5] A collection of thirteen related papers on inverse scattering and spectral theory from '55 to 
'63. Emphasizes heuristic approach and development of formalism as a tool for constructing explicit 
potentials in eigenvalue problems. JS 


Applications (Physics), P. Les Incertitudes D'Heisenberg et L'Interprétation Probabiliste de la 
Mécanique Ondulatoire. Louis de Broglie. Gauthier-Villars, 1982, xlii + 304 pp, 220 FF. [ISBN: 
2-04-015411-6] An account of de Broglie's ideas on the probabilistic interpretation of wave mechanics. 
The main text, written between 1950 and 1952 but never before published, develops the generally- 
accepted theory (of Bohr, et al.). Notes written over the next 25 years show how his views shifted 
in the direction of causality and hidden variables. JD-B 


Applications (Physics), T(15-17: 2), S, L. Cosmology, Physics, and Philosophy. Benjamin Gal-Or. 
Springer-Verlag, 1981, xv + 522 pp, $28. [ISBN: 0-387-90581-2] An extraordinary book, linking 
separate parts of cosmology and physics with modern results to provide the basis for a contemporary 
version of natural philosophy. Mixes personal philosophy and religious/mystical images with tradi- 
tional science in a highly idiosyncratic manner. The author suggests this book as a basis for a 
core curriculum course, but much of it requires vector calculus for reasonable understanding. LAS 


Applications (Physics), P. Random Fields: Rigorous Results in Statistical Mechanics and Quantum 
Field Theory. Ed: J. Fritz, J.L. Lebowitz, D. Szász. Elsevier North-Holland, 1981, $186 set [ISBN: 
963-8021-49-7]. Volume I, 568 pp; Volume II, 540 pp. Typeset proceedings (note price!) of a June 
1979 colloquium organized by the Bolyai and Bernoulli societies at Esztergom, Hungary. LAS 


Reviewers 


RJA: Richard J. Allen, St. Olaf; RB: Richard Brown, Carleton; JNC: Judith N. Cederberg, St. Olaf; 
CEC: Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Carleton; JRG: Jennifer R. Galovich, St. 
Olaf; SG: Steven Galovich, Carleton; JG: Jack Goldfeather, Carleton; PH: Paul Humke, St. Olaf; RBK: 
Roger B. Kirchner, Carleton; RSK: Richard S. Kleber, St. Olaf; JK: Joseph Konhauser, Macalester; 
LCL: Loren C. Larson, St. Olaf; GHM: George H. Mills, Carleton; RWN: Richard W. Nau, Carleton; AO: 
Arnold Ostebee, St. Olaf; AWR: A. Wayne Roberts, Macalester; MS: Michael Schneider, Macalester; JS: 
John Schue, Macalester; SS: Seymour Schuster, Carleton; JAS: J. Arthur Seebach, Jr., St. Olaf; KS: 
Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; TAV: Theodore A. Vessey, St. Olaf; MW: Martha 
Wallace, St. Olaf; FLW: Frank L. Wolf, Carleton; PZ: Paul Zorn, St. Olaf. 
In the second game, Right colors two edges after each time Left colors one. For n > 3, can 
Right always produce a larger clique? She can for n = 4 or 5. 


In the third game they color alternately, as in the first game, but Left wins if he can produce a 
vertex with larger Lilac valence than any Red valence, i.e., if he can make more Lilac edges at a 
vertex than Right can make Red edges at any vertex. If the maximum Lilac and Red valences are 
equal, Right wins. Who does win? 


A Conjecture of Marian Deaconescu 


Let $\Pi = p_1 p_2 \cdots p_n$ be the product of the first n primes. Then 
$\Pi + p$ is prime for at least one prime p, $p_n < p < \Pi$? 


Erdős agrees with this conjecture and expects that the least such p is much smaller than $\Pi$; in fact 
that $p < n^c$ for some constant c. Deaconescu has verified the conjecture for n < 1000. 
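
The search behind such a verification can be sketched for small n as follows. This is only an illustration under our own assumptions: the function names are hypothetical, primality is tested by plain trial division, and nothing here reproduces Deaconescu's actual computation.

```python
from math import isqrt


def is_prime(k):
    """Trial-division primality test; adequate for the small values used here."""
    if k < 2:
        return False
    if k % 2 == 0:
        return k == 2
    for d in range(3, isqrt(k) + 1, 2):
        if k % d == 0:
            return False
    return True


def first_primes(n):
    """Return the list of the first n primes."""
    primes, k = [], 2
    while len(primes) < n:
        if is_prime(k):
            primes.append(k)
        k += 1
    return primes


def least_deaconescu_prime(n):
    """Least prime p with p_n < p < Pi and Pi + p prime, or None if there is none."""
    primes = first_primes(n)
    primorial = 1
    for q in primes:
        primorial *= q
    p = primes[-1] + 1
    while p < primorial:
        if is_prime(p) and is_prime(primorial + p):
            return p
        p += 1
    return None


if __name__ == "__main__":
    for n in range(2, 9):
        print(n, least_deaconescu_prime(n))
```

For n = 4, for example, the product is 210; the candidate p = 11 fails because 221 = 13 · 17, while p = 13 succeeds because 223 is prime.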


A PENTAD OF POINTED PROBLEMS 


Quite often the MONTHLY gets problems which are too unsolved to go into the elementary or 
advanced sections, even with stars on, while they may be too brief to stand alone in this section. 
Here are five examples. 


Some Problems in Geometric Probability 
Paul R. Chernoff, University of California, Berkeley, CA 94720 


Choose n points $P_1, P_2, \ldots, P_n$ at random from the unit disk in $\mathbb{R}^2$. What is the probability that 
the polygon $P_1 P_2 \cdots P_n P_1$ is convex? What is the probability that it's a Jordan curve? If the points 
are chosen from the unit ball in $\mathbb{R}^3$, what's the probability that the polygon is knotted? 

I have no idea how to solve these. I believe that Dvoretzky has done some possibly related 
work on Brownian motion trajectories. I think that the first question may be easy, but that the last 
one is hard. 
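
A reader who wants numerical evidence for the first question might start from a Monte Carlo experiment such as the sketch below. It assumes that "at random" means independent uniform points in the disk and that the polygon is traversed in the given order; the function names are ours. A closed polygon is treated as convex when every turn has the same sign and the total turning is exactly one revolution, which excludes star-shaped circuits.

```python
import math
import random


def random_point_in_disk(rng):
    """Uniform point in the unit disk, by rejection from the enclosing square."""
    while True:
        x, y = rng.uniform(-1, 1), rng.uniform(-1, 1)
        if x * x + y * y <= 1:
            return (x, y)


def is_convex_polygon(points):
    """True if the closed polygon through the points, in order, is strictly convex."""
    n = len(points)
    if n < 3:
        return False
    sign = 0
    total_turn = 0.0
    for i in range(n):
        ax, ay = points[i]
        bx, by = points[(i + 1) % n]
        cx, cy = points[(i + 2) % n]
        ux, uy = bx - ax, by - ay          # incoming edge at vertex b
        vx, vy = cx - bx, cy - by          # outgoing edge at vertex b
        cross = ux * vy - uy * vx
        dot = ux * vx + uy * vy
        if cross == 0:
            return False
        turn_sign = 1 if cross > 0 else -1
        if sign == 0:
            sign = turn_sign
        elif turn_sign != sign:
            return False                   # a reflex vertex or a crossing
        total_turn += math.atan2(cross, dot)
    # All turns agree in sign; convex exactly when the polygon winds once.
    return abs(abs(total_turn) - 2 * math.pi) < 1e-9


def estimate_convex_probability(n, trials=20000, seed=0):
    """Fraction of random n-gons (in the given vertex order) that are convex."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        polygon = [random_point_in_disk(rng) for _ in range(n)]
        if is_convex_polygon(polygon):
            hits += 1
    return hits / trials


if __name__ == "__main__":
    for n in range(3, 8):
        print(n, estimate_convex_probability(n))
```

The same sampling loop could be reused for the Jordan curve question by replacing the convexity test with a segment-intersection test.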


Find an Invertible Linear Combination 


Robert Hartwig, North Carolina State University at Raleigh, NC 27650 


If $Q_1, Q_2, Q_3, Q_4$ are square matrices of order n with elements in a field F and the matrix 

\[ \begin{pmatrix} Q_1 & Q_2 \\ Q_3 & Q_4 \end{pmatrix} \]

of order 2n is invertible, are there scalars $a_1, a_2, a_3, a_4$ in F so that $U = a_1Q_1 + a_2Q_2 + a_3Q_3 + a_4Q_4$ is invertible? 

What about the same problem with “field” replaced by “division ring” or “the ring of 
integers”? A necessary condition for matrices over a commutative ring R with unity is that the 
splitting condition holds; that is, that 

\[ \begin{pmatrix} A & 0 \\ 0 & A \end{pmatrix} \text{ similar to } \begin{pmatrix} B & 0 \\ 0 & B \end{pmatrix} \]

implies A similar to B. For, if A, B are n by n matrices, then 

\[ \begin{pmatrix} A & 0 \\ 0 & A \end{pmatrix} \begin{pmatrix} Q_1 & Q_2 \\ Q_3 & Q_4 \end{pmatrix} = \begin{pmatrix} Q_1 & Q_2 \\ Q_3 & Q_4 \end{pmatrix} \begin{pmatrix} B & 0 \\ 0 & B \end{pmatrix} \]

implies $AQ_i = Q_iB$ for each i, and hence since R is commutative, AU = UB so that an affirmative 
answer to the question implies the splitting condition. 
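
For experiments over the rationals, a brute-force search for the scalars can be sketched as below. The helper names and the coefficient bound are our own assumptions; the search only looks at small integer coefficients, so a negative result proves nothing, and the code does not check that the block matrix of order 2n is invertible.

```python
from fractions import Fraction
from itertools import product


def determinant(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    a = [[Fraction(x) for x in row] for row in matrix]
    n = len(a)
    det = Fraction(1)
    for i in range(n):
        pivot = next((r for r in range(i, n) if a[r][i] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != i:
            a[i], a[pivot] = a[pivot], a[i]
            det = -det
        det *= a[i][i]
        for r in range(i + 1, n):
            factor = a[r][i] / a[i][i]
            for c in range(i, n):
                a[r][c] -= factor * a[i][c]
    return det


def find_invertible_combination(matrices, bound=2):
    """Search integer coefficients in [-bound, bound] giving an invertible combination."""
    n = len(matrices[0])
    for coeffs in product(range(-bound, bound + 1), repeat=len(matrices)):
        combo = [[sum(a * q[i][j] for a, q in zip(coeffs, matrices))
                  for j in range(n)] for i in range(n)]
        if determinant(combo) != 0:
            return coeffs
    return None


if __name__ == "__main__":
    identity = [[1, 0], [0, 1]]
    zero = [[0, 0], [0, 0]]
    # Here the block matrix of order 4 is the identity, so the hypothesis holds.
    print(find_invertible_combination([identity, zero, zero, identity]))
```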


1983] UNSOLVED PROBLEMS 121 


Moving Triangles in the Plane 


Peter J. Giblin, University of North Carolina at Chapel Hill, NC 27514, and University of Liverpool, 
Liverpool, L69 3BX, England 


In spite of its unnatural look, this problem arose in research into caustics by reflection, 
undertaken with J. W. Bruce and C. G. Gibson (Oxford Quart. J., 1982): 


A, B, C each move on curves in the plane. I want to impose two sets of conditions: 

(1) The normal to the curve on which A moves is to be, at each instant, the bisector of angle 
CAB. Similarly for the normals at B and C. [Note that this implies BC + CA + AB = constant 
(Math. Mag., Problem 1151, 1982). If A, say, is permanently fixed, then the condition should be 
interpreted as saying that the bisector of angle CAB is permanently fixed, but in fact this is 
incompatible with condition (2) below. ] 

(2) The curvature of the curve on which A moves is to have, at each instant, the value 
(cos*α)/s, where 2s is the constant perimeter noted above, and 2α is the angle CAB. Similarly the 
curvatures of the curves on which B and C move are to be (cos*β)/s and (cos*γ)/s, where 2β, 2γ 
are the angles ABC and BCA. 


Problem. Do such moving triangles exist? 


The Gauss-Lucas Hull of a Set of Complex Numbers 
Lee A. Rubel, University of Illinois, Urbana, IL 61801 


The Gauss-Lucas Theorem [see M. Marden, (The) Geometry of (the Zeros of) Polynomials (in 
a Complex Variable), Math Surveys #3, Amer. Math. Soc., 1949, pp. 14 & ff; 2nd ed., 1966, pp. 
22 & ff.] states that the zeros of the derivative of a complex polynomial lie in the convex hull of 
the zeros of the polynomial. This problem pursues the G-L Theorem further. 

If E is a subset of the complex plane, let $E' = \{z \in \mathbb{C} : p'(z) = 0$ for some polynomial p, all of 
whose zeros (possibly multiple) lie in $E\}$. For an ordinal number $\alpha$, let $E^{\alpha} = \bigcup \{(E^{\beta})' : \beta < \alpha\}$. 
Let $E^*$ be the $E^{\alpha}$ for the first $\alpha$ for which $(E^{\alpha})' = E^{\alpha}$. The problem is to get a geometric 
description of $E^*$ in terms of E. This seems challenging even when E is a doubleton, say 
$E = \{0, 1\}$, when $E' = \{r : 0 \le r \le 1,\ r \text{ rational}\}$. To see that this is really $E'$, we may write any 
polynomial p, all of whose zeros lie in E, as $p(z) = Az^m(z - 1)^n$. Then 

\[ \frac{p'(z)}{p(z)} = \frac{m}{z} + \frac{n}{z - 1} = \frac{(m + n)z - m}{z(z - 1)} \]

so that $p'(z) = 0$ precisely when z = m/(m + n), as claimed. 
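
The claim about the critical point can be checked mechanically with exact rational arithmetic. The sketch below uses our own helper names, represents a polynomial by its coefficient list in ascending degree, builds $z^m(z-1)^n$, differentiates it, and confirms that the derivative vanishes at m/(m + n).

```python
from fractions import Fraction


def poly_mul(a, b):
    """Product of two polynomials given as ascending coefficient lists."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def poly_pow(base, exponent):
    """base raised to a non-negative integer power."""
    out = [Fraction(1)]
    for _ in range(exponent):
        out = poly_mul(out, base)
    return out


def derivative(p):
    """Formal derivative of an ascending coefficient list."""
    return [k * c for k, c in enumerate(p)][1:]


def evaluate(p, z):
    """Horner evaluation of p at z."""
    acc = Fraction(0)
    for c in reversed(p):
        acc = acc * z + c
    return acc


def doubleton_polynomial(m, n):
    """Coefficients of z^m (z - 1)^n."""
    z = [Fraction(0), Fraction(1)]
    z_minus_one = [Fraction(-1), Fraction(1)]
    return poly_mul(poly_pow(z, m), poly_pow(z_minus_one, n))


if __name__ == "__main__":
    for m in range(1, 6):
        for n in range(1, 6):
            p = doubleton_polynomial(m, n)
            assert evaluate(derivative(p), Fraction(m, m + n)) == 0
    print("p'(m/(m+n)) = 0 for all 1 <= m, n <= 5")
```

One further step already leaves the rationals: $p(z) = z(z - \tfrac12)(z - 1)$ has all its zeros in $E'$, and $p'(z) = 3z^2 - 3z + \tfrac12$ vanishes at $(3 \pm \sqrt{3})/6$.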


Problem 1. What is {0, 1}*? 


Problem 2. Is {0, 1}* the set of all algebraic numbers in the closed interval [0, 1], all of whose 
algebraic conjugates also lie in [0, 1]? 

Since it is easy to see that {0, 1}* is contained in this last set, the burden is to prove or disprove 
the reverse containment. Many related questions suggest themselves. 


Rational Roots of a Polynomial 
Lee A. Rubel, University of Illinois, Urbana, IL 61801 
Problem: Let p be a real polynomial of degree n such that the equation p(x) = 0 has n distinct 
rational real roots. Must there exist a real number $\varepsilon \ne 0$ such that the equation $p(x) = \varepsilon$ also has 
n distinct rational real roots? 
I believe on other evidence that the answer is probably affirmative. It is trivially so for n = 1 
and n = 2, and nontrivially so for n = 3; here is a proof by Richard Bishop. 

Suppose that $x^3 - px + q = 0$ has three rational roots. Then $x^3 - px + q_1 = 0$ has three 
rational roots for $q_1$ running through a dense set of rational numbers in an interval which includes 
q (possibly at an end). 


1. Let the real roots of $x^3 - px + q_1 = 0$ be r, s, t. For these roots to be real, the discriminant 
$-p^3/27 + q_1^2/4$ must be $\le 0$ so that r, s, t exist as “functions” of $q_1$ for $|q_1| \le \sqrt{4p^3/27}$. By the 
vanishing of the $x^2$ term, r + s + t = 0, t = -r - s. Then 

\[ p = -(st + rt + rs) = r^2 + rs + s^2 = \left(r + \frac{s}{2}\right)^2 + 3\left(\frac{s}{2}\right)^2. \]

The possibilities (r, s, t) as $q_1$ varies are parametrized by r, s on the conic $\left(r + \frac{s}{2}\right)^2 + 3\left(\frac{s}{2}\right)^2 = p$. 


2. The rational values of (r, s) on the conic yield rational values of $q_1$ for which there are three 
rational roots. Thus, the result reduces to the following: 


3. Let $(u_0, v_0)$ be a rational point on $u^2 + 3v^2 = p$. Then it is easily checked that the linear 
transformation 

\[ u = u_0 y - 3v_0 z, \qquad v = v_0 y + u_0 z \]

transforms the conic into $y^2 + 3z^2 = 1$ and transforms the rational points of $u^2 + 3v^2 = p$ to 
those of $y^2 + 3z^2 = 1$. Note that this reduces the theorem to the case p = 1. 

4. The conic $y^2 + 3z^2 = 1$ has a rational parametrization 

\[ y = \frac{a^2 - 3b^2}{a^2 + 3b^2}, \qquad z = \frac{2ab}{a^2 + 3b^2}. \]


The remaining cases, even n = 4, seem difficult. 
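
The n = 3 construction can be exercised with exact arithmetic in the case p = 1. The sketch below is ours, not Bishop's: it takes the base point $(u_0, v_0) = (1, 0)$, so that u = y and v = z, and recovers the roots from u = r + s/2 and v = s/2. Each pair (a, b) then yields a cubic $x^3 - x + q_1$ with three rational roots.

```python
from fractions import Fraction


def roots_from_parameters(a, b):
    """Rational roots (r, s, t) of some cubic x^3 - x + q_1, from the point (a, b)."""
    a, b = Fraction(a), Fraction(b)
    d = a * a + 3 * b * b
    y = (a * a - 3 * b * b) / d      # rational point on y^2 + 3 z^2 = 1
    z = 2 * a * b / d
    s = 2 * z                        # from v = s/2 with v = z
    r = y - z                        # from u = r + s/2 with u = y
    t = -r - s                       # the roots sum to zero
    return r, s, t


def constant_term(r, s, t):
    """q_1 in x^3 - p x + q_1; the product of the roots is -q_1."""
    return -r * s * t


if __name__ == "__main__":
    for a, b in [(1, 1), (2, 1), (3, 1), (3, 2), (5, 1)]:
        r, s, t = roots_from_parameters(a, b)
        q1 = constant_term(r, s, t)
        assert r * r + r * s + s * s == 1          # so p = 1
        for x in (r, s, t):
            assert x ** 3 - x + q1 == 0
        print((a, b), (r, s, t), q1)
```

For (a, b) = (2, 1) this gives the roots -3/7, 8/7, -5/7 and $q_1 = -120/343$.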


ON THE POSING OF PROBLEMS 


Modeling is rightly receiving increasing attention as an important part of mathematics. A 
problem which arises from a practical situation usually needs formalizing before it can be 
attacked. In the course of this process, some features of the problem may get omitted or modified, 
so that the first problem may be to decide what the problem is. Different interpretations will lead 
to different results; it is often a matter of opinion as to which ones are interesting. It is a pity if a 
problem is lost because it is initially not well-posed, or if the questions are properly posed but turn 
out not to be the best ones to ask. 

The following is one of a number of editorial paraphrases that it may be convenient to make of 
papers that have been submitted, or of correspondence and conversations made with referees and 
others. Readers may be able to ask, and answer, better questions than those which occurred to the 
author, the referee and the editor. 


A Problem Suggested by Celtic Art 
Douglas A. Engel, 2935 W. Chenango, Englewood, CO 80110 


Ancient Celtic artists sometimes used the following device to produce braided patterns, 
although they did not exactly follow the instructions given here, which have been crystallized into 
a precise problem. 

Draw a graph on a rectangular array of m rows of n points, where m and n are coprime and 


122 RICHARD K. GUY [February 


just one of m and n is even. The mn/2 edges of the graph form a matching, or regular graph of 
valence one. In Figure 1(a), m = 2 and n = 3. Next arrange mn copies of the graph in n rows of 
m, to produce a square array of mn rows of mn points connected in pairs by $m^2n^2/2$ edges, as in 
Figure 1(b). Thirdly rotate this square array through 90° about its centre, as in Figure 1(c). 
Finally, superimpose this rotation onto the original square array to give a pattern as in Figure 
1(d). This is a regular graph of valence 2 and so consists of a number of disjoint cycles. For 
example, the 36 edges in Figure 1(d) form a 4-cycle (shown dotted) and a 32-cycle. 
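
The construction just described is easy to mechanize, which may help with the computer searches suggested later. The sketch below makes several assumptions of our own: points are (row, column) pairs, a matching is a list of point pairs covering the m by n array exactly once, and the rotation sends (r, c) to (c, mn - 1 - r); the text does not fix the sense of rotation. Because every point has valence 2, each connected component is a cycle, so the cycle lengths are just the component sizes.

```python
from collections import deque


def cycle_lengths(m, n, matching):
    """Sorted cycle lengths of the pattern built from a matching on an m-by-n array."""
    size = m * n
    adjacency = {(r, c): [] for r in range(size) for c in range(size)}

    def add_edge(a, b):
        adjacency[a].append(b)
        adjacency[b].append(a)

    def rotate(point):
        # Quarter turn of the square array about its centre (direction assumed).
        r, c = point
        return (c, size - 1 - r)

    # n rows of m copies of the m-by-n array, plus the rotated image of each edge.
    for block_row in range(n):
        for block_col in range(m):
            for (r1, c1), (r2, c2) in matching:
                a = (block_row * m + r1, block_col * n + c1)
                b = (block_row * m + r2, block_col * n + c2)
                add_edge(a, b)
                add_edge(rotate(a), rotate(b))

    seen, lengths = set(), []
    for start in adjacency:
        if start in seen:
            continue
        seen.add(start)
        queue, count = deque([start]), 0
        while queue:
            v = queue.popleft()
            count += 1
            for w in adjacency[v]:
                if w not in seen:
                    seen.add(w)
                    queue.append(w)
        lengths.append(count)
    return sorted(lengths)


def is_hamilton_cycle(m, n, matching):
    """True when the whole pattern is a single cycle."""
    return len(cycle_lengths(m, n, matching)) == 1


if __name__ == "__main__":
    print(cycle_lengths(1, 2, [((0, 0), (0, 1))]))
    print(cycle_lengths(1, 4, [((0, 0), (0, 1)), ((0, 2), (0, 3))]))
```

An exhaustive search for small m and n would enumerate all matchings and keep those for which `is_hamilton_cycle` returns true.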

The Celtic artists preferred their finished product to be a single $m^2n^2$-cycle or hamilton cycle. 


FIG. 1 


Problem 1. What are the conditions on the original matching in order that the final pattern is a 
(single) hamilton cycle? 

If m = 1 and n = 2k, then we always generate $k^2$ 4-circuits, so that there is a hamilton cycle 
only in the trivial case m = 1, n = 2. Some solutions for the case m = 3, n = 4 are shown in 
Figure 2. [Mr. Engel has sent this editor a model of the fourth of these, made from wire and 
perspex; it is now on display in his Department.] 

It may be possible to answer the next two problems by using a computer. 


Problem 2. Find all matchings which lead to a hamilton cycle when m = 3 and n = 4. 
Problem 3. Find such a matching when m = 4 and n = 5. 


FIG. 2 


As m and n increase, solutions seem harder and harder to find, but this is a common [...] 
in combinatorial searches. The number of different matchings is something like $(mn/e)^{mn/2}$, 
so that, even if the number of solutions increases exponentially, say like $c^{mn}$ for some 
constant c, then the chance of finding one at random is $(c^2e/mn)^{mn/2}$ which soon becomes 
hopelessly small as m and n increase. 


Problem 4. Are there values of m, n with (m, n) = 1 and mn even for which no matching 
[leads to a hamilton cycle]? 


Problem 5. Find bounds, or even an asymptotic formula, for the number of distinct solutions 
for a given m and n. 

If m = 2 and n is odd, then solutions are known (Figure 3) which have a fourfold symmetry of 
rotation (through 90°). 


FIG. 3 


Problem 6. Find such symmetrical solutions for 3 < m < n. 


Similar problems may be formulated for the case where the square array is reflected as well as 
rotated before being superimposed on the original. We can also ask how to formulate corresponding 
problems in three dimensions, based on crystallographic lattices. 


Reference 


George Bain, Celtic Art, Dover, New York, 1973. 


MISCELLANEA 


93. I once heard the great Richard Owen [the biologist] say... that he would like to see Homo 
Mathematicus constituted into a distinct subclass, thereby suggesting to my mind sensation, 
perception, reflection, abstraction, as the successive stages or phases of protoplasm on its way to 
being made perfect in Mathematicised Man. 


—J. J. Sylvester, 1870. (Reprinted in Collected Mathematical 
Papers, vol. 2, Cambridge, 1908, p. 652.) 
Now this extension is uniformly continuous on [0, 1], so δ is uniformly continuous on (0, 1]. Now 
we regard δ as a function of both x and ε, and the proof is complete. 


The uniformly continuous functions are not characterized by the property that they have a 
uniformly continuous modulus of continuity, as the following example shows. 


EXAMPLE. The function f(x) = 1/x on (0, 1] is continuous, but not uniformly continuous. 
However, one may choose $\delta(x, \varepsilon) = \varepsilon x^2/2$, which is uniformly continuous. 


The theorem for the Lipschitz condition is very easy to prove. 


THEOREM 3. If f: X → Y satisfies a Lipschitz condition on X, then δ: X × (0, ∞) → (0, ∞) may 
be chosen to satisfy a Lipschitz condition. 


Proof. Let M > 0 be a real number so that ρ(f(x), f(y)) ≤ Mσ(x, y) for all x, y ∈ X. Then 
δ(x, ε) = ε/M is a modulus of continuity of f which satisfies a Lipschitz condition where the 
constant is 1/M. 


References 


1. J. Dugundji, Topology, Allyn and Bacon, Boston, 1966. 
2. S. B. Seidman and J. A. Childress, A continuous modulus of continuity, this MONTHLY, 82 (1975) 253-254. 


APPLICATIONS OF A SIMPLE COUNTING TECHNIQUE 


MELVIN HAUSNER 
Courant Institute of Mathematical Sciences, New York University, New York, NY 10012 


In this note, we show how the simple counting technique of Theorem 1 can be applied in 
several different settings. Throughout, X denotes a finite set, and p denotes a prime number. For 
A C X, let |A| represent the number of elements in A. 


1. THEOREM. Let f: X → X with $f^p = I$ (the identity map). Let $X_0$ be the set of fixed points of f: 
$X_0 = \{x \in X \mid f(x) = x\}$. Then $|X| \equiv |X_0|$ mod p. 


Proof. For any x ∈ X, we define the orbit $\bar{x}$ of x as the set $\{x, f(x), \dots, f^{p-1}(x)\}$. It is an easy 
matter to show that the orbits partition X. Clearly, $|\bar{x}| = 1$ if and only if x = f(x), that is, if 
$x \in X_0$. Now we claim that if $|\bar{x}| > 1$, then $|\bar{x}| = p$. For if we had any duplication in $\bar{x}$, then 
$f^i(x) = f^j(x)$ for some i, j, $0 \le i < j < p$, so that $f^{j-i}(x) = x$. Since $f^p(x) = x$ and (j - i, p) 
= 1, it follows that f(x) = x, and hence $|\bar{x}| = 1$. 

Finally, since there are $|X_0|$ orbits of length 1, and all other orbits, say n of them, have length p, 
we have $|X| = |X_0| + np$. This yields the required congruence. 


Probably the most familiar use of Theorem 1 is for the solution of the following problem 
in group theory. “If G is a group of even order, prove it has an element a ≠ e (the identity) 
satisfying $a^2 = e$.” (For example, see Herstein [1], p. 35, problem 11.) The familiar proof pairs off 
x and $x^{-1}$ when they are distinct. The remaining even number of group elements satisfy $x = x^{-1}$ 
or $x^2 = e$. Since e is one such element, there are an odd number of elements x ≠ e with $x^2 = e$. 
In our notation, X = G, p = 2, $f(x) = x^{-1}$, so $f^2 = I$, and $X_0 = \{x \mid x = x^{-1}\} = \{x \mid x^2 = e\}$. By 
Theorem 1, $|X_0| \equiv |X| \equiv 0$ mod 2. McKay [3] gives a nice direct generalization of this proof which 
we put into our terminology. 


2. EXAMPLE (CAUCHY’S THEOREM for Groups). Let G be a finite group, p a prime dividing |G|. 
Then G has an element of order p. Indeed the number of such elements is congruent to -1 mod p. 


Proof. Let X = the set of p-tuples $(x_1, \dots, x_p)$ where $x_i \in G$, and $x_1x_2 \cdots x_p = e$. Note that 
$|X| = |G|^{p-1} \equiv 0$ mod p. Since $x_1x_2 \cdots x_p = e$ implies $x_2 \cdots x_px_1 = e$, we may define 
$f(x_1, \dots, x_p) = (x_2, \dots, x_p, x_1)$ and we have f: X → X, $f^p = I$. What is $X_0$? f(x) = x if and only 
if x = (g, g, ..., g) with $g^p = e$. Thus, $|X_0|$ = the number of g ∈ G satisfying $g^p = e$. By 
Theorem 1, $|X_0| \equiv |X| \equiv 0$ mod p. Since $|X_0|$ = 1 + number of elements of order p, this is the 
result. 
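
The counting in this proof can be checked by brute force on a small group. The following is only a sketch under stated assumptions: the group is taken to be the symmetric group on n letters, represented as tuples, and the names `compose` and `cauchy_counts` are illustrative and do not come from the note.

```python
from itertools import permutations, product

def compose(a, b):
    """Return the permutation a∘b (apply b first, then a)."""
    return tuple(a[b[i]] for i in range(len(b)))

def cauchy_counts(n, p):
    """Return (|X|, |X_0|, number of elements of order p) for G = S_n."""
    group = list(permutations(range(n)))
    identity = tuple(range(n))

    def product_of(tup):
        acc = identity
        for g in tup:
            acc = compose(acc, g)
        return acc

    # X: all p-tuples of group elements whose product is the identity.
    X = [t for t in product(group, repeat=p) if product_of(t) == identity]
    # X_0: tuples fixed by the cyclic shift (x_1,...,x_p) -> (x_2,...,x_p,x_1).
    X0 = [t for t in X if t[1:] + t[:1] == t]
    # Discard the constant tuple (e,...,e) to count elements of order p.
    return len(X), len(X0), len(X0) - 1

size_X, size_X0, order_p = cauchy_counts(3, 3)
```

For the group of order 6 and p = 3 this enumerates 216 tuples, of which those with product e form X; the fixed tuples are the constant ones (g, ..., g) with $g^p = e$.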


3. EXAMPLE (FERMAT’S THEOREM). $n^p \equiv n$ mod p. 


Proof. Consider the set X of all lattice points $(x_1, \dots, x_p)$ with $1 \le x_i \le n$ and let 
$f(x_1, \dots, x_p) = (x_2, \dots, x_p, x_1)$. Clearly $f^p = I$ and $|X| = n^p$, $|X_0| = n$. Theorem 1 gives the result. 
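
As an illustration, the orbit structure used in Theorem 1 can be listed explicitly for the cyclic shift on lattice points. This is a minimal sketch; the function names `fermat_counts` and `orbit_sizes` are assumptions made for the example.

```python
from itertools import product

def fermat_counts(n, p):
    """Return (|X|, |X_0|) for X = {1,...,n}^p under the cyclic shift."""
    X = list(product(range(1, n + 1), repeat=p))
    X0 = [t for t in X if t[1:] + t[:1] == t]  # only the constant tuples
    return len(X), len(X0)

def orbit_sizes(n, p):
    """Sizes of the orbits of the cyclic shift on {1,...,n}^p."""
    seen, sizes = set(), []
    for t in product(range(1, n + 1), repeat=p):
        if t in seen:
            continue
        orbit, cur = set(), t
        while cur not in orbit:
            orbit.add(cur)
            cur = cur[1:] + cur[:1]
        seen |= orbit
        sizes.append(len(orbit))
    return sizes
```

When p is prime, every orbit has size 1 or p, so the difference between $n^p$ and n is a multiple of p.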


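4. EXAMPLE (LUCAS’ THEOREM). Suppose 

$$n = n_0 + n_1 p + \cdots + n_k p^k, \qquad r = r_0 + r_1 p + \cdots + r_k p^k$$

with $0 \le n_i, r_i < p$. Then 

$$\binom{n}{r} \equiv \binom{n_0}{r_0}\binom{n_1}{r_1}\cdots\binom{n_k}{r_k} \pmod p.$$

(Note: $\binom{a}{b} = 0$ if b > a.) 


Proof. If we write $n = Np + n_0$, $r = Rp + r_0$, where $0 \le n_0, r_0 < p$, it suffices to prove 

$$\binom{n}{r} \equiv \binom{N}{R}\binom{n_0}{r_0} \pmod p.$$

We define $A_i = \{(i, 1), \dots, (i, N)\}$ for i = 1, ..., p and $B = \{(0, 1), \dots, (0, n_0)\}$. Then, setting 

$$A = A_1 \cup \cdots \cup A_p \cup B$$

we have $|A| = Np + n_0 = n$. We now define f: A → A by cyclically moving the $A_i$’s and keeping 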
B fixed: 


$$f(i, x) = (i + 1, x), \quad 1 \le i \le p - 1,$$
$$f(p, x) = (1, x),$$
$$f(0, x) = (0, x).$$

Thus, 

$$f(A_i) = A_{i+1} \ (1 \le i \le p - 1), \quad f(A_p) = A_1, \quad f(B) = B.$$

Clearly $f^p = I$. 


We now take X as the collection of subsets C ⊂ A with |C| = r. Since f: A → A, f acts naturally 
on subsets of A: $f(C) = \{f(x) \mid x \in C\}$. Since f is 1-1, |f(C)| = |C|, so f: X → X, with $f^p = I$. 


Clearly, 

$$|X| = \binom{n}{r}.$$

We now find $|X_0|$. Any subset C of A can be written uniquely as 

$$C = C_1 \cup \cdots \cup C_p \cup C_0$$

where $C_i \subset A_i$, $C_0 \subset B$. Since f sends the $A_i$ cyclically around, and keeps B fixed, we see that 
f(C) = C if and only if 

$$C_i = f^{i-1}(C_1), \quad i = 1, \dots, p.$$


For C ∈ X, we must have |C| = r, and if $C \in X_0$, we have $|C| = p|C_1| + |C_0| = r = Rp + r_0$. 
Note that $0 \le |C_0|, r_0 < p$. Thus the cardinality restriction on C is satisfied if and only if $|C_1| = R$, 
$|C_0| = r_0$. But there are $\binom{N}{R}$ such choices for $C_1$ and $\binom{n_0}{r_0}$ independent choices for $C_0$. Thus, 

$$|X_0| = \binom{N}{R}\binom{n_0}{r_0}.$$

Theorem 1 then provides the desired conclusion. 
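
The fixed-point count in this proof can also be confirmed by direct enumeration for small parameters. The sketch below builds the set A and the map f as described above; the function name `lucas_fixed_points` and its argument order are assumptions made for the example.

```python
from itertools import combinations
from math import comb

def lucas_fixed_points(N, n0, r, p):
    """Brute-force (|X|, |X_0|) for r-subsets of A = A_1 ∪ ... ∪ A_p ∪ B."""
    A = [(i, x) for i in range(1, p + 1) for x in range(1, N + 1)]
    A += [(0, x) for x in range(1, n0 + 1)]  # the block B

    def f(pt):
        i, x = pt
        if i == 0:
            return pt              # B is kept fixed
        return (i % p + 1, x)      # A_i -> A_{i+1}, and A_p -> A_1

    total = fixed = 0
    for C in combinations(A, r):
        total += 1
        if {f(pt) for pt in C} == set(C):
            fixed += 1
    return total, fixed

total, fixed = lucas_fixed_points(N=2, n0=1, r=4, p=3)
```

With p = 3, N = 2, $n_0 = 1$ (so n = 7) and r = 4 (so R = 1, $r_0 = 1$), the enumeration can be compared with `comb(7, 4)` and `comb(2, 1) * comb(1, 1)`.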
THE MEANING OF THE CONJECTURE P ≠ NP FOR MATHEMATICAL LOGIC 


JAN MYCIELSKI 
Department of Mathematics, University of Colorado, Boulder, Colorado 80309 


I think that the conjecture P ≠ NP is not as widely taught in courses of mathematical logic as 
it should be, in view of its capital importance for the foundation of mathematics. Therefore I am 
writing this note in the hope that all logicians will always include it in the introductory courses of 
their subject although it does not appear yet in the appropriate books. The original paper of S. 
Cook [1], where the conjecture was formulated, was indeed written from the point of view of logic 
but it became the domain of computer scientists (see [2]), particularly because of a paper of Karp 
[3] where the combinatorial or computational aspects of the conjecture were developed in a very 
suggestive way. 

Let us assume that the teacher has already presented the concept of a first order theory and the 
concept of a Turing machine (neither the concepts of a decidable theory nor that of a recursive 
function are needed). Then he may proceed as follows: 

By a normal theory we shall mean a theory which is formalized with a finite alphabet in first 
order logic with equality and is axiomatizable by a finite set of axioms and axiom schemata in 
which one can prove ∃xy[x ≠ y]. (In [4] it is proved that every theory which is recursively 
axiomatizable and contains a minimal amount of arithmetic or set theory is normal.) By a proof in 
a normal theory we mean a Hilbert style proof from the axioms. 

Let Σ be a finite alphabet and Σ* the set of all words, i.e., finite sequences of elements of Σ. 
For any ξ ∈ Σ*, |ξ| denotes the length of ξ. Now we introduce a more abstract concept of a 
theory which we will call a τπ-theory. 

A τπ-theory is a set of pairs T ⊆ Σ* × Σ* such that there exists a polynomial P(x, y) and a 
Turing machine M such that, for any (τ, π) ∈ Σ* × Σ*, M can decide in time ≤ P(|τ|, |π|) if 
(τ, π) ∈ T. 

If (τ, π) ∈ T, then τ is called a theorem of T and π is called a proof of τ in T. 

Every normal theory defines a τπ-theory since the time necessary to check the correctness of a 
Hilbert style proof in a normal theory can be estimated from above by a polynomial in the length 
of that proof. 

Now, a τπ-theory T will be called amenable (to automatization) iff there exists another 
polynomial $P_0(x, y)$ and another Turing machine $M_0$ such that, given any word τ ∈ Σ* and any 
positive integer n, $M_0$ can decide in time ≤ $P_0(|\tau|, n)$ if there exists a π ∈ Σ* with |π| ≤ n and 
such that (τ, π) ∈ T. (Notice that if we replaced the condition ≤ $P_0(|\tau|, n)$ by the condition 
≤ $P_0(|\tau|, c^n)$ where c = card Σ, then the concept would trivialize since every τπ-theory would be 
amenable. In fact, given a time $P_0(|\tau|, c^n)$, the machine can form all sequences of symbols of 
length at most n and find out if any of them is a proof of τ.) 
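
The exhaustive search mentioned in the parenthetical remark can be sketched as follows. This is only an illustration: `check` stands in for the polynomial-time machine M, and the toy theory used to exercise it is hypothetical and chosen only so that the example runs.

```python
from itertools import product

def brute_force_provable(tau, n, alphabet, check):
    """Return some word pi with len(pi) <= n and check(tau, pi), else None.

    The loop visits up to 1 + c + ... + c^n words, where c = len(alphabet),
    so the running time grows exponentially with n.
    """
    for length in range(n + 1):
        for letters in product(alphabet, repeat=length):
            pi = "".join(letters)
            if check(tau, pi):
                return pi
    return None

# Hypothetical toy theory over the alphabet {0, 1}: pi "proves" tau
# exactly when pi is the reversal of tau (checkable in linear time).
def toy_check(tau, pi):
    return pi == tau[::-1]

found = brute_force_provable("011", 3, "01", toy_check)
```

Amenability asks for a machine that reaches the same decision in time polynomial in |τ| and n, without running through all candidate words in this way.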

It is clear that, after Gödel’s 1931 discovery that all sufficiently strong theories are undecidable, 
the next question which presents itself is to ask if the τπ-theories (corresponding to normal 
theories) are amenable. But we had to wait until 1970 (the paper of Cook [1]) for a clear statement 
of that question. It can be asked in many equivalent ways and the proofs of their equivalence are 
very ingenious (see [2]). Now we will state three such ways which are most striking to the logician. 


PROPOSITION. The following three statements are equivalent to each other: 


(i) There exists a τπ-theory which is not amenable. 

(ii) Every τπ-theory which is defined by a normal theory is not amenable. 

(iii) P ≠ NP. 

This proposition follows immediately from the theorem of Cook that the set of satisfiable 
formulas of propositional calculus is NP-complete. (We refer the reader to [1], [2], [3].) It is not 
known if P ≠ NP; this is the conjecture of Cook. I believe that this is the most outstanding 
problem of contemporary mathematics (I rank it higher than the Riemann ζ conjecture). The 
above proposition is the best way of explaining the conjecture’s capital importance in foundations 
of mathematics. (Its great importance in theoretical computer science is well known (see [2]).) 
FROM CENTER OF GRAVITY TO BERNSTEIN’S THEOREM 


RAY REDHEFFER 
Department of Mathematics, University of California, Los Angeles CA 90024 


Let w(x) be continuous for x ≥ 0 and positive for x > 0. If w(x) is thought to be the density 
of a rod at point x, then 

$$\frac{\int_0^x t\,w(t)\,dt}{\int_0^x w(t)\,dt}$$

is the center of gravity of the part of the rod on [0, x] and, on physical grounds, this expression 
must be an increasing function of x. The mathematical reason is that the factor t in the numerator 
is increasing. In fact if φ is increasing for x > 0 and φw is continuous for x ≥ 0, then 

$$\frac{\int_0^x \phi(t)w(t)\,dt}{\int_0^x w(t)\,dt} \tag{1}$$

is increasing for x > 0. This is seen when we differentiate by the quotient rule; the numerator of 
the resulting expression is 

$$w(x)\left[\phi(x)\int_0^x w(t)\,dt - \int_0^x w(t)\phi(t)\,dt\right] \ge 0.$$
More generally, for any continuous function h let 

$$J_n(h, x) = \int_0^x \int_0^{t_n} \cdots \int_0^{t_2} h(t_1)\,dt_1 \cdots dt_{n-1}\,dt_n, \quad n \ge 2,$$

$$J_1(h, x) = \int_0^x h(s)\,ds.$$

Then, with w and φ as above, 

$$\frac{J_n(\phi w, x)}{J_n(w, x)} \tag{2}$$

is increasing for x > 0. 

This is true for n = 1 by (1). If it holds for n - 1 then $J_{n-1}(\phi w, x) = \Phi(x)J_{n-1}(w, x)$ where Φ is 
increasing. With $W(x) = J_{n-1}(w, x)$ the numerator and denominator in (2) are respectively 

$$\int_0^x \Phi(t)W(t)\,dt, \qquad \int_0^x W(t)\,dt.$$

Hence, (2) follows from (1) applied to (Φ, W) instead of (φ, w). 

Suppose now that f is a real-valued function satisfying $f^{(n)}(x) \ge 0$ for 0 ≤ x < p, where p > 0 
and n = 0, 1, 2, .... Bernstein’s theorem states that the Taylor series for f converges to f(x) on 
[0, p) (and hence f is the restriction of an analytic function f(z), |z| < p). 

To prove Bernstein’s theorem, let us integrate $f^{(n)}(x)$ from 0 to x repeatedly as in [3]. The 
result is 

$$f(x) = f(0) + f'(0)x + f''(0)\frac{x^2}{2!} + \cdots + f^{(n-1)}(0)\frac{x^{n-1}}{(n-1)!} + R_n(x) \tag{3}$$

where for n ≥ 2 

$$R_n(x) = \int_0^x \int_0^{t_n} \cdots \int_0^{t_2} f^{(n)}(t_1)\,dt_1 \cdots dt_{n-1}\,dt_n = J_n(f^{(n)}, x).$$

Since $f^{(n+1)}(x) \ge 0$ the function $\phi = f^{(n)}$ is increasing and (2) with w = 1 shows that $R_n(x)/x^n$ 
is increasing also. Hence 

$$\frac{R_n(x)}{x^n} \le \frac{R_n(r)}{r^n}, \qquad 0 < x < r < p. \tag{4}$$

The hypothesis $f^{(n)}(x) \ge 0$ shows that $R_n(x) \ge 0$ and Equation (3) with $f^{(j)}(0) \ge 0$, 
j = 0, 1, 2, ..., n - 1 gives $R_n(r) \le f(r)$. Thus, 

$$0 \le R_n(r) \le f(r). \tag{5}$$

Combining (4) and (5) gives $0 \le R_n(x) \le (x/r)^n f(r)$, hence $R_n(x) \to 0$ for 0 ≤ x < r, and this 
is Bernstein’s theorem. 

In comparing with other procedures [1], [2] it should be stated that, once monotonicity of 
$R_n(x)/x^n$ is established, completion of the argument by (4) and (5) is common to all. The novelty 
here consists in the proof of monotonicity. 
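
The bound obtained from (4) and (5) can be examined numerically in a simple case. The sketch below assumes f(x) = $e^x$, for which every derivative is nonnegative and the remainder equals the tail of the exponential series; the function names and the grid size are illustrative choices, not part of the note.

```python
import math

def remainder(n, x, terms=60):
    """Taylor tail R_n(x) = sum over j >= n of x^j / j! for f = exp (truncated)."""
    return sum(x**j / math.factorial(j) for j in range(n, n + terms))

def check_bernstein_bound(n, r, steps=50):
    """Check 0 <= R_n(x) <= (x/r)^n f(r) and that R_n(x)/x^n increases on (0, r)."""
    xs = [r * k / steps for k in range(1, steps)]
    bound_ok = all(0 <= remainder(n, x) <= (x / r) ** n * math.exp(r) for x in xs)
    ratios = [remainder(n, x) / x**n for x in xs]
    monotone = all(a <= b for a, b in zip(ratios, ratios[1:]))
    return bound_ok and monotone

ok = check_bernstein_bound(5, 2.0)
```

A grid check of this kind is no substitute for the proof; it only shows how the factor $(x/r)^n$ forces the remainder to zero for fixed x < r.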
A NOTE ON LAGRANGE’S THEOREM 


WELLS JOHNSON 
Department of Mathematics, Bowdoin College, Brunswick, Maine 04011 


It has become traditional in elementary abstract algebra courses to prove Lagrange’s theorem 
(that the order of a subgroup divides the order of the group) and then to deduce from it the 
immediate corollary that in a finite group the order of an element always divides the order of the 
group. While this approach is undoubtedly the most efficient way to prove these results, we 
suggest below an alternative development that may be better pedagogically. Teachers do not 
always appreciate the fact that beginning algebra students often find the proof of Lagrange’s 
theorem abstract and difficult. The difficulty arises from the fact that the usual proof combines 
two concepts which students probably have not encountered previously, namely, the idea of an 
equivalence relation and the notion of right or left cosets. While most students become adept at 
applying the result of Lagrange’s theorem in different situations, only the very best ones seem to 
be able to reproduce its proof with any accuracy and understanding. 

Our approach has been to reverse the order in the presentation of these two theorems. Many 
texts already define the order of an element in a group before defining the concept of a subgroup. 
After computing a few examples, students are easily led to the conjecture that the order of an 
element divides the order of the group. Historically, of course, this is a much older result than 
Lagrange’s theorem. In number theory, the ideas go back to Fermat in the form that a prime p 
divides $a^p - a$ for any integer a. 

This result is also easier to prove, especially if the group G is abelian. If x is a fixed element of 
G, then the map “left multiplication by x,” $f_x$: G → G defined by $f_x(g) = xg$, is a permutation of 
the elements of G. In the case that G is abelian and has order n, then 

$$\prod_{g \in G} g = \prod_{g \in G} (xg) = x^n \prod_{g \in G} g,$$

and thus $x^n = e$, the identity of G. It follows easily that the order of x divides n. 

In the general nonabelian case, the proof is also easy if we are permitted to use some 
elementary facts about permutations. The permutation $f_x$ defined above decomposes into disjoint 
cycles, each of the form $(g \ xg \ x^2g \ \dots \ x^{k-1}g)$. The length k of a cycle exactly equals the order of 
x in G and hence is independent of the choice of g that begins the cycle. Thus all cycles have the 
same length, and the order of x times the number of distinct cycles equals the order of G, proving 
the result. 
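
A class might see this cycle structure in a small nonabelian example before meeting the proof. The following sketch uses the symmetric group on three letters, with permutations stored as tuples; the helper names `compose`, `order` and `cycles_of_left_mult` are assumptions made for the illustration.

```python
from itertools import permutations

def compose(a, b):
    """Return the permutation a∘b (apply b first, then a)."""
    return tuple(a[b[i]] for i in range(len(b)))

def order(x):
    """Order of the permutation x."""
    e = tuple(range(len(x)))
    k, cur = 1, x
    while cur != e:
        cur = compose(x, cur)
        k += 1
    return k

def cycles_of_left_mult(x, group):
    """Cycle decomposition of the map g -> xg on the whole group."""
    seen, cycles = set(), []
    for g in group:
        if g in seen:
            continue
        cycle, cur = [], g
        while cur not in seen:
            seen.add(cur)
            cycle.append(cur)
            cur = compose(x, cur)
        cycles.append(cycle)
    return cycles

S3 = list(permutations(range(3)))
lengths = {x: [len(c) for c in cycles_of_left_mult(x, S3)] for x in S3}
```

For each x the dictionary `lengths` lists cycles of one common length, the order of x, and the lengths add up to 6.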

Having proved this theorem, we can now motivate the concepts needed for the proof of 
Lagrange’s theorem. The idea of an equivalence relation can be introduced quite naturally by 
declaring two elements of G to be “equivalent” if they belong to the same cycle of the 
permutation $f_x$. It is easy to show, moreover, that two elements g and h are equivalent if and only 
if the product $gh^{-1}$ is a power of x, i.e., in the cyclic subgroup generated by x. Now we are ready 
to generalize to arbitrary subgroups. The decomposition of $f_x$ into disjoint cycles generalizes to the 
decomposition of G into disjoint right cosets and Lagrange’s theorem follows easily. The point is 
that Lagrange’s theorem is a natural generalization of the decomposition of a permutation into 
disjoint cycles, and students who fail to see this miss an important conceptual and historical 
connection. 

An added dividend to this approach is that we establish very early on in the course the 
importance of the fact that left multiplication by x is a permutation of G. This not only aids in the 
construction of small group tables, but the statement and proof of Cayley’s theorem are not quite 
the mystery to beginning students that they usually are. Once equivalence relations are understood 
as an abstract principle, then the development of the class equation and its generalization to group 
action on sets can be motivated. The latter yields some very nice proofs of Sylow’s theorems. 
Developed in this way, an introduction to group theory proceeds from some rather simple, 
concrete results to some nontrivial general theorems, following a progression of ideas that are tied 
very closely to one another. The presentation demonstrates how mathematicians start with basic 
facts that depend upon some explicit computations, and then continually rework, reformulate, and 
generalize the ideas, transferring them to different settings and situations. Perhaps this kind of 
mathematical evolution is one of the most important lessons that we can teach in an introductory 
abstract algebra course. 


PROBLEMS AND SOLUTIONS 
EDITED BY DAVID BORWEIN, J. L. BRENNER, AND VLADIMIR DROBOT 


EDITOR EMERITUS: EMORY P. STARKE. COLLABORATING EDITORS: LEONARD CARLITZ, GULBANK D. 
CHAKERIAN, RICHARD A. GIBBS, RICHARD M. GRASSI, ISRAEL N. HERSTEIN, MURRAY S. KLAMKIN, DANIEL 
J. KLEITMAN, MARVIN MARCUS, W. C. WATERHOUSE, ALBERT WILANSKY, S. F. BAY AREA PROBLEMS GROUP: 
LEROY BEASLEY, VINCENT BRUNO, DAN FENDEL, JAMES FOSTER, CLARK GIVENS, ROBERT H. JOHNSON, 
DANIEL JURCA, FREDERICK W. LUTTMANN, LOUISE E. MOSER, DALE H. MUGLER, M. J. PELLING, HOWARD 
E. REINHARDT, BRUCE RICHMOND, AND EDWARD T. H. WANG. 


Send all proposed problems, in duplicate if possible, to Professor Vladimir Drobot, Department of Mathe- 
matics, University of Santa Clara, Santa Clara, CA 95053. Please include solutions, relevant references, etc. 

An asterisk (*) indicates that neither the proposer nor the editors supplied a solution. 

Solutions should be sent to the addresses given at the head of each problem set. 

A publishable solution must, above all, be correct. Given correctness, elegance and conciseness are preferred. 
The answer to the problem should appear right at the beginning. If your method yields a more general result, so 
much the better. If you discover that a MONTHLY problem has already been solved in the literature, you should of 
course tell the editors; include a copy of the solution if you can. 


ELEMENTARY PROBLEMS 


Solutions of these Elementary Problems should be mailed in duplicate to Professor Vladimir Drobot, 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by June 30, 1983. Please place 
the solver’s name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label (for 
acknowledgment). 


E 2986*. Proposed by Abdul Aziz, University of Kashmir, India. 


If P(z) is a polynomial of degree n with complex coefficients having all its zeros in |z| ≥ K, 
where K ≥ 1, prove or disprove the following two assertions: 


(i) $|P(K^2z) - P(z)| \le (K^n - 1)\max_{|z|=1}|P(z)|$ for |z| = 1. 
(ii) $|P(K^2z)| - |P(z)| \le (K^n - 1)|z|^n\max_{|z|=1}|P(z)|$ for |z| ≥ 1. 


E 2987. Proposed by George Tsintsifas, Thessaloniki, Greece. 


Let $S_n = A_1A_2 \dots A_{n+1}$ be an n-simplex in $R^n$ and M a point inside its circumsphere S: (O, R). 
The straight line $A_iM$ intersects the sphere (O, R) at the point $A_i'$. We denote 
$K = \sum_{i=1}^{n+1} |A_iM|/|MA_i'|$. Let G be the centroid of $S_n$. Prove: 


a) K > n + 1 iff M lies outside the sphere (s) with diameter OG. 
b) K = n + 1 iff M lies on the sphere (s). 
c) K < n + 1 iff M lies in the interior of the sphere (s). 


E 2988. Proposed by Mihaly Bencze, Brașov, Romania. 


For what values of n do the inequalities 

$$n + \ln\frac{n+2}{3} < 1 + \sqrt{2} + \sqrt[3]{3} + \cdots + \sqrt[n]{n} < 2n - \ln(n+1)$$

hold? 


SOLUTIONS OF ELEMENTARY PROBLEMS 
kth Divided Difference of a Function 


E 2900 [1981, 538]. Proposed by Edward Neuman, University of Wroclaw, Poland. 


The kth divided difference of a function f at the points $t_0, t_1, \dots, t_k$ is denoted by 
$[t_0, t_1, \dots, t_k]f$ and is defined inductively by 

$$[t_0]f = f(t_0), \qquad [t_0, t_1, \dots, t_{k+1}]f = \frac{[t_1, t_2, \dots, t_{k+1}]f - [t_0, t_1, \dots, t_k]f}{t_{k+1} - t_0}.$$

Show that, for positive integers k, l, 

$$[t_0, t_1, \dots, t_k]\,t^{k+l} = \sum t_{j_1}t_{j_2} \cdots t_{j_l},$$

the sum being extended over $0 \le j_1 \le j_2 \le \cdots \le j_l \le k$. 


Solution by Miroslav D. Ašić, London School of Economics. A solution to this problem can be 
found in [2], pp. 7-8, where it is solved by putting a conveniently chosen rational function into 
partial fractions. Here we give another proof based on the well-known formula [2, p. 10] for 
divided differences: $[t_0, \dots, t_k]f = D(f)/d$, where $d = \prod_{i<j}(t_i - t_j)$ and $D(f) = \det A_{(k+1)\times(k+1)}$, 
where A is the matrix with rth column $[f(t_r), t_r^{k-1}, t_r^{k-2}, \dots, 1]^T$ (0 ≤ r ≤ k). 
In our case we have $f(t) = t^{k+l}$. The determinant can be evaluated in much the same way as the 
Vandermonde determinant. For other methods of evaluating it, see [1, example 9, p. 27] or [3, 
§335, pp. 329-331]. 
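
The identity can be spot-checked with exact rational arithmetic. The sketch below follows the inductive definition from the problem statement and compares it with the sum over nondecreasing index sequences; the function names are illustrative, and the nodes are assumed to be distinct.

```python
from fractions import Fraction
from itertools import combinations_with_replacement
from math import prod

def divided_difference(f, ts):
    """[t_0, ..., t_k]f computed from the inductive definition (distinct nodes)."""
    if len(ts) == 1:
        return f(ts[0])
    upper = divided_difference(f, ts[1:])
    lower = divided_difference(f, ts[:-1])
    return (upper - lower) / (ts[-1] - ts[0])

def symmetric_sum(ts, l):
    """Sum of t_{j_1} ... t_{j_l} over 0 <= j_1 <= ... <= j_l <= k."""
    return sum(prod(c) for c in combinations_with_replacement(ts, l))

nodes = [Fraction(1), Fraction(2), Fraction(5), Fraction(-3)]  # k = 3
lhs = divided_difference(lambda t: t**5, nodes)                # l = 2
rhs = symmetric_sum(nodes, 2)
```

Such a check covers only the chosen nodes and exponents; the determinant argument above is what establishes the identity in general.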


References 


1. W. L. Ferrar, Algebra, Oxford 1941. 
2. L. M. Milne-Thomson, The calculus of finite differences, Macmillan, London, 1951. 
3. T. Muir, Determinants, Dover, 1960. 


Also solved by V. Anantharam, K. L. Bernstein, P. S. Bruckman, L. Kuipers (Switzerland), O. P. Lossers 
(Netherlands), W. A. Newcomb, B. N. Parlett, M. R. Railkar (India), R. E. Shafer, J. M. Stark, P. -Y. Wu (Taiwan), 
an anonymous solver, and the proposer. 


ADVANCED PROBLEMS 


Solutions of these Advanced problems should be mailed in duplicate to Professor David Borwein, Department of 
Mathematics, University of Western Ontario, London, Ontario, Canada N6A 5B9, by June 30, 1983. The solver’s 
full post-office address should be on each sheet. 


6421. Proposed by J. Beck, F. Galvin, and J. Pach, Hungarian Academy of Sciences and 
University of Kansas. 


Let E be a set of points in the plane with the property that every closed disc of radius 1 
includes at least one element of E. Prove that there exists a straight line L such that the 
orthogonal projection of E onto L is everywhere dense in L. 
SOLUTIONS OF ADVANCED PROBLEMS 
Dense Chains 


5540 [1967, 1269]. Proposed by G. F. Schumm, University of Chicago. 


If X and Y are chains, then X is said to be dense in Y if for every A, B ∈ Y, A ≠ B, there exists 
F ∈ X such that A ⊂ F ⊂ B (where ⊂ denotes a proper subset). Prove or disprove that for an 
uncountable chain Z there exists a countable chain dense in it if and only if there exists a 
countable set T such that (B - A) ∩ T ≠ ∅ for all A, B ∈ Z, A ⊂ B. Does there exist for every 
cardinal number ℵ, a chain of cardinality $2^{\aleph}$ having a chain of cardinality ℵ dense in it? 


Solution by M. Machover, St. John’s University, New York. In the first part the implication 
does not hold both ways. For Z = {(-∞, r) | r ∈ R} ∪ {(-∞, 0]} is an uncountable chain of 
intervals ordered by set inclusion and T = Q (the rationals) has the required property. Yet no 
chain X can be dense in Z for there is no M such that (-∞, 0) ⊂ M ⊂ (-∞, 0]. 

With an additional hypothesis we can make the bi-implication true and prove a more general 
theorem. By a jump in a chain Z we mean an ordered pair (A, B) with A, B ∈ Z, A ⊂ B, and 
with no set of Z strictly between A and B. Denseness of X in Y is interpreted so as not to require 
that Y contain X as a subchain since otherwise any chain Z with a jump would be a trivial 
counterexample. 


THEOREM 1. A chain Z of cardinality greater than the infinite cardinal α has a dense chain of 
cardinality α if and only if, 


(1) There exists a set T, card(T) ≤ α, such that (B \ A) ∩ T ≠ ∅ for all A, B ∈ Z, A ⊂ B; 
(2) Z contains no singleton jumps, i.e., for every jump (A, B) in Z, card(B \ A) ≥ 2. 


Proof. First, suppose Z has a dense chain X of cardinality α. Then, as the above illustration 
shows, Z does not contain any singleton jumps. Let $X_1 = \{N \setminus M \mid M, N \in X, M \subset N\}$ which has 
cardinality α and $\emptyset \notin X_1$. Let $T_1$ be a choice set for $X_1$, that is, from each pair M, N ∈ X, M ⊂ N 
choose a t ∈ N \ M (the same t may be chosen more than once). Then card($T_1$) ≤ α. 

If Z has no jumps, define $T_2 = \emptyset$; otherwise, let J be the set of jumps in Z. Clearly, if 
$(A_1, B_1)$, $(A_2, B_2)$ are distinct jumps, then either $B_1 \subseteq A_2$ or $B_2 \subseteq A_1$. It follows that card(J) ≤ α 
since there is an M ∈ X strictly between A and B for every jump (A, B). For each such jump 
choose t ∈ B \ A thereby obtaining a set $T_2$, card($T_2$) ≤ α, and set finally $T = T_1 \cup T_2$. Then 
card(T) ≤ α and if A, B ∈ Z, A ⊂ B and (A, B) is a jump it follows $T \cap (B \setminus A) \supseteq T_2 \cap (B \setminus A) \ne \emptyset$. 
Otherwise there exist D ∈ Z and M, N ∈ X such that A ⊂ M ⊂ D ⊂ N ⊂ B, so that 
$T \cap (B \setminus A) \supseteq T_1 \cap (B \setminus A) \ne \emptyset$. 

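Conversely, suppose properties (1) and (2) hold. Let $T_1 = \{t \in T \mid \exists A, B \in Z, A \subset B, t \in B \setminus A\}$, 
so that card($T_1$) ≤ α. For $t \in T_1$ define the nonempty sets 

$$\mathcal{A}(t) = \{A \in Z \mid \exists B \in Z, A \subset B, t \in B \setminus A\},$$
$$\mathcal{B}(t) = \{B \in Z \mid \exists A \in Z, A \subset B, t \in B \setminus A\}$$

and define functions h, g on $T_1$ by 

$$h(t) = \bigcup\{A \mid A \in \mathcal{A}(t)\}, \qquad g(t) = \bigcap\{B \mid B \in \mathcal{B}(t)\}, \qquad t \in T_1.$$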


If J is the set of jumps in Z, then card(J) ≤ α because the map that assigns to (A, B) ∈ J any 
element in (B \ A) ∩ T is (1, 1) with range in T. For every (A, B) ∈ J, B \ A has at least two 
elements, say $b_1, b_2$, and we define a map k on J by setting in every case $k(A, B) = A \cup \{b_1\}$. 

We assert that $X_1 = h(T_1) \cup g(T_1) \cup k(J)$ is a chain dense in Z. For, an analysis by cases, 
using in every case the linear ordering of Z, shows that $X_1$ is linearly ordered by inclusion. As for 
denseness let A, B ∈ Z, A ⊂ B and choose any t ∈ T ∩ (B \ A). Thus $t \in T_1$ and A ⊆ h(t) ⊆ 
g(t) ⊆ B. If h(t) ∉ Z, take M = h(t) and if h(t) ∈ Z but g(t) ∉ Z, take M = g(t). Otherwise 
(h(t), g(t)) ∈ J, in which case take M = k(h(t), g(t)). In any event $M \in X_1$ and A ⊂ M ⊂ B. 

Finally, since card($X_1$) ≤ α, we can obtain a dense chain X of cardinality α by adding to $X_1$ an 
arbitrary subset of Z of cardinality α. Q.E.D. 


The first part of the problem is solved by taking $\alpha = \aleph_0$ in the above theorem, noting that in 
property (1) of necessity card(T) = $\aleph_0$. The second part is answered by the following theorem, at 
least for Zermelo-Skolem-Fraenkel set theory with the axiom of choice (ZFC). 


THEOREM 2. Let π be the proposition that for every cardinal number ℵ there exists a chain of 
cardinality $2^{\aleph}$ having a chain of cardinality ℵ dense in it. If GCH denotes the generalized continuum 
hypothesis, then in ZFC, GCH ⇒ π but π is not equivalent to GCH. 


Proof. Sierpiński proved [W. Sierpiński, Sur un problème concernant les sous-ensembles 
croissants du continu, Fund. Math., 3(1922) 109-112] that for every ℵ there exists a chain of 
cardinality greater than ℵ containing a dense chain of cardinality ℵ, so that from an assumption 
of GCH it would follow that π was true. In fact, regarding ℵ in the usual way as an ordinal and an 
ordinal as equal to the set of all smaller ordinals, a slight modification of Sierpiński’s proof yields 
(assuming GCH) the following example. Let $Z_0 = \{f \mid f: \aleph \to \{0, 1\}\}$ and totally order $Z_0$ by $f_1 \le f_2$ 
if and only if $f_1 = f_2$ or $f_1(\alpha) < f_2(\alpha)$ at the least ordinal α where they differ. Let 
$X = \{f \in Z_0 \mid \exists \alpha < \aleph, f(\beta) = 0 \text{ for } \alpha < \beta < \aleph\}$, $Y = \{f \in Z_0 \mid \exists \alpha < \aleph, f(\beta) = 1 \text{ for } \alpha < \beta < \aleph\}$ 
and set $Z = Z_0 \setminus Y$. Then card(Z) = $2^{\aleph}$, card(X) = ℵ, and X ⊂ Z is dense in Z. To pass from chains 
of functions to chains of sets use the order-isomorphism $f \mapsto \{g \in Z \mid g \le f\}$. 

On the other hand W. Mitchell proved [W. Mitchell, Aronszajn Trees and the Independence of 
the Transfer Property, Annals of Mathematical Logic 5(1972) No. 1, 21-46] that the assertion π*: 
“if S is any infinite set, then P(S) (power set) contains a chain of cardinality $2^{\mathrm{card}(S)}$” is in ZFC 
implied by GCH but is not equivalent to it. So to complete the proof we need only show that 
π ⇔ π* in ZFC. Given π* let card(S) = ℵ and $Z_0$ be a chain in P(S) of cardinality $2^{\aleph}$. Deleting 
from $Z_0$ all sets B which occur in singleton jumps (A, B) we obtain a chain Z of cardinality $2^{\aleph}$ 
without singleton jumps and it follows from Theorem 1 with T = S that Z has a dense chain of 
cardinality ℵ. 

Conversely, given π let card(S) = ℵ and Z be any chain of sets of cardinality $2^{\aleph}$ having a 
dense chain X of cardinality ℵ. Let h: X → S be a (1, 1) correspondence and define g: Z → P(S) 
by $g(A) = \{x \in S \mid h^{-1}(x) \subset A\}$, A ∈ Z. Then g(Z) is a chain of subsets of S having cardinality 
$2^{\aleph}$. 


Borel Subsets of a Product Space 


6023* [1975, 308]. Proposed by S. J. Sidney, University of Connecticut. 


If for each $k$ in the uncountable index set $K$, $I_k$ denotes a copy of $[0,1]$ and $U_k$ denotes the 
copy of $(0, 1]$ contained therein, prove or disprove that $\prod_k U_k$ is a Borel set in the compact space 
$\prod_k I_k$. 


Solution by M. J. Pelling, Balliol College, Oxford. Let us generalize this problem and consider 
$Q = \prod_k U_k$ where each $U_k$ is a proper Borel subset of $I_k$ of Lebesgue measure 1. We prove that $Q$ 
is not a Borel set in $X = \prod_k I_k$ by constructing a regular Borel measure $\mu$ on $X$ and showing that $Q$ 
is not measurable. This will depend on showing that the measure of open and closed sets in $X$ can 
be defined in a particularly simple way. 

Construction of $\mu$. Let $B$ be the collection of all sets $J_0 \times L' \subset X$ where $J_0$ is any open subset of 
any copy $L = \prod_{i=1}^{n} I_{k_i}$ of $[0,1]^n$, $n \ge 1$, and $L' = \prod_{k \ne k_i} I_k$. $B$ is a base for the open sets of $X$. 
Lebesgue measure in $[0, 1]^n$ or its copy $L$ will be denoted $m$ ($n$ takes all values 1, 2, ... but context 
will clarify any ambiguity in $m$). For sets $J \in B$ define $\mu(J) = m(J_0)$ if $J = J_0 \times L'$. 

To extend $\mu$ to a regular Borel measure on $X$, we employ the theory of p. 231f in Halmos’ 
Measure Theory (D. Van Nostrand, 1950), referred to hereafter as HMT. On the compact 
(= closed as $X$ is compact Hausdorff) subsets of $X$ define 

$c(F) = \inf_{F \subset J \in B} \mu(J)$, $F$ compact. 

To show that $c$ is a content (HMT p. 231) for $X$, we verify Halmos’ defining conditions (a)-(d). It 
is obvious that 

(a) $0 \le c(F) \le 1 < \infty$, 

(b) $F_1 \subset F_2 \Rightarrow c(F_1) \le c(F_2)$. 

(c) If $F_1$ and $F_2$ are disjoint, then since $X$ is normal and compact and $B$ is a base for the open 
sets closed under finite unions, there are sets $J_1, J_2$ in $B$ such that $F_1 \subset J_1$, $F_2 \subset J_2$ and 
$J_1 \cap J_2 = \emptyset$. Since $\mu(J) \le \mu(J')$ if $J \subset J'$ and $\mu(J) + \mu(J') \ge \mu(J \cup J')$ with equality if $J, J'$ 
are disjoint, for any $J, J'$ in $B$, it follows readily that 

$c(F_1 \cup F_2) = c(F_1) + c(F_2)$. 

(d) If $F_1$ and $F_2$ are not necessarily disjoint, then from the properties of $\mu$ stated in (c) we have 

$c(F_1 \cup F_2) \le c(F_1) + c(F_2)$. 

Hence $c$ is a content for $X$. 

LEMMA. The content c is regular (HMT p. 237). 


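Proof. Given $F$ compact and $\varepsilon > 0$, suppose $F \subset J_0 \times L' \in B$ and $m(J_0) < c(F) + \varepsilon$. If $F^*$ is 
the projection of $F$ into $L$, then $F^*$ is compact, $F^* \subset J_0$ and $F \subset F^* \times L'$. Now $m$, regarded as a 
content in $L$, is certainly regular so there is a compact $F_0 \subset L$ such that $F^* \subset \operatorname{Int} F_0 \subset F_0$ and 
$m(F_0 \setminus F^*) < \varepsilon$. If $D = F_0 \times L'$, then $D$ is compact, $\operatorname{Int} D = (\operatorname{Int} F_0) \times L'$, $F \subset \operatorname{Int} D \subset D$ and 
$c(D) = mF_0 < mF^* + \varepsilon \le mJ_0 + \varepsilon < c(F) + 2\varepsilon$. Hence, 

$c(F) = \inf\{c(D) \mid F \subset \operatorname{Int} D \subset D \text{ compact}\}$ 

so that by definition $c$ is regular. Q.E.D. 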

It follows from HMT p. 237, Theorem A, that $c$ induces a regular Borel measure on $X$, $\mu$ say, 
such that for compact $F$, $c(F) = \mu(F)$. Our notation is permissible, i.e., this measure agrees with 
the $\mu$ originally defined on $B$, since given $J_0 \times L' \in B$, 

$mJ_0 = 1 - m(L \setminus J_0) = 1 - c((L \setminus J_0) \times L') = 1 - \mu(X \setminus (J_0 \times L'))$. 

All the above holds if $K$ is countably infinite but since then $X$ has a countable base for its open 
sets it is easily shown that $\mu$ is actually the product measure on $X$, each $I_k$ being assigned the Borel 
measure obtained by restricting Lebesgue measure. In this case we denote $\mu$ by $\mu_\infty$. When $K$ is 
uncountable, $\mu$ properly extends the product measure. 


The measure of open and closed sets. In the course of proving the lemma above it was shown 
incidentally that for compact $F$ 

$\mu(F) = \inf_{F \subset F_0 \times L'} \mu(F_0 \times L')$ where $F_0$ is a closed subset of $L$. 

Taking complements it follows that for any open $G$ in $X$, 

$\mu(G) = \sup_{J_0 \times L' \subset G} \mu(J_0 \times L')$ where $J_0$ is an open subset of $L$. 

Take a sequence $J_i \times L_i' \subset G$ such that $\mu(J_i \times L_i') \to \mu(G)$. Then 

$\bigcup_{i=1}^{\infty} J_i \times L_i' = T \times M' \subset G$ 

where $T$ is an open set in some countably infinite product $M = \prod_{j=1}^{\infty} I_{k_j}$ and $M' = \prod_{k \ne k_j} I_k$. 
Further $\mu(G) = \mu(T \times M') = \mu_\infty(T)$. 

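If now $G = \bigcup_\alpha J_\alpha$, $J_\alpha \in B$, let $S_\alpha$ be the projection of $J_\alpha$ into $M$ so $S = \bigcup_\alpha S_\alpha$ will be the 
projection of $G$ into $M$. $S$ and the $S_\alpha$ are all open in $M$ so that as $M$ has a countable base for its 
open sets, there is a countable collection $S_1, S_2, \ldots$ of the $S_\alpha$ such that $S = \bigcup_{i=1}^{\infty} S_i$. Each 
$\mu_\infty(S_i \setminus T) = 0$ else we would have $\mu(G) > \mu(T \times M')$ and so 

$\mu_\infty(S \setminus T) \le \sum_{i=1}^{\infty} \mu_\infty(S_i \setminus T) = 0$. 

Thus $\mu_\infty(S) = \mu_\infty(T) = \mu(G)$ where $S$ is open in $M$ and $G \subset S \times M'$. So we have shown the 
existence of a countable product $M$ and open sets $T$ and $S$ in $M$ such that 

$T \times M' \subset G \subset S \times M'$; $\mu(G) = \mu(T \times M') = \mu_\infty(T) = \mu(S \times M') = \mu_\infty(S)$. 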


Taking complements, if $F \subset X$ is closed, there is a countable product $M$ and closed sets $U, V$ in $M$ 
such that 

$U \times M' \subset F \subset V \times M'$; $\mu(F) = \mu(U \times M') = \mu_\infty(U) = \mu(V \times M') = \mu_\infty(V)$. 

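Conclusion. By the above, and the regularity of $\mu$, if $P \subset X$ is Borel, 

$\mu(P) = \inf_{P \subset S \times M'} \{\mu_\infty(S) \mid \text{open } S \subset M\} = \sup_{U \times M' \subset P} \{\mu_\infty(U) \mid \text{closed } U \subset M\}$. 

If then $Q$ were Borel, the second equation shows that $\mu(Q) = 0$ (any $U$ must be empty). In the 
first equation suppose $M = \prod_{j=1}^{\infty} I_{k_j}$ and $Q \subset S \times M'$. Then $\mu_\infty(\prod_{j=1}^{\infty} U_{k_j}) = 1$ so that $\mu_\infty(S) = 1$ 
and thus $\mu(Q) = 1$. This contradiction shows that $Q$ cannot be Borel. 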

By starting from different Borel measures, other than Lebesgue measure, on $[0, 1]$, it is possible 
to prove by the same kind of method that other sets in $X$ are not Borel. For example if 
$U_k = (a_k, b_k)$, $0 < a_k < b_k < 1$, then $\prod_k U_k$ is not Borel. 


An Exponential Sum 


6344 [1981, 352]. Proposed by M. Machover, St. John’s University. 


Let $\theta_1, \ldots, \theta_n$ be distinct angles (mod $2\pi$). Let $A_1, \ldots, A_n$ be nonzero complex numbers. Let $g$ 
be a function defined on the open interval $(a, b)$ such that $(x, g(x))$ describes a continuous 
simple arc as $x$ ranges over $(a, b)$. Consider the sum 

$S(x) = \sum_{j=1}^{n} A_j \exp(i((\sin\theta_j)x + (\cos\theta_j)g(x)))$. 


(a) For each even integer $n \ge 2$ find $A$’s, $\theta$’s, $g$, and $(a, b)$ such that $S(x) = 0$ for all 
$x \in (a, b)$. 

(b) Show that, if $n = 3$, then there is no choice of $A$’s, $\theta$’s, $g$ and $(a, b)$ such that $S(x) = 0$ for 
all $x \in (a, b)$. 

(c)* Conjecture: if $n$ is odd, then there is no choice of $A$’s, $\theta$’s, $g$ and $(a, b)$ such that $S(x) = 0$ 
for all $x \in (a, b)$. 


Solution by the proposer. (a) It is easy to see that $S(x) = 0$ for all $x \in (a, b)$ if $g(x) = 0$ for all 
$x \in (a, b)$ and 

$\theta_j = \pi - \theta_{n-j}$, $A_j = -A_{n-j}$ for $j = 1, 2, \ldots, n/2$. 
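
The cancellation in part (a) can be checked numerically. The following is a minimal sketch, not the proposer's own computation: the function names, the sample angles and the sample coefficients are all assumptions chosen for illustration. It pairs each angle t with pi - t and each coefficient A with -A, takes g identically zero, and evaluates S(x) at a few points for n = 4.

```python
import cmath
import math


def exponential_sum(coeffs, angles, g, x):
    """Evaluate S(x) = sum_j A_j * exp(i*(sin(theta_j)*x + cos(theta_j)*g(x)))."""
    y = g(x)
    return sum(
        a * cmath.exp(1j * (math.sin(t) * x + math.cos(t) * y))
        for a, t in zip(coeffs, angles)
    )


def paired_example(base_angles, base_coeffs):
    """Pair each angle t with pi - t and each coefficient A with -A (illustrative helper)."""
    angles = list(base_angles) + [math.pi - t for t in base_angles]
    coeffs = list(base_coeffs) + [-a for a in base_coeffs]
    return coeffs, angles


# Hypothetical sample data: two base angles and two nonzero coefficients, so n = 4.
coeffs, angles = paired_example([0.3, 1.1], [2 + 1j, -0.5j])


def zero(x):
    """The choice g(x) = 0 used in part (a)."""
    return 0.0


# Largest |S(x)| over sample points of the interval (0, 5).
max_abs = max(abs(exponential_sum(coeffs, angles, zero, k / 10)) for k in range(1, 50))
print(max_abs)
```

Since sin(pi - t) = sin t while cos(pi - t) = -cos t, the paired terms cancel exactly when g vanishes, and in general they no longer cancel once g is nonzero.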


(b) Suppose that $A_1 A_2 A_3 \ne 0$ and 

(1) $\sum_{j=1}^{3} A_j \exp(i(x \sin\theta_j + y \cos\theta_j)) = 0$ 

for $y = g(x)$, $a < x < b$. It suffices to show that two of the angles must be equal (mod $2\pi$). We 
may rewrite (1) in the form 

(2) $A_1 \exp(i(x(\sin\theta_1 - \sin\theta_3) + y(\cos\theta_1 - \cos\theta_3))) + A_2 \exp(i(x(\sin\theta_2 - \sin\theta_3) + y(\cos\theta_2 - \cos\theta_3))) = -A_3$. 

If we multiply each side of (2) by its conjugate and simplify we obtain 

(3) $x(\sin\theta_1 - \sin\theta_2) + y(\cos\theta_1 - \cos\theta_2) = \cos^{-1} a - \arg A_1 + \arg A_2$ 

where $a = \frac{1}{2|A_1||A_2|}(|A_3|^2 - |A_1|^2 - |A_2|^2)$. Similarly we can deduce from (1) that 

(4) $x(\sin\theta_2 - \sin\theta_3) + y(\cos\theta_2 - \cos\theta_3) = \cos^{-1} b - \arg A_2 + \arg A_3$ 

where $b = \frac{1}{2|A_2||A_3|}(|A_1|^2 - |A_2|^2 - |A_3|^2)$. The continuity of $g$ ensures that the multiple-valued 
right-hand sides of (3) and (4) remain constant as $x$ ranges over $(a, b)$. Since the linear system 
(3), (4) is satisfied by every point $(x, y)$ on the arc, the determinant of the system 

(5) $(\sin\theta_1 - \sin\theta_2)(\cos\theta_2 - \cos\theta_3) - (\sin\theta_2 - \sin\theta_3)(\cos\theta_1 - \cos\theta_2) = \sin(\theta_1 - \theta_2) + \sin(\theta_2 - \theta_3) + \sin(\theta_3 - \theta_1) = 0$. 

It is readily seen that (5) implies that at least two of the angles are equal (mod $2\pi$). 
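
One way to see the last step, offered here as a sketch rather than as the proposer's own argument, is to factor the sum of sines in (5). The abbreviations $u$ and $v$ below are introduced only for this illustration.

```latex
% Put u = \theta_1 - \theta_2 and v = \theta_2 - \theta_3, so \theta_3 - \theta_1 = -(u + v).
\begin{align*}
\sin u + \sin v - \sin(u+v)
  &= 2\sin\tfrac{u+v}{2}\cos\tfrac{u-v}{2} - 2\sin\tfrac{u+v}{2}\cos\tfrac{u+v}{2} \\
  &= 2\sin\tfrac{u+v}{2}\Bigl(\cos\tfrac{u-v}{2} - \cos\tfrac{u+v}{2}\Bigr) \\
  &= 4\sin\tfrac{u}{2}\,\sin\tfrac{v}{2}\,\sin\tfrac{u+v}{2}.
\end{align*}
% Hence (5) reads
%   4 \sin\tfrac{\theta_1-\theta_2}{2}\,\sin\tfrac{\theta_2-\theta_3}{2}\,\sin\tfrac{\theta_1-\theta_3}{2} = 0,
% so one of the three half-differences is an integer multiple of \pi.
```

A vanishing factor means that the corresponding difference of angles is an integer multiple of $2\pi$, which is the stated conclusion.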


The proposer appeals to continuity to justify (5). His argument, however, remains valid for any g since the points 
(x, y) satisfying (3) and (4) form an uncountable set whereas the set of possible pairs of right-hand side values is 
countable. Thus there are at least two pairs of values (x, y) satisfying (3) and (4) with the same right-hand pair. 
Conjecture (c)* remains open. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER, CARLETON COLLEGE 


Studies in Algebraic Geometry (M.A.A. Studies in Mathematics, Vol. 20). Edited by A. Seidenberg. 
Mathematical Association of America, Washington, D. C., 1980. xii + 143 pp. $16.00. 


KENNETH R. MOUNT 
Department of Mathematics, Northwestern University, Evanston, IL 60201 


Tracing genealogy (teacher to student) is usually of questionable utility. In the case of the book 
edited by Seidenberg (O. Zariski’s student), contemplating the line from the founders of algebraic 
geometry, that is from the founders as anointed by van der Waerden (one of the contributors to 
the volume), to the editor and contributors is at least amusing. For this family tree, the word 
algebraic in the phrase algebraic geometry is paramount. 

The fundamental source of problems and techniques in algebraic geometry is the study of plane 
algebraic curves. A plane algebraic curve is the set of zeros in a plane (over the complex numbers) 
of an irreducible polynomial in two variables, or it is the projective version of such a curve which 
is the set of zeros in the projective plane (over the complex numbers) of a homogeneous 
polynomial in three variables. 

Such studies have ancient roots, but most commentators on the history of algebraic geometry 
agree that one must consider the major development of the field to have started with the 
investigation of curves with more complex structure than that of the conics. 

Shafarevich (cf. [2]) points to questions asked and answered about integral calculus by the 
Bernoullis, Euler, and Abel as the first steps on the road to modern algebraic geometry. Euler’s 
papers on elliptic integrals were written in the late 18th century and treated the question of 
integrating expressions of the form $dx/\sqrt{p(x)}$ when $p(x)$ is a polynomial of degree 4 or less. 
Specifically he asked for the solution of the differential equation 


$dx/\sqrt{p(x)} + dy/\sqrt{p(y)} = 0$ 


and succeeded in finding the general solution. Euler catalogued the solutions for all the possible 
forms for $p(x)$; however, it is a bit easier to see why one looks upon this as a starting point for 
algebraic geometry if $p(x)$ is restricted to be a polynomial of degree 3 without multiple roots. One 
such example is $p(x) = 1 - x^3$. In this example the integral $\int dx/\sqrt{p(x)}$ can be interpreted 
as the integral $\int dx/y$ on the curve with equation 


$f(x, y) = y^2 - (1 - x^3) = 0$. 


This curve is topologically a torus with some holes at infinity. Normally one plugs the holes by 
embedding the curve in the projective plane and forming the closure of the embedded curve. The 
equation for this closure is relatively easy to find. For the example $f(x, y) = 0$, replace $x$ by $X/Z$, 
replace $y$ by $Y/Z$ and multiply the result by $Z^3$. The result is the homogeneous equation 


$F = ZY^2 - (Z^3 - X^3) = 0$. 


Actually in this example the homogeneous equation is that of a nonsingular curve in the projective 
plane. The curve is nonsingular because the partials $\partial F/\partial X$, $\partial F/\partial Y$ and $\partial F/\partial Z$ do not 
simultaneously vanish at any point on the curve in projective space. For the curve with equation 
$f(x, y) = 0$ the differential $dx/y$ determines a differential everywhere nonsingular on the projective 
curve. This means that for each point on the curve, together with a parameterization of the 
curve around the point, the differential can be written in the form $g\,dt$ where $g$ is analytic at the 
point. In other words $dx/y$ determines a differential of the first kind. A line integral of a 
differential of the first kind is an Abelian integral. No such differential of the first kind exists on 
the projective line, or equivalently, no such differentials of the first kind exist on nondegenerate 
conics. In fact the nonsingular cubic curves are topologically tori (genus 1 (real) surfaces) with a 
decidedly more intricate structure than the conics. Indeed the integral $\int dx/y$ cannot be written as 
a rational function in the simple functions $x$, $y$, $\sin x$, $\log x$, etc. Euler did show that the sum 


$\int dx/\sqrt{p(x)} + \int dy/\sqrt{p(y)}$ 


can be written as $\int dt/\sqrt{p(t)}$ and found an explicit representation for the function $t$. 
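
The nonsingularity asserted for the projective cubic in this example can be checked by hand. The short computation below is a sketch that assumes the example curve is $y^2 = 1 - x^3$, so that the homogeneous form is $ZY^2 - Z^3 + X^3$.

```latex
% Homogenization: Z^3 f(X/Z, Y/Z) with f(x,y) = y^2 - (1 - x^3).
\begin{align*}
F(X,Y,Z) &= ZY^2 - Z^3 + X^3, \\
\frac{\partial F}{\partial X} &= 3X^2, \qquad
\frac{\partial F}{\partial Y} = 2YZ, \qquad
\frac{\partial F}{\partial Z} = Y^2 - 3Z^2 .
\end{align*}
% If all three partials vanish, then X = 0, YZ = 0 and Y^2 = 3Z^2.
% YZ = 0 together with Y^2 = 3Z^2 forces Y = Z = 0, so X = Y = Z = 0,
% which is not a point of the projective plane: the curve has no singular point.
% Points at infinity: Z = 0 gives X^3 = 0, i.e. the single point [0 : 1 : 0].
```

So the closure adds exactly one point, $[0:1:0]$, to the affine curve, and the partial derivatives have no common zero in the projective plane.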
In his studies, Abel cut the curve $y^2 = p(x)$ with a curve chosen from a linear family of curves 
and studied the sum of the integrals 


$\int_{q}^{q(1)} dx/y + \cdots + \int_{q}^{q(m)} dx/y$ 


where the points $q(1), \ldots, q(m)$ are the points of intersection of the curve with a variable member 
of the family. The result of this investigation is one of the classic theorems of the subject from 
which one may derive, among other things, the general solution of the differential equation of 
Euler’s studies. A quick classical discussion can be found in Lefschetz’s book [1]. For Abel, the 
methods of study were those of the analyst. 

In his 1857 paper on Abelian integrals, Riemann introduced the concept of the field of rational 
functions on a curve. For a projective curve this field consists of the restriction to the curve of all 
the functions with expressions P/Q where P and Q are homogeneous polynomials of the same 
degree. Along with this concept he considered the idea of birational isomorphism. Curves are 
birationally isomorphic if the function fields are isomorphic (over the complex numbers). Conics 
are all birationally isomorphic to the projective line, for example. 

In his discussion of the history of algebraic geometry (cf. [3]) van der Waerden attributes the 
origin of algebraic geometry to M. Noether and the Italian school. The reason for this is that with 
these gentlemen algebraic techniques came to the fore. Noether’s teacher Clebsch had begun the 
study of fields of rational functions on a surface. A surface is the set of zeros of a homogeneous 
polynomial in four variables. More formally stated, Clebsch began the study of the birational 
geometry of surfaces. Clebsch also had studied algebraic curves and had shown that the number of 
linearly independent Abelian integrals on a curve could be expressed in terms of the degree of the 
curve (i.e., the degree of the defining equation of the curve) and the number and type of 
singularities of the curve. This result expresses analytic (actually topological) invariants in terms 
of the algebraic (i.e., formal) properties of the curve. 

It was M. Noether’s position that Abelian integrals could be replaced with purely algebraic 
tools. He further asserted that it is the discussion of the field of functions of a curve or surface 
which is of principal interest. Noether used the concept of point group for his investigations of 
curves. A point group is a finite collection of points on a curve, each point with an assigned 
integer multiplicity. Modern geometers would call these point groups cycles while algebraists of 
the arithmetic school would call them divisors. As Abel had indicated the point groups of interest 
for the discussion of plane curves are those which are contained in the intersection of the curve 
with a second curve (with intersection multiplicities attached). For surfaces the natural generaliza- 
tion, used by Noether, was a discussion of the curves cut out on the surface by families of 
surfaces. 

The ideas of Clebsch and Noether were developed and polished by the Italian school of 
geometry. Founders of this school were Cremona, C. Segre and Bertini. A student of Veronese and 
Segre, Castelnuovo, used numerical invariants to classify those surfaces with function fields 
isomorphic to the field of functions of the projective plane. Castelnuovo’s student, Enriques, 
completed this birational classification of surfaces during the early part of this century. 

The work of the Italian school, brilliant though it was, had some very shaky algebraic 
foundations. In particular the intersection theory, that is, the assignment of intersection multiplici- 
ties, was suspect. A major contributor to the work of shoring up the algebraic foundations of the 
Italian school was Emmy Noether, Max Noether’s daughter. B. L. van der Waerden used Emmy 
Noether’s work to build a solid algebraic basis for the theory of intersection. A short history of 
this development and the relations between van der Waerden’s and E. Noether’s work was 
published recently by van der Waerden in [3]. 

The contribution to Seidenberg’s volume from van der Waerden is a discussion of the solution 
of one of the central issues of his intersection theory, using a powerful theorem proved by a 
student of Enriques, O. Zariski. 

Much of the theory of Abelian integrals is best explained by noting that the nonsingular 
algebraic curves of genus 1 have a group structure. The multiplication on the curve can be 
described by rational functions which gives the curve the structure of an algebraic group. M. 
Rosenlicht, another student of O. Zariski, discusses some of the elementary theory of algebraic 
groups in his contribution to Seidenberg’s collection. Abelian varieties, among which one finds the 
elliptic curves (curves of genus 1), are not discussed in this paper but Rosenlicht gives an elegant 
introduction to a second fundamental type of algebraic group, the linear algebraic group. 

The development of modern intersection theory was also heavily influenced by A. Weil’s work 
on the subject. Weil’s proof of the Riemann hypothesis for finite function fields used the 
intersection theory and correspondence principles of the Italian school. The principles were used 
to study curves defined over fields of nonzero characteristic which meant that an intersection 
theory in nonzero characteristic was crucial. The multiplicity theory of van der Waerden was 
applicable in nonzero characteristic; however, Weil’s development of intersection theory is distinct 
from van der Waerden’s and was firmly rooted in the theory of local rings and power series rings. 
It is Weil’s theory, and the local geometry of Zariski, Chevalley, and their disciples, that motivated 
a great deal of the commutative algebra that has developed since 1950. 

The study of the relation between local theory and the global results on multidimensional 
varieties has blossomed into the Serre-Grothendieck school wherein homological algebra, linear 
algebra run amuck, plays a key role. The article by J. Ohm, a student of Seidenberg, gives the 
reader a taste of this rich broth of commutative algebra and homological methods. 

S. Kleiman, another of the Zariski-Harvard school, is the author of the last paper in the 
collection. Kleiman leads the reader into the subject of the enumerative theory of conics where 
again intersection theory plays a major role. In this paper one is carried from the geometry of the 
ancient Greeks to the modern discussions of intersection theory by Grothendieck. 

It’s a lovely collection—enjoy. 


Conformal Mappings on Riemann Surfaces. By Harvey Cohn. Dover, New York, reprinted 1980. 


JAMES A. JENKINS 


In a book designed to provide an introduction to the theory of Riemann surfaces, the reviewer 
would expect to find an essential core of material consisting of the following items. 


I. The definition of a Riemann surface as a complex analytic manifold of dimension one; that is, 
a connected Hausdorff space covered by domains $D_i$, each of which has an associated homeomorphic 
mapping $\phi_i$ onto a plane disc, so that for overlapping $D_i$ and $D_j$, the map $\phi_j\phi_i^{-1}$ is a 
conformal mapping of $\phi_i(D_i \cap D_j)$ (i.e., a (1, 1) mapping by a regular (= holomorphic) function). 
The $\phi_i$ are called local uniformizing parameters, and the $D_i$ are called coordinate neighborhoods. 


II. The existence of meromorphic functions and differentials on a Riemann surface. A complex 
valued function $f$ defined on a subdomain of a Riemann surface is called meromorphic if, for 
every local uniformizing parameter $\phi_i$, $f\phi_i^{-1}$ is meromorphic. A meromorphic linear differential 
assigns to each coordinate neighborhood $D_i$ a meromorphic function $f_i(z)$ in $\phi_i(D_i)$ with the 
property that if $D_i$ and $D_j$ are overlapping coordinate neighborhoods, then $f_i(z) = f_j(w)\,dw/dz$, where 
$w = \phi_j\phi_i^{-1}(z)$. More generally, a linear differential assigns to each $D_i$ two functions $u$, $v$, which 
transform so that $u\,dx + v\,dy$ ($z = x + iy$) is invariant. A second order differential is given by one 
function $C$ which transforms so that $C\,dx\,dy$ is invariant. (The exterior product of two linear 
differentials gives a second order differential.) A harmonic function $h$ on a subdomain of a 
Riemann surface is one for which for each local uniformizing parameter $\phi_i$, $h\phi_i^{-1}$ is harmonic. A 
harmonic differential is locally the differential of a harmonic function. One can also speak of a 
regular mapping $f$ of a Riemann surface $R$ into a Riemann surface $S$, requiring that for 
appropriate local uniformizing parameters, $\phi_j f \phi_i^{-1}$ be regular. 


III. Covering surfaces and uniformization. For Riemann surfaces R and S, R is called a 
covering surface of S if there exists a regular mapping $\pi$ of R into S. The surface R is called an 
unbranched covering if $\pi$ is a local homeomorphism, an unbordered covering if for any path $\alpha$ in 
S with initial point A, and B in R covering A ($\pi B = A$), there is a path $\beta$ in R with $\pi\beta = \alpha$. The 
covering surfaces of S have an obvious hierarchical structure. More particularly, for unbranched, 
unbordered coverings this corresponds to subgroup structure of the fundamental group. For these 
there is a maximal element which is simply-connected—the universal covering surface. 

Uniformization of a Riemann surface R means finding a homeomorphic mapping into the 
sphere which provides a local uniformizing parameter for a neighborhood of each point. This is 
possible only if every simple closed curve on R separates R (such a surface is called “schlichtartig”). 
Koebe showed that in this case it is possible to prove uniformization by the study of a particular 
meromorphic differential. A simply connected Riemann surface is schlichtartig, however, so that 
the universal covering surface U of any Riemann surface R admits such a mapping. The image can 
be taken as the sphere, the plane or a disc, the first two occurring only in special cases. The 
covering transformations of U (self-conformal mappings $\phi$ with $\pi\phi = \pi$) correspond in the general 
case to linear transformations of the disc D. They form a group $\Gamma$ such that identifying points of 
D congruent under $\Gamma$, we obtain a Riemann surface conformally equivalent to R. 


IV. The Riemann-Roch Theorem. On any Riemann surface if we assign lower bounds for the 
orders of zeros and poles (counted as negative) for either meromorphic functions or meromorphic 
linear differentials at an isolated set of points, we obtain a complex linear space. On a compact 
Riemann surface each of these spaces has finite dimension. If we take bounds for the orders of 
zeros and poles of functions and of differentials satisfying a certain complementation condition, 
then the dimensions of the two spaces differ only by an amount depending on the sums of the 
bounds. This result has many important applications. 
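
For readers who want the statement in symbols, the usual modern formulation of the Riemann-Roch theorem is recorded below. The notation ($D$, $K$, $\ell$, $g$) is standard but is not used in the review itself, so it should be read as an assumption made for this illustration.

```latex
% X: compact Riemann surface of genus g.
% D = \sum_P n_P P: a divisor, i.e. integers n_P attached to finitely many points P.
% L(D):      meromorphic functions f with ord_P(f) \ge -n_P at every P;  \ell(D) = \dim L(D).
% \Omega(D): meromorphic differentials \omega with ord_P(\omega) \ge n_P at every P.
% K: divisor of any nonzero meromorphic differential, so \dim \Omega(D) = \ell(K - D).
\[
  \ell(D) - \ell(K - D) = \deg D - g + 1,
  \qquad \deg D = \sum_P n_P .
\]
```

Here the two spaces are the spaces of functions and of differentials with complementary bounds, and the difference of their dimensions depends on the bounds only through their sum $\deg D$.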

Additional contents of the book might take several forms. There are many directions in which 
one could provide extensions of these central results. On the other hand, space might be devoted 
to motivation (of the basic definitions and the entities to be developed) or the provision of 
prerequisite material. 

The author states a very special purpose for the book in hand, namely to provide an 
introduction to Riemann surface theory for a student whose preparation consists of “an ordinary 
semester of complex analysis.” Comparing the contents with those suggested above, we find that 
the existence proofs are given only for a finite triangulated Riemann surface (finite means either 
compact or equivalent to a subdomain of a compact surface bounded by a finite number of 
contours). Covering surfaces in the general sense are mentioned only very sketchily in an 
appendix, and there is no general statement concerning uniformization. Thus we have the 
interesting phenomenon of a book on Riemann surfaces with forty-five bibliographical references 
in which the name of Koebe does not appear. Not surprisingly there is no attempt to go beyond 
the basic material and evidently the author’s big thing is motivation. It is never stated explicitly 
that the book is a fairly faithful presentation of the author’s lectures but in the absence of contrary 
evidence it is reasonable to assume that. This being so, it seems a student would find the recitation 
of results in the first three chapters terribly boring. Particularly disappointing is the lacklustre 
treatment of analytic continuation since the best motivation for the basic concepts and definitions 
of the theory of Riemann surfaces is provided by a good treatment of this topic, for example like 
that in Ahlfors’ Complex Analysis (preferably the first or second edition; the third edition seems 
in need of a good dose of antibiotics). 

As for provision of prerequisite material, the author makes no organized attempt in this 
direction in fields other than complex variables. Indeed, with one exception, very little need be 
provided. Only the simplest concepts of point set topology and algebra are needed. (If one uses 
the method of orthogonal projection, a few basic results from Lebesgue theory are required. The 
author avoids this by using the alternating method from potential theory but with an attendant 
loss of generality.) The exception is the subject which the reviewer knows by the name of algebraic 
topology. One can hardly assume a knowledge of this, particularly in the somewhat old-fashioned 
form in which it is most useful in the study of Riemann surfaces, even with students more mature 
than those envisioned by the author. The author does give a rather offhand introduction to 
homology but it is quite distressing to find the presumably novice students being referred in 
cavalier fashion to “textbooks in topology” and “the literature.” 

The book is liberally provided with exercises. This is by no means an unmixed blessing since 
the author is consistently guilty of writing the book in the exercises. In the instance of a student 
who has expert guidance, a case can be made in support of this procedure (though not so as to 
convince the reviewer), but for one attempting to master the book independently it can be 
terminally frustrating, despite the promotional blurb on the back. (Can publishers be arraigned 
for lack of truth in advertising?) 

The book does contain certain nice insights, but they seem best suited to someone who has 
already a reasonable knowledge of the material (as opposed to the stated intended audience). On 
the other hand, there are entirely unnecessary misstatements (in distinction to “compromises”). 

If, then, it is desirable to learn of the theory of Riemann surfaces as early as possible, is there 
any alternative? The reviewer shares the author’s belief in the importance of the subject though 
not necessarily his sense of urgency. A student would be much better off first to take a good basic 
one-year course in Function Theory. This would provide a full understanding of the material in 
the first part of this book. Then, in one semester, it would be possible to give a very presentable 
introduction to the theory of Riemann surfaces. If two semesters could be employed, it would be 
possible to give a thoroughgoing, mathematically complete treatment extending far beyond the 
material of this book. 

For a person who wants only some idea of what Riemann surfaces are about, the book has far 
too much detail. Someone who really wishes to understand the mathematics of Riemann surfaces 
would not obtain the desired complete development and would get no clear idea of the nature of 
mathematical proof. In teaching a course on Riemann surfaces, to follow the author’s format 
would be at best unwise, and possibly disastrous. 


LETTERS TO THE EDITOR 


Material for this department should be sent to Professor P. R. Halmos, Department of Mathematics, Indiana 
University, Bloomington, IN 47405. 


Editor: 


A CUPM panel presented an excellent report on the topic of minimal mathematical com- 
petencies for college graduates (this MONTHLY, vol. 89, pp. 266-272). 

Generally, I am in agreement with the report, but there is one modification that I wish to 
recommend, namely, to change topic 14, which currently is “statistics and its dangers.” Lists 
published by CUPM panels tend to become the “bible” on a topic. In this case “verse” 14 should 
be changed to reflect a modern concept of statistics and promote a positive attitude about the 
subject. Just as mathematicians and accountants are not clearly separated by some (as indicated 
by the CUPM panel), so also are professional statisticians and sports “statisticians” confused. 
Moreover, the phrase “statistics and its dangers” indirectly propagates the misconception that one 
can prove anything with statistics. Obviously, statistics is no more dangerous than percent (topic 
9) or rounding off (topic 5). 

A danger in topic 14 now is its potential for misleading faculty who develop the proposed 
course. For example, an instructor might take 14 literally and inappropriately expend all effort 
showing cute examples of how to mislead others with the misuse of statistics. The CUPM panel’s 
recommendation that this course be taught by an experienced senior faculty member is very 
important. Furthermore, I recommend that this person have experience in teaching statistics. 

Taking a broader view, one sees that those at the forefront of statistics are beginning to discuss 
“Statistics as a Discipline.” In fact, this was the title of the main address presented by Paul 
Minton at the second annual Conference for Texas Statisticians held at Baylor University 
(April, 1982). The main point of Professor Minton’s talk concerned the idea of offering 
undergraduate degrees in statistics. 

There are two points I want to make. First, it is conceivable that a statistics course “for coping 
with life” could cover most of the topics in the list given by the CUPM panel. Second, topic 14 in 
a list of topics for this statistics course could be “mathematics and its dangers.” Need I say more? 

Since it would be cowardly to avoid a recommendation for a replacement for topic 14, I 
suggest: 


14. The Use and Interpretation of Statistics in Everyday Life. 


Danny W. Turner 
Department of Mathematics 
Baylor University 

Waco, Texas 76798 


MISCELLANEA 


With Apologies to John Milton 


94. John Milton wrote a famous sonnet on being blind. This modern version looks at a 
different kind of blindness. 


When I consider how my grants are spent, 

Ere half my days in these dark halls and wide; 
My teaching talent, which I’d love to hide, 
Lodged with me useless, though my soul more bent 
To serve therewith my Provost, and present 

My teaching ratings, lest he returning chide; 

“Do Deans exact much teaching, grants denied?” 
I fondly ask: the Provost, to prevent 

That murmur, soon replies, “Deans do not need 
Either that teaching or those grants. Who best 
Does his committee work, serves best. His state 
Is Deanly: thousands at his bidding speed 

And fly to Washington, disdaining rest; 

They also serve who only loaf and wait.” 


—Edwin Hewitt 


ANSWER TO “PHOTO” ON PAGE 100 


The subject is Max Zorn; the picture was taken in the summer of 1981. He is alive and well and 
very much a part of the mathematical community in Bloomington, Indiana. 
Carus Mathematical Monograph, #18. 
Third Edition 


xi + 232 pages. Hardbound. 
“A gold mine of interesting, uncommon insight and examples. . .an orderly composition of 
24 partially independent elegant snapshots from the theory of sets and real functions.” 
Lynn A. Steen, commenting on the second edition of “A Primer of Real Functions” in THE 
AMERICAN MATHEMATICAL MONTHLY, 1974. 

The Third Edition includes the most significant revisions to date of this classic 
volume. Terminology has been modernized, proofs improved, and sections have been 
completely rewritten. Much new material has been added. The Primer contains the basic 
material on functions, limits, and continuity that students ought to know before starting a 
course in real or complex analysis. It is a good place for a student (or anyone else) to see 
techniques of real analysis at work in uncomplicated but interesting situations. 

The author says, “My principal objective was to describe some fascinating 
phenomena, most of which are not deep or difficult but rarely or never appear in text books 
or in standard courses.” Boas has achieved that objective. The result is a book that should 
be on every teacher’s book shelf, in every college library, and a part of every serious 
mathematics student’s experience. 


Order your copy now from: 


THE MATHEMATICAL ASSOCIATION 
OF AMERICA 
Ivan Niven, distinguished author of several books on number theory and 
probability, has compiled the basic elementary techniques for solving 
maxima and minima problems. Since many books and courses already cover 
calculus and linear programming techniques, the author deliberately omits 
these areas from his discussions and concentrates instead on methods in 
algebra and geometry not so widely known. These methods are organized 
according to the mathematical ideas used, and many chapters can be read 
independently without reference to what precedes or follows. Some of the 
problems presented are left for the reader to solve with sketches of solutions 
given in the later pages. 

The book is written for an audience at or near the maturity level of 
second- or third-year college students with a good working knowledge of 
precalculus mathematics. Although calculus is not a prerequisite, a prior 
knowledge of that subject will enhance the reader’s comprehension. An 
excellent sourcebook in the area of maxima and minima, the book will serve 
as a textbook or as enrichment material for the talented undergraduate. 


ALGEBRA, GEOMETRY, AND ALGEBRAIC GEOMETRY: 
SOME INTERCONNECTIONS 


KEITH M. KENDIG 
Department of Mathematics, Cleveland State University, Cleveland, OH 44115 


1. Introduction. For centuries, men have been intrigued by the interplay between algebra and 
geometry; many important advances in mathematics have essentially come down to linking up the 
two in one way or another. The ancient Greeks established such a link when they found 
straightedge-and-compass constructions for the sum, difference, product, quotient and square root 
of lengths. Closer to our own time, Poincaré did it when he associated groups (algebra) with 
topological manifolds (geometry); of course this has grown into the whole area of algebraic 
topology. 

Possibly the single greatest step in connecting up algebra and geometry was Descartes’ 
introduction in 1637 of Cartesian geometry (or “analytic geometry,” as Lacroix named it in 1792). 
It laid the mathematical foundation for the calculus and Newtonian physics a half century later. It 
put algebra and geometry on a more equal footing, and because it set up a translation between 
algebra and geometry, it gave our imagination “two ends,” an algebraic one and a geometric 
one; geometric insight could often be translated into an algebraic one, and vice versa. Still today, 
an important part of a student’s exposure to analytic geometry consists in making the translations 
of algebra-to-geometry (“sketch $2x^2 + xy + 3y^2 = 0$”), and geometry-to-algebra (“find an equation 
of the set of points equidistant from the y-axis and the point $(2, 3)$”). 

Powerful though Descartes’ union of algebra and geometry is, it is only the beginning of a 
much larger story. In most textbooks today, analytic geometry is done in the real setting of $\mathbb{R}^2$ or 
$\mathbb{R}^3$. But one may replace these by other objects (complex $n$-space, $\mathbb{C}^n$, is just one example), and 
try to do analytic geometry in this new context. The extremely fundamental nature of this 
“algebra-geometry” linkage asserts itself, and unexpected connections, inspiration and enlightenment 
come pouring out. In a sense, exploring the cartesian-geometry type of connection between 
geometry and algebra (especially at the polynomial level) is “algebraic geometry.” 


2. A New Setting for Analytic Geometry. Analytic geometry was born, and grew, over the 
reals. But such a context has inherent in it certain limitations; in particular, it fails to give us 
“closure,” in both an algebraic and a geometric sense. (We explain in a moment.) Each of these 
can lead to “exceptions.” Since important parts of the history of mathematics can be regarded as 
the process of removing various kinds of exceptions, it is interesting to extend space to one that is 
closed algebraically and geometrically, and then do analytic geometry in it. This leads to “classical 
algebraic geometry.” We explain. 

First, $\mathbb{R}$ fails to be closed algebraically. Thus, for example, the real polynomial $x^2 - a$ has a 
zero in $\mathbb{R}$ (visually, the parabola $y = x^2 - a$ intersects the x-axis $\mathbb{R}_x$) except when $a$ is negative. 
Extending $\mathbb{R}$ to its algebraic completion $\mathbb{C}$ removes this exceptional behavior, and now any 
polynomial (even with coefficients in $\mathbb{C}$) has a zero (i.e., the graph in $\mathbb{C}_{xy}$ of $y = p(x)$ intersects 
the x-axis $\mathbb{C}_x$), except when the polynomial is a nonzero constant. (This is the Fundamental 
Theorem of Algebra.) This last exception arises because the two parallel copies of $\mathbb{C}$ don’t meet. 
But this kind of exception can be removed also, by completing geometrically, adding a “point at 
infinity” to each complex line $\mathbb{C}$ in $\mathbb{C}^2$. For this, let $ax + by = 0$ ($a, b \in \mathbb{C}$) be any complex 
1-space of $\mathbb{C}^2$. We add an “ideal” point $P_\infty$ to it; we may think of this extension of $\mathbb{C}$ as a 
“Riemann sphere.” Now add this same point $P_\infty$ to each of the complex lines $ax + by = c$ 


Keith Kendig is an algebraic geometer with a side interest in computers. He received his Ph.D. from UCLA and 
spent two years at the Institute for Advanced Study in Princeton; he is the author of Elementary Algebraic Geometry 
(Springer-Verlag, Graduate Texts in Mathematics vol. 44). He also coauthored a book with his wife Joan on 
($c \in \mathbb{C}$) (i.e., to all the lines parallel to $ax + by = 0$). Since $P_\infty$ is the only point common to these 
lines, it is the point in which any two meet. Adding one such point to (all lines parallel to) each 
1-space $ax + by = 0$ ($a, b \in \mathbb{C}$) then extends $\mathbb{C}^2$ to what is called the complex projective plane 
$\mathbb{P}^2(\mathbb{C})$; it is manufactured so that any two distinct completed lines in it meet in exactly one point. 
It turns out that all the added points at infinity themselves form a complex line called “the line at 
infinity”; this line intersects any other line in one point. One can now give $\mathbb{P}^2(\mathbb{C})$ a topology by 
first observing that the points (respectively lines) of $\mathbb{P}^2(\mathbb{C})$ may be identified with the 1-subspaces 
(respectively 2-subspaces) of $\mathbb{C}^3$. (Thus “any two distinct lines of $\mathbb{P}^2(\mathbb{C})$ intersect in a point” 
translates to “any two distinct 2-subspaces of $\mathbb{C}^3$ intersect in a 1-subspace.”) One may now 
identify “open neighborhood in $\mathbb{P}^2(\mathbb{C})$” with “the set of 1-subspaces in $\mathbb{C}^3$ intersecting an open 
set in $\mathbb{C}^3$.” This topology on $\mathbb{P}^2(\mathbb{C})$ also induces one on the completed lines; they are in fact 
topological spheres. 
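
The identification of lines with 2-subspaces can be tried out numerically. The following is a minimal sketch, not taken from the article: it assumes rational coefficients (exact arithmetic) as a stand-in for complex ones, writes the completed line $ax + by = c$ as $aX + bY - cZ = 0$ with $x = X/Z$, $y = Y/Z$, and uses the cross product to find the 1-subspace common to two 2-subspaces. The names `line`, `cross` and `meet` are illustrative only.

```python
from fractions import Fraction

def cross(u, v):
    """Cross product of two vectors with three coordinates."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def line(a, b, c):
    """Coefficient vector of a*X + b*Y - c*Z = 0, the completed line ax + by = c."""
    return (Fraction(a), Fraction(b), -Fraction(c))

def meet(l1, l2):
    """Homogeneous coordinates (X, Y, Z) of the common point of two distinct lines."""
    return cross(l1, l2)

# Two parallel lines x + 2y = 1 and x + 2y = 5: the common point has Z = 0,
# so it lies on the line at infinity.
print(meet(line(1, 2, 1), line(1, 2, 5)))

# Two non-parallel lines x + 2y = 1 and x - y = 4: dividing by Z gives (3, -1).
X, Y, Z = meet(line(1, 2, 1), line(1, -1, 4))
print(X / Z, Y / Z)
```

For the parallel pair the result is proportional to $(b, -a, 0)$, which is the single point $P_\infty$ shared by all lines parallel to $ax + by = 0$.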

Now to do analytic geometry in $\mathbb{P}^2(\mathbb{C})$, we surely want to be able to sketch curves there, just as 
we did in $\mathbb{R}^2$. The idea is simple: first, any nonconstant polynomial $p(x, y)$ in $\mathbb{C}[x, y]$ defines an 
algebraic curve (or simply a curve) $C$ in $\mathbb{C}_{xy}$, consisting of the set of all points in $\mathbb{C}_{xy}$ at which 
$p(x, y)$ is zero. Then the topological closure of $C$ in $\mathbb{P}^2(\mathbb{C})$ is called the (algebraic) curve in $\mathbb{P}^2(\mathbb{C})$ 
defined by $p(x, y)$. At this new level, sketching some simple curves will throw some unexpected 
light on their real analogues. 

A little later, we will see how extending $\mathbb{C}^2$ to $\mathbb{P}^2(\mathbb{C})$ not only removes an exception from the 
statement of the Fundamental Theorem of Algebra, but also leads to a symmetric and much more 
general form of the theorem. 


3. A Basic Example. In this section we sketch the curve in $\mathbb{P}^2(\mathbb{C})$ defined by $y - x^2$. We first 
sketch the “complex parabola” $y = x^2$ in $\mathbb{C}_{xy}$; then we take its topological closure in $\mathbb{P}^2(\mathbb{C})$. We 
begin by noting that the graph in $\mathbb{C}_{xy}$ of $y = x^2$ is homeomorphic to $\mathbb{C}_x$. Therefore the graph is a 
real two-dimensional surface in $\mathbb{C}_{xy} = \mathbb{R}^4$. To find out what this real surface looks like, we write 
$x = x_1 + ix_2$, $y = y_1 + iy_2$ and $\mathbb{C}^2 = \mathbb{R}_{x_1x_2y_1y_2}$. Then $y = x^2$ becomes $y_1 + iy_2 = (x_1 + ix_2)^2$. 
Equating real and imaginary parts gives us 

$$
(*)\qquad \begin{cases} y_1 = x_1^2 - x_2^2 \\ y_2 = 2x_1x_2 \end{cases}
$$

Our surface, then, is the set of points in $\mathbb{R}_{x_1x_2y_1y_2}$ simultaneously satisfying the equations in (*). 
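
The pair of equations (*) can be checked on sample points. The sketch below is only an illustration under stated assumptions: it uses Python's built-in complex numbers with floating-point tolerance, and the helper names (`real_parts`, `satisfies_star`, `slice_point`) are made up for this example.

```python
import random

def real_parts(x):
    """Return (y1, y2) for y = x**2, where x = x1 + i*x2 is a Python complex."""
    y = x * x
    return y.real, y.imag

def satisfies_star(x1, x2, y1, y2, tol=1e-9):
    """Check the two real equations (*): y1 = x1^2 - x2^2 and y2 = 2*x1*x2."""
    return abs(y1 - (x1**2 - x2**2)) < tol and abs(y2 - 2 * x1 * x2) < tol

def slice_point(a, x1):
    """Point (y1, y2) of the surface lying in the slice x2 = a, above the value x1."""
    return x1**2 - a**2, 2 * a * x1

random.seed(0)  # fixed seed so the run is reproducible
samples = [complex(random.uniform(-2, 2), random.uniform(-2, 2)) for _ in range(5)]
all_ok = all(satisfies_star(x.real, x.imag, *real_parts(x)) for x in samples)
print(all_ok)
```

Here `slice_point(0, x1)` traces the original real parabola, and for $a \neq 0$ the second coordinate $2ax_1$ shows that the slice curve is tilted out of the $y_2 = 0$ plane.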

Because this surface lies in $\mathbb{R}^4$, it is hard to visualize directly. Now an inhabitant of “Flatland” 
might visualize an object in $\mathbb{R}^3$ by letting one coordinate be “time,” and piecing together the 
various slices corresponding to time = a constant. We can do the same, letting $x_2$, for example, 
play the role of time and looking at the part of the surface in each three-dimensional slice. Each 
such part is a real curve; we will then fit the curves together to get the surface. 

We consider first $x_2 = 0$. The equation $y_2 = 2x_1x_2$ of (*) becomes $y_2 = 0$, so the curve lies in 
the plane $\mathbb{R}_{x_1y_1}$. The other equation in (*) becomes $y_1 = x_1^2$; therefore the curve for $x_2 = 0$ is just 
the original real parabola in $\mathbb{R}_{x_1y_1}$. When $x_2 = a \neq 0$, the equations (*) again define a real 
parabola in the corresponding 3-dimensional slice; it lies in the plane $y_2 = 2ax_1$. Now notice that 
our basic equation $y = x^2$ implies $|y|/|x| \to \infty$ as $|x| \to \infty$. From this one can easily deduce that 
for each $x_2 \in \mathbb{R}$, the corresponding real parabolas all have this property in common: each branch 
approaches the point at infinity added to $\mathbb{C}_y$. (Let us call it $P_\infty$.) After taking the topological 
closure, we see that for each $x_2 = a \in \mathbb{R}$, the corresponding parabola together with $P_\infty$ forms a 
loop; any two such loops touch at the one point $P_\infty$. We may topologically redraw all this as in 
Fig. 1 where, say, the heavily-drawn curve corresponds to the original real parabola, the 
lightly-drawn curves correspond to $x_2 = a > 0$, and the dotted curves on the other side of the 
surface, to $x_2 = a < 0$. 

Thus the variety in $\mathbb{P}^2(\mathbb{C})$ defined by $y = x^2$ is topologically a sphere. In the ordinary real part, 
we only see a very small part of this: one loop, minus $P_\infty$. 


FIG. 1 


4. Some More Examples. We may apply the above method to sketch other graphs. Let’s look 
briefly at the complex circle defined in $\mathbb{C}_{xy}$ by $x^2 + y^2 = 1$. This equation yields these two real 
equations: 

$$
(**)\qquad \begin{cases} x_1^2 - x_2^2 + y_1^2 - y_2^2 = 1 \\ 2x_1x_2 + 2y_1y_2 = 0 \end{cases}
$$


Consider first $x_2 = 0$. Then $2x_1x_2 + 2y_1y_2 = 0$ becomes $y_1y_2 = 0$, which implies that $y_1 = 0$ or 
$y_2 = 0$. From this it follows that the part of our surface in the 3-dimensional slice $x_2 = 0$ consists 
of the union of the real circle $x_1^2 + y_1^2 = 1$ in $\mathbb{R}_{x_1y_1}$ and the hyperbola $x_1^2 - y_2^2 = 1$ in $\mathbb{R}_{x_1y_2}$. The 
branches of the hyperbola intersect the line at infinity in two points (the two points in which its 
asymptotes intersect the line at infinity). Adding these two points makes the hyperbola into a 
topological circle, so we now have two topological circles touching at two points. We let the circle 
$x_1^2 + y_1^2 = 1$ be the equator of a sphere, and the completed hyperbola, a great circle passing 
through the north and south poles. These two great circles divide the sphere into four quarters. 
One may show that the curves corresponding to $x_2 = a > 0$ fill in two opposite quarters; the 
curves corresponding to $x_2 = a < 0$ fill in the other two quarters. We see from this that the real 
part (what we see in ordinary analytic geometry) is just the tip of an iceberg; working in $\mathbb{P}^2(\mathbb{C})$ 
reveals the rest of it to us. 
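
The slice $x_2 = 0$ of the complex circle can be sampled directly. The following sketch is an illustration only: the parametrizations (trigonometric for the real circle, hyperbolic for one branch of the hyperbola, and complex cosine and sine for a general point) and all function names are assumptions chosen for this example, and comparisons are made up to floating-point tolerance.

```python
import cmath
import math

def star_star(x, y):
    """Left-hand sides of the two real equations for x^2 + y^2 = 1."""
    x1, x2, y1, y2 = x.real, x.imag, y.real, y.imag
    return (x1**2 - x2**2 + y1**2 - y2**2, 2 * x1 * x2 + 2 * y1 * y2)

def circle_point(t):
    """Point of the real circle: x2 = 0 and y2 = 0."""
    return complex(math.cos(t), 0.0), complex(math.sin(t), 0.0)

def hyperbola_point(t):
    """Point of the branch x1 >= 1 of the hyperbola x1^2 - y2^2 = 1: x2 = 0 and y1 = 0."""
    return complex(math.cosh(t), 0.0), complex(0.0, math.sinh(t))

def generic_point(z):
    """A point of the complex circle outside the slice when z is not real."""
    return cmath.cos(z), cmath.sin(z)

for point in (circle_point(0.8), hyperbola_point(1.3), generic_point(0.3 + 0.7j)):
    print(star_star(*point))  # each pair should be close to (1, 0)
```

The circle and the hyperbola share the two points $(\pm 1, 0)$, obtained from `circle_point` at $t = 0, \pi$ and from the hyperbola at its vertices.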

This is a very powerful approach, because often the real part hardly reveals even the tip of the 
iceberg! For example $x^2 + y^2 = 0$, a “circle of radius 0,” is just a single point in $\mathbb{R}^2$. Looked at in 
$\mathbb{P}^2(\mathbb{C})$, we see that the real circle $x_1^2 + y_1^2 = 1$ of the previous example has now shrunk to a point, 
and the hyperbola $x_1^2 - y_2^2 = 1$ has reduced to its asymptotes $x_1^2 - y_2^2 = 0$. It appears that the 
equator of our sphere has been squeezed to a point, leaving two spheres touching at that point. Is 
this actually the topological picture of a “circle of radius 0”? We may look at it this way: Since 
$x^2 + y^2$ equals $(x + iy)(x - iy)$, and since this product is zero iff one of the factors is, we see that 
our 0-radius circle is the union of two complex lines intersecting at the origin. After adding the 
point at infinity to each such line, we get the union of two spheres touching at one point. Notice 
that as soon as $r$ becomes 0, $x^2 + y^2 - r^2$ becomes reducible over $\mathbb{C}$ (one can verify that 
$x^2 + y^2 - r^2$ is irreducible whenever $r \neq 0$); and, in beautiful consonance with this, the associated 
topological object becomes in a sense “reducible” (two touching spheres) instead of a single 
“irreducible” sphere. This whole idea is revealed to us when we work in $\mathbb{P}^2(\mathbb{C})$. It was kept secret 
in $\mathbb{R}^2$. 

But we need not stop here! How about a “circle of imaginary radius,” say $x^2 + y^2 = -1$? In 
$\mathbb{R}^2$ we get the empty set. However in $\mathbb{P}^2(\mathbb{C})$ we once again get a full picture: $x^2 + y^2 + 1$ is 
irreducible, and the “sketch” turns out again to be topologically a sphere. In fact, any quadratic 
irreducible over $\mathbb{C}$ gives us a sphere, and any quadratic reducing to two distinct factors gives us 
two touching spheres. 

5. Donuts and Coffee Cups. The reader might well wonder if every curve in $\mathbb{P}^2(\mathbb{C})$ defined by 
a polynomial consists of a sphere or spheres. Or can we get donuts, two-handled spheres, and even 
surfaces of higher genus? 

Let us consider the exercise, often encountered in ordinary analytic geometry, of graphing 
$y^2 = x(x^2 - 1)$. In $\mathbb{R}^2$, the graph is “symmetric about the x-axis,” and there are two “excluded 
regions” lying above the two open intervals $(0, 1)$ and $(-\infty, -1)$. This real part appears in the 
$x_1y_1$-plane of Fig. 2. At time $x_2 = 0$ there’s also a “mirror image” of that curve in the $x_1y_2$-plane. 
(Those exceptional “excluded regions” really aren’t excluded after all.) 


FIG. 2 


The two branches on the right and left meet in the point at infinity $P_\infty$ which was added to $\mathbb{C}_y$ 
(since $|y|/|x| \to \infty$ as $|x| \to \infty$). We now have three topological loops, each touching the other 
two. These can be bent to lie on a torus (Fig. 3); it turns out that the other curves corresponding 
to $x_2 \neq 0$ fill in the rest of the torus. Thus, of the whole torus, all we see in the usual real part are 
the two dark loops in Fig. 3, without $P_\infty$. 

One can similarly show, for instance, that 


$$
y^2 = x(x^2 - 1^2)(x^2 - 2^2) \cdots (x^2 - g^2)
$$


defines in $\mathbb{P}^2(\mathbb{C})$ a closed orientable surface of genus $g$. 

Through these examples, we see that doing analytic geometry in $\mathbb{P}^2(\mathbb{C})$ on the one hand reveals 
a fuller picture of curves defined by polynomials, and on the other hand leads to an appreciation 
that orientable topological surfaces (donuts, coffee cups, pretzels and all) aren’t just “made up,” 
but arise in a very natural way. For further details and pictures on “algebraic curve sketching” 
over $\mathbb{C}$, see [7]. 


6. A Pearl from Algebraic Geometry. We promised earlier that our new viewpoint would lead 
us to a much more general form of the Fundamental Theorem of Algebra. To see how, let’s briefly 
consider once again the first example of the parabola $y = x^2$, looked at in the complex setting. 
The Fundamental Theorem of Algebra guarantees that the complex line $\mathbb{C}_x$ intersects the 
parabola in two points, counted with multiplicity. “Multiplicity two” means, intuitively, that if we 
translate $\mathbb{C}_x$ a little into a new line $C \neq \mathbb{C}_x$, then the point of intersection separates into two 
distinct points, which then coalesce as $C$ returns to its original position $\mathbb{C}_x$. (This idea can be 
generalized almost beyond recognition.) More generally, one can show, using a linear change of 
coordinates, that any complex line of the form $y = ax + b$ ($a, b \in \mathbb{C}$) intersects the parabola in 
two points. Question: Does every complex line in $\mathbb{P}^2(\mathbb{C})$ intersect it in two points? Answer: Yes. 
For the branches of the parabola approach a point at infinity, $P_\infty$ (the one added to $\mathbb{C}_y$); it can 
then be shown that any line in $\mathbb{P}^2(\mathbb{C})$ defined by $x = a$ intersects the completed parabola in two 
points (at $(a, a^2)$ in $\mathbb{C}_{xy}$ and at $P_\infty$). The only line left to consider is the one at infinity. Looking 
at it as the limit of, say, $y = c$ as $|c| \to \infty$, one can see that the line at infinity and the parabola 
intersect at $P_\infty$ with multiplicity two. (Relative to the topology we put on $\mathbb{P}^2(\mathbb{C})$, the two points of 
intersection of $y = c$ with the parabola coalesce to $P_\infty$ as $|c| \to \infty$.) 

Note that the degree of a parabola is two, the degree of a line is one, and that $2 \cdot 1$ = the 
number of points of intersection. The general form of the Fundamental Theorem of Algebra we 
promised is “Bézout’s theorem.” Before stating it we need only remark that the idea of 
determining the multiplicity of intersection by translating one of the curves a bit, generalizes to 
arbitrary curves $C_1$, $C_2$ in $\mathbb{P}^2(\mathbb{C})$ defined by nonconstant polynomials $q_1(x, y)$ and $q_2(x, y)$. This 
may be done as follows. Let an open neighborhood $U$ in some $\mathbb{C}^n$ parametrize the set of all 
“small” changes in the (nonzero) coefficients of $q_1$. Then for any isolated point $P$ in $C_1 \cap C_2$, 
there is a nowhere dense subset of $U$ off which this holds: The corresponding curve $C_1'$ intersects 
$C_2$, near $P$, in exactly $m$ points. We then say $C_1$ and $C_2$ intersect at $P$ with multiplicity $m$. 


BÉZOUT’S THEOREM. Let $q_1$, $q_2$ be distinct, irreducible polynomials in $\mathbb{C}[x, y] \setminus \mathbb{C}$; let them define 
curves $C_1$, $C_2$ in $\mathbb{P}^2(\mathbb{C})$. Then $C_1$ and $C_2$ intersect in $(\deg q_1) \cdot (\deg q_2)$ points, counted with 
multiplicity. 


REMARKS: 

(1) In the Fundamental Theorem of Algebra, $q_1$ and $q_2$ are very special: $q_1$ is of the form 
$y - p(x)$, i.e., it defines the graph of a function $\mathbb{C}_x \to \mathbb{C}_y$; and $q_2$ is $y$. There is no need for such 
restrictions in Bézout’s Theorem. Both curves are put on an equal and more general footing. 

(2) One can easily draw in $\mathbb{R}^2$ two ellipses intersecting in $2 \cdot 2 = 4$ points. As one ellipse is 
translated with respect to the other, we begin to “lose” points of intersection in $\mathbb{R}^2$; but not in 
$\mathbb{P}^2(\mathbb{C})$, where the “lost” points simply have become complex. Thus two complex ellipses, disjoint in 
$\mathbb{R}_{xy}$, extend to curves in $\mathbb{P}^2(\mathbb{C})$ having four complex points of intersection. 

(3) Bézout’s Theorem can also be stated for polynomials which are not necessarily irreducible: 
if neither $q_1$ nor $q_2$ has repeated factors, and if $q_1$, $q_2$ share no nonconstant factors, then $C_1$ and 
$C_2$ intersect in $(\deg q_1) \cdot (\deg q_2)$ points. 
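
Remark (2) can be illustrated with a concrete pair of ellipses. The example below is a sketch under assumptions not found in the article: the two ellipses $x^2 + 4y^2 = 1$ and $4(x - d)^2 + y^2 = 1$ were chosen so that eliminating $y^2$ leaves the quadratic $15x^2 - 32dx + (16d^2 - 3) = 0$, which the standard library can solve; the function names are hypothetical, and tangent positions (where points coalesce) are not handled specially.

```python
import cmath

def ellipse_intersections(d):
    """Complex intersection points of x^2 + 4y^2 = 1 and 4(x - d)^2 + y^2 = 1."""
    # Substituting y^2 = (1 - x^2)/4 into the second equation gives
    # 15x^2 - 32dx + (16d^2 - 3) = 0.
    a, b, c = 15, -32 * d, 16 * d * d - 3
    root = cmath.sqrt(b * b - 4 * a * c)
    points = []
    for x in ((-b + root) / (2 * a), (-b - root) / (2 * a)):
        y = cmath.sqrt((1 - x * x) / 4)
        points += [(x, y), (x, -y)]
    return points

def count_real(points, tol=1e-9):
    """Number of points whose two coordinates are both real (up to tolerance)."""
    return sum(1 for x, y in points if abs(x.imag) < tol and abs(y.imag) < tol)

for d in (0, 5):
    pts = ellipse_intersections(d)
    print(d, len(pts), count_real(pts))
```

With $d = 0$ all four points are real; with $d = 5$ the ellipses are disjoint in the real plane, yet the four intersection points are still there, with purely imaginary $y$-coordinates.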


166 KEITH M. KENDIG [March 


Bézout’s Theorem is named after the French mathematician Étienne Bézout (1730-1783). 
Although his main occupation was mathematics examiner in the French Naval Academy (and 
later in the Royal Artillery), he wrote two textbooks which became so popular they were 
translated into several languages and ran many editions. Bézout, who worked on “elimination 
theory” using linear equations, stated his theorem in terms of a determinant derived from the 
coefficients of $q_1$ and $q_2$, called the “resultant” of $q_1$ and $q_2$. 


7. Some Additional Comments on Intersection Multiplicity. Bézout’s Theorem hinges on the 
concept of intersection multiplicity. This concept turns out to be an excellent example of 
the “three blind men touching an elephant” phenomenon in mathematics: one big idea (the 
elephant) may appear to be very different to different people (and different generations). For 
instance, our definition of intersection multiplicity above looks almost totally geometric. In this 
section we give for curves a second and a third definition; one of them appears very algebraic, the 
other, purely topological. Yet over C, all definitions are equivalent and each one provides a basis 
for generalizing in a different direction. 

For the algebraic approach we start with the example of the complex parabola $y - x^2 = 0$ 
intersecting $\mathbb{C}_x$. Now $\mathbb{C}_x$ may be parametrized about $(0,0)$ by $x = t$, $y = 0$. Substituting into 
$y - x^2$ gives $-t^2$. The order of the polynomial $-t^2$ is two, which is just the intersection 
multiplicity of $y - x^2$ with $\mathbb{C}_x$! Any other complex line $L$ through $(0,0)$ intersects the parabola 
there in multiplicity one. In agreement with this, $L$ may be parametrized by $x = at$, $y = bt$ 
($b \neq 0$); substituting this into $y - x^2$ gives $bt - a^2t^2$ which has order one at 0. 

One could just as well substitute a parametrization of the parabola into the line. Parametrize 
$y = x^2$ by $x = t$, $y = t^2$, and plug into $Ax + By$. We get $At + Bt^2$ which has order one except 
when $A = 0$ (that is, except when $L$ is $\mathbb{C}_x$); when $A = 0$ the order is two. As another example, 
consider two parabolas tangent at $(0,0)$, say, $y - x^2 = 0$ and $y - 4x^2 = 0$. Substituting $x = t$, 
$y = t^2$ into $y - 4x^2$ gives $-3t^2$, so the parabolas intersect in a double point at $(0, 0)$. 

The general idea here is this: to find the intersection multiplicity $m_P$ of two curves $C_1$ and $C_2$ at 
$P$, parametrize one curve about $P$, and substitute these parametric equations into the equation of 
the other curve. The resulting order at zero is $m_P$. A caveat: use only “reduced” parametrizations. 
Thus $\mathbb{C}_x$ could be parametrized about $P = (0,0)$ by $x = t^2$, $y = 0$ (or the cusp curve $y^2 = x^3$ by 
$x = t^4$, $y = t^6$). But for correct $m_P$, first reduce the parametrizations to lowest terms, getting 
$x = t$, $y = 0$ (or $x = t^2$, $y = t^3$). 
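
The substitution method just described can be carried out mechanically for polynomial parametrizations. The following is a minimal sketch, assuming a polynomial in $x, y$ is stored as a dictionary from exponent pairs to coefficients and a parametrization as two coefficient lists in $t$ (lowest degree first); these representations and the names `substitute` and `order` are choices made for this illustration.

```python
def poly_mul(p, q):
    """Product of two polynomials in t given as coefficient lists."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_pow(p, n):
    """n-th power of a polynomial in t (n >= 0)."""
    out = [1]
    for _ in range(n):
        out = poly_mul(out, p)
    return out

def substitute(f, xt, yt):
    """Substitute x = xt(t), y = yt(t) into f = {(i, j): coeff of x^i y^j}."""
    total = [0]
    for (i, j), c in f.items():
        term = [c * a for a in poly_mul(poly_pow(xt, i), poly_pow(yt, j))]
        n = max(len(total), len(term))
        total = [(total[k] if k < len(total) else 0) +
                 (term[k] if k < len(term) else 0) for k in range(n)]
    return total

def order(p):
    """Lowest power of t with a nonzero coefficient; None if p is identically zero."""
    for k, a in enumerate(p):
        if a != 0:
            return k
    return None

parabola = {(0, 1): 1, (2, 0): -1}       # y - x^2
steeper = {(0, 1): 1, (2, 0): -4}        # y - 4x^2
print(order(substitute(parabola, [0, 1], [0])))       # x = t, y = 0
print(order(substitute(parabola, [0, 2], [0, 3])))    # x = 2t, y = 3t
print(order(substitute(steeper, [0, 1], [0, 0, 1])))  # x = t, y = t^2
```

Feeding in the non-reduced parametrization $x = t^2$, $y = 0$ of $\mathbb{C}_x$ yields order four instead of two, which is the reason for the caveat about reduced parametrizations.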

It turns out that parametrizations can be found for any algebraic curve about any point, 
though at a “cross point” or “singularity,” one may need more than one parametrization there. 
For example, the curve $y^2 = x^2(x + 1)$ crosses itself at $(0,0)$ (looking like an “alpha” in $\mathbb{R}_{xy}$). 
Each part, or “branch” has its own parametrization. (Such parametrizations at singular points 
may be power series, but that doesn’t matter—the concept of order is still the same.) One then 
just adds up the multiplicities due to the different branches through the point. An elementary 
proof of the existence of such parametrizations, which includes an explicit method for finding 
them, is given in [9, Chap. IV]. 

Now let’s turn to the topological viewpoint. Let $S^3_\epsilon$ denote the three-sphere $x_1^2 + x_2^2 + y_1^2 + y_2^2 
= \epsilon$ in $\mathbb{C}_{xy}$ ($= \mathbb{R}_{x_1x_2y_1y_2}$); $\epsilon > 0$ is assumed small. $S^3_\epsilon$ intersects $\mathbb{C}_x$ in a real circle $C$, and $S^3_\epsilon$ 
intersects $y = x^2$ in a topological circle which winds around $C$ twice! The intersections of $S^3_\epsilon$ 
with $\mathbb{C}_x$ and $\mathbb{C}_y$ have mutual linking number one; and the intersections of $S^3_\epsilon$ with $y = x^2$ and 
$y = 4x^2$ have linking number two. For a curve like $y^4 = x^9$, for all sufficiently small $\epsilon$, $S^3_\epsilon \cap 
(y^4 = x^9)$ winds around $S^3_\epsilon \cap \mathbb{C}_x$ nine times. But if $L$ is any other complex line through $(0,0)$, 
then for all sufficiently small $\epsilon$, $S^3_\epsilon \cap (y^4 = x^9)$ winds around $S^3_\epsilon \cap L$ exactly four times. The 
reader can easily check that these linking numbers agree with the multiplicities obtained using the 
substitution method above. 
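
The suggested check is short enough to automate for lines through the origin. This sketch assumes the reduced parametrization $x = t^m$, $y = t^n$ with $m \neq n$ (so no cancellation between the two terms can occur); for $y^4 = x^9$ that is $x = t^4$, $y = t^9$. The function name `line_order` is hypothetical.

```python
def line_order(A, B, m, n):
    """Order at t = 0 of A*x + B*y after substituting x = t**m, y = t**n (m != n)."""
    exponents = [e for coeff, e in ((A, m), (B, n)) if coeff != 0]
    return min(exponents)

# The curve y^4 = x^9, parametrized by x = t^4, y = t^9.
print(line_order(0, 1, 4, 9))   # the line y = 0, i.e. the x-axis
print(line_order(1, 0, 4, 9))   # the line x = 0
print(line_order(2, -3, 4, 9))  # a line 2x - 3y = 0 in general position

# The parabola y = x^2, parametrized by x = t, y = t^2.
print(line_order(0, 1, 1, 2))   # the x-axis
print(line_order(1, 1, 1, 2))   # any other line through the origin
```

The orders nine and four for $y^4 = x^9$, and two and one for the parabola, match the winding numbers quoted in the text.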


8. The World’s Most Elusive Curve-Sketching Problem. We have seen how, if we replace R by 
C, analytic geometry leads us in a natural way to topology. But R may be replaced by other fields, 
too. For instance, doing analytic geometry over $\mathbb{Q}$ instead of over $\mathbb{R}$ leads directly to number 
theory. To see this we will try sketching, in $\mathbb{Q}^2$, some curves defined by polynomials with 
coefficients in $\mathbb{Q}$. 

As a first example, consider the line $L$ in $\mathbb{Q}^2$ defined by $ax + by + c = 0$ ($a, b, c \in \mathbb{Q}$). It 
looks like the corresponding line $L'$ in $\mathbb{R}^2$, in the sense that $L$ is everywhere dense in $L'$. It is also 
easily checked that the same thing is true of the “parabola” in $\mathbb{Q}^2$ defined by $y = x^2$: it is dense 
in the graph of $y = x^2$ in $\mathbb{R}^2$. 

How about the “rational circle” $x^2 + y^2 = 1$ in $\mathbb{Q}^2$? Does it look like the corresponding real 
circle? This time the answer isn’t so obvious, and we are already led to an interesting question in 
number theory. What we are asking is this: for any point $P$ on the real circle, is there a point 
$P' \in \mathbb{Q}^2$ arbitrarily near $P$ and on that circle? We begin by observing that if $(a, b)$ is any point of 
the rational circle ($(3/5, 4/5)$ is an example), and if $c$ is a common denominator of $a$ and $b$, then 
$ac$, $bc$ and $c$ are integers satisfying the equation $(ac)^2 + (bc)^2 = c^2$. Put another way, $(ac, bc)$ is a 
“Pythagorean point” in $\mathbb{R}^2$, that is, a point $(p, q)$ with integer coordinates so that $(0,0)$, $(p,0)$, $(p,q)$ 
are the vertices of a triangle having sides of integral length. (Thus $(3, 4)$ is a Pythagorean point in 
$\mathbb{R}^2$.) We may now rephrase the question this way: Are the slopes $m$ of those 1-subspaces of $\mathbb{R}^2$ 
passing through Pythagorean points dense in $\mathbb{R}$? As it happens, all Pythagorean points are known 
(this is the number theory part): any point $(p^2 - q^2, 2pq)$ ($p, q$ distinct, nonzero integers) is 
Pythagorean, and any Pythagorean point is either of this form, or a rational scalar multiple of it. 
One can now easily verify that for any $m \in \mathbb{R}$ and $\epsilon > 0$, there is some $2pq/(p^2 - q^2)$ within $\epsilon$ of 
$m$. So the rational circle $x^2 + y^2 = 1$ is indeed dense in the real one. 
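
The density argument can be made concrete. In the sketch below, which is an illustration and not the article's own construction, the ratio $r = q/p$ is taken to be a rational approximation of $\tan(\theta/2)$, so that the point $((p^2 - q^2)/(p^2 + q^2),\ 2pq/(p^2 + q^2))$ lands near the point of the real circle at angle $\theta$ (with $-\pi < \theta < \pi$). The function name and the `max_den` parameter are assumptions made for this example.

```python
from fractions import Fraction
import math

def rational_point_near(theta, max_den=1000):
    """A point of the rational circle close to (cos theta, sin theta), -pi < theta < pi."""
    # r plays the role of q/p; limit_denominator keeps p and q small.
    r = Fraction(math.tan(theta / 2)).limit_denominator(max_den)
    x = (1 - r * r) / (1 + r * r)   # (p^2 - q^2) / (p^2 + q^2)
    y = 2 * r / (1 + r * r)         # 2pq / (p^2 + q^2)
    return x, y

x, y = rational_point_near(1.0)
print(x, y)
print(x * x + y * y == 1)                                 # exact, since x and y are Fractions
print(abs(float(x) - math.cos(1.0)), abs(float(y) - math.sin(1.0)))
```

Raising `max_den` brings the rational point closer to the target, at the cost of larger numerators and denominators.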

What about other curves, for example $x^n + y^n = 1$? In $\mathbb{R}^2$, for $n > 2$ and even, it looks 
something like the perimeter of a TV screen, becoming more and more “square” as $n$ gets larger. 
For $n > 2$ and odd, it has two branches asymptotic to $y = -x$. Does an appropriate generalization 
of the above show that the corresponding rational curve is dense in the real one? No, it 
doesn’t. In fact it has been conjectured, though never proved, that except for “trivial points” (in 
which one co-ordinate is zero), in $\mathbb{Q}^2$ the curve is actually empty! Does this sound familiar? It is a 
geometric way of looking at “Fermat’s Last Theorem”: for $n > 2$, $a^n + b^n = c^n$ has no nontrivial 
integer solutions. For, dividing through by $c^n$, we can restate it as: if $n > 2$, then $r^n + s^n = 1$ has 
no nontrivial rational solutions. So Fermat’s Last Theorem is just a (very elusive) curve-sketching 
problem in $\mathbb{Q}^2$. 
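
A small search shows the contrast between $n = 2$ and larger exponents. This is only a sketch: a finite search over positive integers up to a bound proves nothing about the general statement, and the function name and bounds are arbitrary choices for illustration.

```python
def fermat_solutions(n, bound):
    """All (a, b, c) with 1 <= a <= b <= bound, c <= bound and a^n + b^n = c^n."""
    powers = {c**n: c for c in range(1, bound + 1)}
    return [(a, b, powers[a**n + b**n])
            for a in range(1, bound + 1)
            for b in range(a, bound + 1)
            if a**n + b**n in powers]

print(fermat_solutions(2, 20))        # Pythagorean triples such as (3, 4, 5)
print(len(fermat_solutions(3, 100)))  # nothing found for n = 3 in this range
print(len(fermat_solutions(4, 100)))  # nothing found for n = 4 in this range
```

Each triple found for $n = 2$ gives the rational point $(a/c, b/c)$ on the circle; for $n = 3$ and $n = 4$ the search returns no nontrivial point within the chosen bound.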

In $\mathbb{P}^2(\mathbb{C})$, we found that irreducible conics are topologically spheres. We can similarly try to 
sketch conics in $\mathbb{Q}^2$ defined by $ax^2 + bxy + cy^2 + dx + ey + f = 0$ ($a, \ldots, f \in \mathbb{Q}$). We’ve already 
found that the rational unit circle looks like the corresponding real circle. Question: Is the 
analogous thing true for the above “rational conics”? Answer: Not at all; for some choices of 
coefficients the rational locus is empty, for others there are finitely many solutions, and for still 
others it is dense in the real sketch. A full answer to this question is highly nontrivial. In fact, a 
large part of Gauss’ famous Disquisitiones Arithmeticae is devoted to answering this and other 
questions closely related to it. It should be mentioned that one often extends $\mathbb{Q}$ to a finite 
algebraic extension of it, or replaces $\mathbb{Q}$ by a finite field and asks for the set of solutions to such 
Diophantine problems over the new field. 


9. Algebra: Old versus New. So far we’ve considered mostly what can be described as “curves
in 2-spaces.” Each such curve can be defined using a single polynomial. However, a single
polynomial will in general define different sets in different spaces. Thus $x$ defines a point in $\mathbb{C}_x$, a
line in $\mathbb{C}_{xy}$, a plane in $\mathbb{C}_{xyz}$, and a hyperplane in $\mathbb{C}^n$. In each case the dimension is $n - 1$. It turns
out that more generally, any nonconstant polynomial $p \in \mathbb{C}[x_1, \ldots, x_n]$ defines a “hypersurface”
in $\mathbb{C}_{x_1 \cdots x_n}$ (that is, a surface of complex dimension $n - 1$). This is a basic extension of the case
$n = 2$, where we saw that even polynomials like $x^2 + y^2 + 1$ (whose set of zeros in $\mathbb{R}^2$ is empty)
continue to define a full-fledged complex curve (having two real dimensions) in $\mathbb{C}^2$.

But this fact can be looked at from the other end—namely, nonconstant single polynomials 
can yield only hypersurfaces in C”, nothing any different. It means that restricting oneself to 
single polynomials has serious limitations. We can conveniently see the situation in $\mathbb{R}^3$. Suppose,
for example, we try to define a real circle in $\mathbb{R}^3$. We soon find out that two polynomials are
needed; roughly, each polynomial “cuts down the dimension by one.” Thus we can look at the 
circle as the intersection of: a plane and a cylinder; or a plane and a sphere; or a paraboloid and a 
cone, and so on. There is no single “canonical” pair defining the circle. The most natural thing to 
do is to consider the set of all polynomials vanishing on the circle. This forms an ideal in the ring
of all polynomials in three variables. This idea holds equally well over other fields—in particular, 
over C or Q. We call the set of zeros common to any collection of polynomials an algebraic 
variety, or, for short, a variety. Not only does a variety define an ideal, but any ideal of course 
defines a variety; we will find that varieties (geometry) and ideals (algebra) naturally link up, 
forming a kind of “dictionary.” Thus in extending analytic geometry from plane curves to 
varieties in higher dimensions, we are led from “old” (or high school) algebra to a part of 
“modern” algebra, namely commutative ring theory. And just as we experienced some enlightenment
in sketching curves in $\mathbb{P}^2(\mathbb{C})$ for instance, one finds that old facts from ring theory take on
new geometric meaning; in fact, one can often predict, on the basis of geometric intuition, which 
statements in commutative ring theory are likely to be true and which are not. We indicate only a 
few highlights here, but they will supply a bit of the flavor.

For concreteness, we will consider varieties in $\mathbb{C}^n$ defined by collections of polynomials chosen
from the ring $R = \mathbb{C}[x_1, \ldots, x_n]$. If $\mathfrak{a}$ is an ideal of $R$, we will let $V(\mathfrak{a})$ denote the common set of
zeros of all the polynomials in $\mathfrak{a}$; this is called the variety defined by $\mathfrak{a}$.

Recall that in $\mathbb{C}_{xy}$, $x^2 + y^2$ defines the union of two complex lines, since a point is a zero of
$x^2 + y^2$ ($= (x + iy)(x - iy)$) iff it makes at least one of $x + iy$, $x - iy$ zero. More generally, for
polynomials $p$ and $q$ the set of zeros of the product $pq$ is the union of the zero sets of $p$ and of $q$.
This generalizes to the ideal-theoretic level, as follows. First, if $\mathfrak{a}$ and $\mathfrak{b}$ are ideals of $R$, then the
product $\mathfrak{a}\mathfrak{b}$ is the set of all sums of products $ab$ ($a \in \mathfrak{a}$, $b \in \mathfrak{b}$); $\mathfrak{a}\mathfrak{b}$ is an ideal of $R$.


Fact: $V(\mathfrak{a}\mathfrak{b}) = V(\mathfrak{a}) \cup V(\mathfrak{b})$.


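With ideals, we get even more than with polynomials: If we define $\mathfrak{a} + \mathfrak{b}$ to be the set of sums
$a + b$ ($a \in \mathfrak{a}$, $b \in \mathfrak{b}$), then $\mathfrak{a} + \mathfrak{b}$ is an ideal and we have this


Fact: $V(\mathfrak{a} + \mathfrak{b}) = V(\mathfrak{a}) \cap V(\mathfrak{b})$.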


It can be shown, using the above two facts, that the union and intersection of any two varieties is 
again a variety. The operation “V” sets up a basic link between algebraic geometry and 
commutative ring theory. 

If ideals in fact generalize polynomials, what ideal-theoretic concept generalizes “irreducible
polynomial”? The answer is prime ideal. (Recall that an ideal $\mathfrak{p}$ is prime if $p = qr \in \mathfrak{p}$ implies that
$q \in \mathfrak{p}$ or $r \in \mathfrak{p}$.) It is easy to check that if $p$ is irreducible, then the ideal $(p)$ is prime. At the
geometric level, we say that an algebraic variety is irreducible if it isn’t the union of two properly
smaller algebraic varieties. The following hooks up these two ideas:


Fact: If $\mathfrak{p}$ is prime, then $V(\mathfrak{p})$ is irreducible.


Often an ideal can be written as a product of prime ideals, and then one has:


Fact: A splitting of $\mathfrak{a}$ into prime ideals corresponds to a decomposition of $V(\mathfrak{a})$ into
irreducibles; that is, if $\mathfrak{a} = \mathfrak{p}_1 \cdots \mathfrak{p}_r$, then $V(\mathfrak{a}) = V(\mathfrak{p}_1) \cup \cdots \cup V(\mathfrak{p}_r)$.


It is interesting to note that although $\mathfrak{p}$ prime implies $V(\mathfrak{p})$ irreducible, the converse is not in
general true: $V(\mathfrak{a})$ can be irreducible without $\mathfrak{a}$ being prime.


EXAMPLE: $\mathfrak{a} = (y,\ y - x^2) \subset \mathbb{C}[x, y]$ is not prime since, e.g., $y - (y - x^2) = x^2 = x \cdot x$ is in
$\mathfrak{a}$, but $x$ is not. But $V(\mathfrak{a}) = (0,0) \in \mathbb{C}_{xy}$ since only $(0,0)$ is common to $y = 0$ ($= \mathbb{C}_x$) and
$y - x^2 = 0$ (a parabola); and $(0,0)$ is surely irreducible.


Fact: Many ideals can define the same algebraic variety.


Now if there really is a good “dictionary,” then aren’t the different ideals defining the same
variety trying to tell us something more? From the definition of algebraic variety, it is easy to see
that if $\mathfrak{a} \subset \mathfrak{b}$, then $V(\mathfrak{a}) \supset V(\mathfrak{b})$; that is, $V$ is a partial-order reversing map from ideals to
varieties. Now if $V(\mathfrak{a}) = V(\mathfrak{b})$, with $\mathfrak{a} \subsetneq \mathfrak{b}$, then isn’t $\mathfrak{a}$ somehow defining something “larger”
than $\mathfrak{b}$ is?

In the example of $\mathfrak{a} = (y,\ y - x^2)$ above, $V(\mathfrak{a}) = (0,0)$. But if $\mathfrak{b} = (x, y)$, then $V(\mathfrak{b}) = (0,0)$
too, and it is not hard to check that $\mathfrak{a} \subsetneq \mathfrak{b}$. Geometrically, $V(\mathfrak{b})$ may be looked at as the
intersection of $x = 0$ (i.e., $\mathbb{C}_y$) and $y = 0$ (i.e., $\mathbb{C}_x$). But $V(\mathfrak{a})$ is obtained by intersecting $y = 0$
with $y = x^2$ (a parabola), and the parabola intersects $\mathbb{C}_x$ with multiplicity two! Since $\mathbb{C}_x$ and $\mathbb{C}_y$
intersect in only multiplicity one, $\mathfrak{a}$ in a sense defines an object (double point) larger than $\mathfrak{b}$ does
(single point).
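
The two multiplicities can be read off mechanically. The sketch below is only an illustration under our own conventions (the dictionary encoding of polynomials and the name `order_on_x_axis` are assumptions, not notation from the text): it restricts the second generator of each ideal to the line $y = 0$ and reports the order to which the restriction vanishes at the origin.

```python
def order_on_x_axis(poly):
    """Order of vanishing at x = 0 of poly(x, 0).

    `poly` maps exponent pairs (i, j) to the coefficient of x^i * y^j.
    Returns None if poly vanishes identically on the line y = 0.
    """
    # Setting y = 0 kills every term with j > 0.
    exponents = [i for (i, j), c in poly.items() if j == 0 and c != 0]
    return min(exponents) if exponents else None


parabola = {(0, 1): 1, (2, 0): -1}   # y - x^2, the second generator of a
y_axis = {(1, 0): 1}                 # x,       the second generator of b

print(order_on_x_axis(parabola))     # multiplicity for a = (y, y - x^2)
print(order_on_x_axis(y_axis))       # multiplicity for b = (x, y)
```

For the parabola the restriction is $-x^2$, which vanishes to order two; for the line $x = 0$ the restriction is $x$, which vanishes to order one.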


Fact: Ideals not only define varieties; they also supply information about multiplicities. 


A generation ago, men such as Chevalley, Weil and Zariski did much to make such ideas 
precise, thus cementing the relationship between algebraic geometry and commutative ring theory. 
This has been continued in much expanded contexts by Grothendieck and others. 


10. The Fundamental Theorem of Algebra Revisited. We present here two famous generaliza- 
tions of the Fundamental Theorem of Algebra to higher dimensions; the statement of one is more 
algebraic in flavor and the other, more geometric. 

We consider the algebraic one first. Let us begin by noting that since $\mathbb{C}[x]$ is a principal ideal
domain there is, up to units, a natural one-to-one correspondence between the elements and the
ideals of $\mathbb{C}[x]$: every element $p$ in $\mathbb{C}[x]$ generates the principal ideal $(p)$; and conversely, every
ideal of $\mathbb{C}[x]$ is of the form $(p)$, where $p$ is uniquely determined up to a nonzero constant of $\mathbb{C}[x]$.
Note that any constant of $\mathbb{C}[x]$ generates the ideal $(0)$ or $\mathbb{C}[x]$; any other element of $\mathbb{C}[x]$
generates a proper ideal of $\mathbb{C}[x]$.

Next, in analogy with the notion of “zero of a polynomial,” we may, for any ideal
$\mathfrak{a} \subset \mathbb{C}[x_1, \ldots, x_n]$, define a zero of $\mathfrak{a}$ to be any point in $\mathbb{C}^n$ at which every polynomial of $\mathfrak{a}$ vanishes.
(Thus $V(\mathfrak{a})$ is simply the set of all zeros of $\mathfrak{a}$.)

Now the Fundamental Theorem of Algebra (“every nonconstant polynomial in $\mathbb{C}[x]$ has a
zero in $\mathbb{C}$”) can be translated to the language of ideals: Every proper ideal of $\mathbb{C}[x]$ has a zero.
This italicized statement generalizes directly to ideals in $\mathbb{C}[x_1, \ldots, x_n]$; it is the celebrated


Hilbert Nullstellensatz: Every proper ideal of $\mathbb{C}[x_1, \ldots, x_n]$ has a zero.


It tells us that for any ideal $\mathfrak{a} \subsetneq \mathbb{C}[x_1, \ldots, x_n]$, $V(\mathfrak{a})$ is nonempty. There are various
generalizations and analogues of this theorem. For instance, given any proper principal ideal
$\mathfrak{a} = (p) \subsetneq \mathbb{C}[x_1, \ldots, x_n]$, it happens that $V(\mathfrak{a})$ is not only nonempty, but that its complex
dimension in $\mathbb{C}^n$ is $n - 1$ (that is, real dimension $2n - 2$). This extends what we found out earlier
about curves defined by nonconstant polynomials in $\mathbb{C}[x, y]$: the full picture there has complex
dimension $2 - 1 = 1$, or real dimension 2. Very roughly, the further away a proper ideal $\mathfrak{a}$ is from
being principal, the smaller the dimension of $V(\mathfrak{a})$.

Hilbert’s Nullstellensatz also generalizes from $\mathbb{C}$ to any algebraically closed field $K$; the
varieties $V(\mathfrak{a})$ are then taken to lie in $K^n$.

We now consider our second generalization of the Fundamental Theorem of Algebra. First,
generalizing the transition from $\mathbb{C}^2$ to $\mathbb{P}^2(\mathbb{C})$, one can add a point at infinity to the set of complex
lines parallel to each 1-subspace of $\mathbb{C}^n$, obtaining $\mathbb{P}^n(\mathbb{C})$; $\mathbb{P}^n(\mathbb{C})$ may then be supplied with a
topology induced from the 1-subspaces of $\mathbb{C}^{n+1}$ passing through $\mathbb{C}^{n+1}$-open subsets, as we
analogously did with $\mathbb{P}^2(\mathbb{C})$. Any variety in $\mathbb{C}^n$ may then be completed in $\mathbb{P}^n(\mathbb{C})$ by taking its
topological closure.

Now Bézout’s Theorem, already an important generalization of the Fundamental Theorem of
Algebra, in fact extends to higher dimensions, and this is our second generalization. To explain,
for any variety $V$ in $\mathbb{P}^n(\mathbb{C})$ it turns out that there is some $m \le n$ such that “almost all” subspaces
$\mathbb{P}^m(\mathbb{C})$ of $\mathbb{P}^n(\mathbb{C})$ intersect $V$ in a fixed number $d > 0$ of distinct points. (All subspaces $\mathbb{P}^m(\mathbb{C})$ of
$\mathbb{P}^n(\mathbb{C})$ may be parametrized in a natural way by an algebraic variety $G$ (a “Grassmann
manifold”); “almost all” means all subspaces corresponding to points off a proper algebraic
subvariety of $G$.) $V$ is then said to have dimension $n - m$ (or codimension $m$), and degree $d$. As
with curves, a variety is said to be irreducible if it isn’t the union of any two strictly smaller
varieties.


GENERALIZED BEZOUT THEOREM: Let $V, W$ be two distinct irreducible algebraic varieties in
$\mathbb{P}^n(\mathbb{C})$ of codimensions $r, s$ and degrees $d, e$. If $r + s \le n$, then $V \cdot W$ has codimension $r + s$ and
degree $de$.


REMARKS: 

(1) We’ve written $V \cdot W$ instead of $V \cap W$ to indicate that the intersection is “counted with
multiplicity.” A geometric definition of multiplicity may be given in the same spirit as for curves.
The following example (which may be visualized in $\mathbb{R}_{xyz}$, although it actually takes place in $\mathbb{C}_{xyz}$)
gives the idea. The cylinder $x^2 + y^2 = 1$ (codimension 1, degree 2) tangentially intersects the
sphere $x^2 + y^2 + z^2 = 1$ in a double circle, that is, a circle counted with multiplicity two.
(The double circle has codimension $1 + 1 = 2$ and degree $2 \cdot 2 = 4$.) Almost all small changes in
the coefficients of our two polynomials will separate the points of the double circle. For instance,
shrinking the cylinder a little bit will split up the double circle into two separate circles. The
cylinder and sphere now intersect “transversally,” and one can easily check that the union of the
two circles has degree four; this union has multiplicity one, in that additional small “algebraic
perturbations” of the sphere or cylinder won’t further separate points.

As with curves, intersection multiplicity may also be looked at in a more algebraic way, as well 
as in a more topological fashion. One of the important milestones in algebraic geometry is Weil’s 
book [10], in which he works out the theory of intersection multiplicity in a purely algebraic way 
for algebraic varieties of arbitrary dimension over an arbitrary groundfield.

(2) The theorem can be stated to accommodate general, not-necessarily-irreducible algebraic 
varieties. 

(3) The theorem can be generalized to any algebraically closed field $K$, the varieties lying in
$\mathbb{P}^n(K)$.

Before we leave the world of higher dimensions, we make a final observation. Just as one can
do curve sketching over “arithmetic” fields $K$ such as $\mathbb{Q}$, algebraic extensions of $\mathbb{Q}$ and finite
fields, one can ask for the appearance in, say, $K^n$ of varieties defined by polynomials with
coefficients in $K$. We are led to even more far reaching results (as well as interesting new
questions) in number theory. For example, a study of the solution in $K^n$ of a quadratic form in $n$
variables brings us all the way from the “$n = 2$” of Gauss’ Disquisitiones arithmeticae (1801)
through deep studies by Hilbert, Minkowski, Hasse, C. L. Siegel and others. Much work on these
number-theoretic questions continues to this day.


11. What About Calculus? A major sales pitch of this article has been that it is enlightening to 
study analytic geometry over fields other than R. One might wonder if the same is true for 
calculus. The answer is yes. Algebraic geometry enters the picture in a natural way; we briefly 
indicate how, in this section. 

We replace $\mathbb{R}$ by $\mathbb{C}$. The reader is probably familiar with integration over $\mathbb{C}$, in which “line
integrals” appear. In studying this, one soon meets a problem such as integrating $\int \sqrt{x}\,dx$ around
$0 \in \mathbb{C}_x$. There we encounter the multiple-valuedness of $\sqrt{x}$, and we find that we must go around
the origin twice to arrive back at the original value of $\sqrt{x}$. In this sense, then, it is somewhat
artificial to insist that we’re integrating on $\mathbb{C}$; the integration is more naturally taking place on a
Riemann surface which, in our context, amounts to an algebraic curve which is “nonsingular.”
(Nonsingular means that for each point $P$ of the curve, in some neighborhood of $P$ the curve is the
graph of an invertible complex-analytic function. Thus the “circle” $x^2 + y^2 = 1$ in $\mathbb{C}_{xy}$ is
nonsingular; but $\mathbb{C}_x \cup \mathbb{C}_y$ is not, since the part around $(0,0)$ isn’t the graph of a function.) Let us
explain. In $\mathbb{C}_{xy}$, the curve $C$ given by $y^2 = x$ has a function on it, namely the restriction to $C$ of the
coordinate function $y$ on $\mathbb{C}_{xy}$, and we are integrating around a closed path on $C$. Of course one
can integrate other functions on $C$, too; for instance (the restriction to $C$ of) rational functions of
$x$ and $y$.

Note: Instead of looking at $C$ as the union of the slices $x_1 = \text{constant}$ used earlier, we could
just as well let our slices be cylinders (real circle about $(0,0)$ in $\mathbb{C}_x$) $\times\ \mathbb{C}_y$. Now the intersection of
each such cylinder with $C$ is a simple, closed curve lying over the real circle. As a point goes
around the closed curve once, its projection on the circle goes around twice; the whole complex
parabola is the union of $(0,0)$ and all these “two loop” curves, one such curve for each of the
concentric cylinders. We can now think of integrating around $(0,0)$ as actually integrating along
one of these curves.
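
The “two loop” behaviour is easy to observe numerically. The following is a rough sketch under our own assumptions (the function name `continue_sqrt`, the unit circle as path, and the step count are illustrative choices): it follows a continuous choice of $y$ with $y^2 = x$ while $x$ travels around the unit circle, always picking the square root nearest to the previous value.

```python
import cmath


def continue_sqrt(loops, steps_per_loop=1000):
    """Follow a continuous branch of sqrt(x) as x circles the origin.

    Starts from y = 1 at x = 1 and returns the value of y after x has gone
    around the unit circle `loops` times.
    """
    y = 1 + 0j
    for k in range(1, loops * steps_per_loop + 1):
        x = cmath.exp(2j * cmath.pi * k / steps_per_loop)
        r = cmath.sqrt(x)                 # one of the two roots of y^2 = x
        # Keep whichever root is closer to the previous value of y.
        y = r if abs(r - y) <= abs(-r - y) else -r
    return y


print(continue_sqrt(1))   # close to -1: one circuit changes the sign
print(continue_sqrt(2))   # close to +1: two circuits restore the value
```

One circuit of $x$ carries the point $(x, y)$ on $C$ only halfway around its closed curve, which is why a second circuit is needed.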

Now consider the line integral $\int \sqrt{x(x - 1)(x + 1)}\,dx$. We once again encounter multiple-valuedness
when integrating around a simple closed curve in $\mathbb{C}_x$ containing an odd number of
the points $0, 1, -1$. Again it is natural to look at this as integration of $y$ on the curve
$y^2 = x(x - 1)(x + 1)$. In $\mathbb{P}^2(\mathbb{C})$, this defines the torus of §5; now it serves as a “Riemann surface of
genus one.” More generally, the integral $\int \sqrt{x(x^2 - 1^2) \cdots (x^2 - g^2)}\,dx$ leads to a Riemann
surface of genus $g$.

What can be said about the values obtained by integrating on such curves? First, recall that in
$\mathbb{C}$, if a function $f$ is analytic in a simply-connected region $R$, then for fixed points $Q_0, Q_1$, the value
of any line integral $\int_{Q_0}^{Q_1} f(x)\,dx$ is independent of the path in $R$ connecting $Q_0$ and $Q_1$. But if $f$ has,
say, one pole $P$ in $R$, then the value depends on the number of circuits the path makes around $P$.
One can regard $P$ as a “hole,” in the sense that $f$ is analytic in $R \setminus P$ but not in $R$. This time any
two values of $\int_{Q_0}^{Q_1} f(x)\,dx$ differ by an integral multiple of a period (which is the value of
$\int_\gamma f(x)\,dx$, where $\gamma$ is closed and goes around $P$ once).
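
As a concrete instance, take $f(x) = 1/x$, with its single pole at $P = 0$; the period is then $2\pi i$. The sketch below is a plain midpoint-rule approximation written for illustration only (the name `circuit_integral`, the unit-circle path, and the step count are our assumptions). It integrates around the origin a chosen number of times and shows the result growing in integer multiples of the period.

```python
import cmath


def circuit_integral(f, circuits, steps=20000):
    """Approximate the integral of f along the unit circle, starting at 1
    and winding `circuits` times around the origin (negative = clockwise)."""
    n = steps * max(1, abs(circuits))
    total = 0j
    prev = 1 + 0j
    for k in range(1, n + 1):
        cur = cmath.exp(2j * cmath.pi * circuits * k / n)
        mid = cmath.exp(2j * cmath.pi * circuits * (k - 0.5) / n)
        total += f(mid) * (cur - prev)    # midpoint rule on each small arc
        prev = cur
    return total


period = 2j * cmath.pi
for c in (1, 2, 3):
    # Each ratio is close to the integer number of circuits.
    print(c, circuit_integral(lambda x: 1 / x, c) / period)

# A function with no pole inside the path has no period at all:
print(abs(circuit_integral(lambda x: x * x, 1)))   # close to 0
```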

What happens when we integrate on, say, a nonsingular curve of genus one? The situation is
analogous to the above, only now there are two “holes” since a torus is topologically the product
circle $\times$ circle, and each of the two circles acts like a hole. Thus, the values of $\int_{Q_0}^{Q_1} f(x)\,dx$ ($Q_0$, $Q_1$
fixed) form a point lattice in $\mathbb{C}$. If one holds $Q_0$ fixed but lets $Q_1$ vary, the values that $\int_{Q_0}^{Q_1} f(x)\,dx$
attains fill out all of $\mathbb{C}$. If the path of integration never topologically encircles a hole of the torus,
the values fill out only a part of $\mathbb{C}$, namely a “fundamental region,” a parallelogram topologically
equal to $[0,1) \times [0,1)$, containing the point $0 \in \mathbb{C}$. Note that the set of values modulo the
periods, i.e., $\mathbb{C}/(\mathbb{Z} \times \mathbb{Z})$, is a torus. One can actually give this the structure not only of a
commutative group, but also of an algebraic variety in $\mathbb{P}^2(\mathbb{C})$, obtaining an “abelian variety.”

What happens when we integrate on a nonsingular curve of genus $g$? Now there are $2g$ holes,
the values of $\int_{Q_0}^{Q_1} f(x)\,dx$ ($Q_0$ fixed, $Q_1$ varying) fill out $\mathbb{C}^g$, and the values of $\int_{Q_0}^{Q_1} f(x)\,dx$ ($Q_0$, $Q_1$
both fixed) fill out a lattice in $\mathbb{C}^g$ looking like $\mathbb{Z}^{2g}$. If the path $\gamma$ connecting $Q_0$ and $Q_1$ never
encircles a hole, the values form a fundamental region looking like $[0,1)^{2g}$; and $\mathbb{C}^g/\mathbb{Z}^{2g}$ is a
product of $2g$ real circles, which is an abelian variety called the “Jacobian variety” of the curve.

It should be mentioned that the restriction of nonsingularity is not actually necessary. Any
curve in $\mathbb{P}^2(\mathbb{C})$ can be “desingularized”; for us, this amounts to replacing it by a nonsingular
curve having all the function-theoretic information of the original curve.

These ideas have been extended to varieties of higher dimension, and much effort has gone into 
trying to extend as much of all this as possible to the “arithmetic” fields mentioned earlier. 


12. Newer Methods in Algebraic Geometry. The “evolution of science” is often more like a 
series of revolutions. In mathematics the introduction of imaginary quantities, of differentials, of 
transfinite numbers, of nonconstructive existence proofs—the list could go on and on—all 
represented revolutions in their day. Riemann began a revolution with his introduction of 
Riemann surfaces, and it profoundly affected algebraic geometry. Hilbert’s nonconstructive proof 
in 1888 of the Nullstellensatz (§10) was wildly new then, prompting P. Gordan, a leading algebraic
geometer of the day, to his famous protest “Das ist nicht Mathematik, das ist Theologie!”
More recently the school including Weil, Chevalley and Zariski revolutionized algebraic geometry 
by breaking away from the constraints of working over C, replacing C by an arbitrary ground- 
field; this “arithmetization of algebraic geometry” helped bring large parts of number theory into 
the fold of algebraic geometry. 

In each case, it was not a matter of a change in fashion for fashion’s sake; the power of any 
cluster of methods to solve new problems gradually gets exhausted, meeting the point of 
diminishing returns—and new approaches and concepts are needed. 

The 1950’s found a new storm brewing, one that ended up absorbing into algebraic geometry 
nearly all of commutative algebra, and topological notions such as fiber bundles, sheaves and 
various cohomological theories. In this final section we briefly indicate some of these newer ideas 
as they touch on algebraic geometry. An excellent introductory treatment can be found in [4].

By way of motivation, let us begin with an irreducible variety $V \subset \mathbb{C}^n$ and let $\bar{p}$ denote the
restriction to $V$ of the polynomial $p \in \mathbb{C}[x_1, \ldots, x_n]$. It turns out that all such $\bar{p}$ form an integral
domain $D$ (no zero divisors); let $K$ be its quotient field. We can regard elements of $K$ as rational
functions on the variety. ($K$ is called the function field of $V$.) We can call the elements of the
integral domain $D$ “integral functions” on $V$. An element algebraic over $K$ is an algebraic function
on $V$. The similarity in nomenclature with numbers is striking: one has integral, rational and
algebraic numbers. Nonalgebraic numbers are called transcendental, which are analogous to
“transcendental functions.” Is this similarity accidental, or is something fundamental going on?
We can ask a leading question right off: can we actually look at the ordinary integers $\mathbb{Z}$ and their
quotients $\mathbb{Q}$ as functions on some “variety”? The answer is yes. We briefly explain. First, all
functions of $D$ vanishing at a point $P \in V$ form a maximal ideal $\mathfrak{m}_P$ in $D$. (A point is a smallest
nonempty object in $V$; a maximal ideal is a largest ideal $\neq D$ in $D$.) It is not hard to see that for
any function in $D$, its value at $P$ is its ring-homomorphism image in $D/\mathfrak{m}_P = \mathbb{C}$. It turns out that
every maximal ideal of $D$ is some $\mathfrak{m}_P$ and that $P$ and $\mathfrak{m}_P$ are in 1:1 correspondence. We may thus
identify the set of points in $V$ with the set of maximal ideals of $D$.

What variety has $\mathbb{Z}$ as “$D$”? In $\mathbb{Z}$ the maximal ideals are just the nonzero prime ideals. Thus
the “variety” having $\mathbb{Z}$ as “ring of integral functions” has for its points the set of prime ideals $(2)$,
$(3)$, $(5)$, ...! The elements of $\mathbb{Z}$ may now be looked at as functions on our “variety.” For example
25 at the point $(p)$ is $25 \bmod (p)$; therefore it is 1 at $(2)$, 1 at $(3)$, 0 at $(5)$, 4 at $(7)$, and so on. The
elements of Q are rational functions; 3/49, for example, is “regular” except for a pole of order 2 
at the point (7). 
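
These evaluations are simple enough to compute directly. The sketch below is illustrative only; the helper names `value_at` and `order_at` are our own and do not come from the text. The first reduces an integer modulo $p$; the second returns the order of a nonzero rational number at $(p)$, positive for a zero and negative for a pole.

```python
from fractions import Fraction


def value_at(n, p):
    """Value of the integer n, viewed as a function, at the point (p)."""
    return n % p


def order_at(r, p):
    """Order of the nonzero rational r at (p): > 0 for a zero, < 0 for a pole."""
    r = Fraction(r)
    if r == 0:
        raise ValueError("the zero function has no finite order")
    num, den, k = r.numerator, r.denominator, 0
    while num % p == 0:      # factors of p in the numerator are zeros
        num //= p
        k += 1
    while den % p == 0:      # factors of p in the denominator are poles
        den //= p
        k -= 1
    return k


print([value_at(25, p) for p in (2, 3, 5, 7)])                 # [1, 1, 0, 4]
print([order_at(Fraction(3, 49), p) for p in (2, 3, 5, 7)])    # [0, 1, 0, -2]
```

The second line of output records that $3/49$ has a simple zero at $(3)$, a pole of order 2 at $(7)$, and is regular and nonvanishing at the other points listed.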

It soon became clear to algebraic geometers that one could profitably be very general and 
consider any commutative ring R with unit in place of D. One could furthermore consider not 
only maximal ideals as points, but prime ideals (which generalize maximal ideals) as more general 
points. These more general points are like irreducible varieties of V which in many respects behave 
like ordinary points. The set of all prime ideals of R is denoted “Spec R”, and is suitably 
topologized. One can then build on Spec R to get an “affine scheme”: roughly speaking, it is 
Spec R together with, at each $P \in$ Spec R, the functions (quotients of R) regular in some
neighborhood of P. One can glue these affine schemes together to get more general schemes, much as
one glues together topological, differentiable or analytic neighborhoods to construct more general 
topological, differentiable or analytic manifolds. 

But another current was to make its way into algebraic geometry. Workers in manifold theory 
had been experimenting with fiber bundles, sheaves, various cohomological theories and the like. 
These powerful tools were yielding subtle invariants of the manifolds and could be used, for 
instance, to help classify those manifolds. It was realized that many of these ideas could be 
extended to schemes, and could provide to algebraic geometry new, useful invariants. However, all 
this ended up (1) being pretty abstract, and (2) introducing a lot of new, heavy baggage, baggage 
to be added to a subject already heavy with structure. The big question: is it worth it? Now, after 
it has had time to prove itself, we can say definitely yes. It has been all-important in solving many
“big” problems in algebraic geometry, some of long standing. Occasionally, after a modern proof
has been found, a more elementary one follows, but the new insights provided by the new 
approach are crucial. 


Some examples. 


• One of the important classical results is the Riemann-Roch theorem for a projective curve C
of genus g; this theorem gives the dimension of the vector space of functions on C having at most 
a prescribed set of poles there. This was extended to projective varieties of dimension two, but a 
general, higher-dimensional analogue seemed out of reach using older methods. The newer 
approach provided a natural way of looking at the general problem; today, through the work of 
Hirzebruch, Grothendieck and others, there exists a neat cohomological generalization to arbitrary 
dimension. (See [6].) Riemann-Roch type results are important in dealing with classification 
problems. 

• An old and venerated problem in algebraic geometry was to construct a nonsingular
projective variety having a prescribed function field. Riemann assumed it; in 1945 Zariski, after 
Herculean efforts, solved the problem over groundfields of characteristic zero for varieties of 
dimension not more than three. 

For a time it seemed that if the acknowledged expert Zariski couldn’t resolve singularities in 
higher dimensions, then they just couldn’t be resolved! But the power of the newer methods 
rekindled new hope and efforts. Finally, in 1964, H. Hironaka solved the problem in characteristic 
zero for arbitrary dimension [5]. 

• In §8 we saw how “Fermat’s Last Theorem” could be looked at as the problem of showing
that the curve $x^n + y^n = 1$ in $\mathbb{Q}_{xy}$ has no nontrivial points. One can more generally consider the
equation

(1) $a_0 x_0^{n_0} + \cdots + a_r x_r^{n_r} = b$


and ask for the number of solutions in various fields. Many congruence results and problems in 
number theory can be looked at as finding the number of such solutions when the field is finite. 
Just as with Fermat’s Last Theorem, this too can be looked at geometrically, this time as a 
problem of counting the number of points of the variety defined by (1), considered over a finite 
field. 
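
For a small prime field the count can be done by brute force. The following sketch is only meant to make the counting problem concrete; it is not Weil's method, and the function name `count_solutions` and its argument layout are assumptions of ours. It enumerates all points of $\mathbb{F}_p^{\,r+1}$ and counts those satisfying an equation of the form (1).

```python
from itertools import product


def count_solutions(coeffs, exps, b, p):
    """Number of (x_0, ..., x_r) in (F_p)^(r+1) with
    sum(a_i * x_i**n_i) = b in F_p, for a prime p (exhaustive search)."""
    count = 0
    for xs in product(range(p), repeat=len(coeffs)):
        lhs = sum(a * pow(x, n, p) for a, x, n in zip(coeffs, xs, exps))
        if (lhs - b) % p == 0:
            count += 1
    return count


# The "circle" x^2 + y^2 = 1 over F_5 and F_7:
print(count_solutions([1, 1], [2, 2], 1, 5))   # 4
print(count_solutions([1, 1], [2, 2], 1, 7))   # 8
# The Fermat cubic x^3 + y^3 = 1 over F_7:
print(count_solutions([1, 1], [3, 3], 1, 7))   # 6
```

Exhaustive search costs $p^{\,r+1}$ evaluations, so it is useful only for experimenting with small cases.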

In a 1949 paper [12], Weil formulated some conjectures about the solutions to (1). In dimension 
1 (r = 1 in (1)), he had answered these questions in [11]; various higher-dimensional examples 
pointed towards their truth for arbitrary r > 1, but, as is so often the case in algebraic geometry, 
generalizing to higher dimensions required new methods. In brief, the conjectures, if true, would 
establish fundamental connections between the number of solutions to (1) in finite fields (as 
reflected in a “zeta function” attached to (1)’s variety; such a zeta function is pregnant with 
number-theoretic information, just as is the usual zeta function familiar in complex variables) and 
the topology (actually, Betti numbers) of projective varieties over C. These conjectures stimulated 
a great amount of work, especially to find a cohomology for varieties over finite fields strong 
enough to yield the conjectures. Important contributions were made by Lang-Weil [8], Dwork [2], 
Grothendieck ([3], for instance), Deligne and others. All the conjectures have now been settled in 
the positive; the most resistant part was finally cracked by Deligne in 1973 [1]. At the same time, 
this has settled various conjectures in number theory. 

The above examples give a little idea of some of the accomplishments achieved using these 
newer methods. In addition to yielding concrete results, these methods bring with them a new 
perspective—a perspective that on one hand has extended algebraic geometry, and on the other 
hand, has helped unify it. 

I close on this thought: often, when a new set of ideas is introduced into mathematics, it all 
initially seems like a tremendous blizzard, with piles and piles of snow (a charge surely not 
escaped by algebraic geometry). But after a time it melts down to its essence: a few big ideas and
major theorems. The new ideas seem after a while to be simple and natural, and future generations
build on them to arrive at mathematics that is deeper, more general and more unified. Yet there 
will always be concrete examples like curves, surfaces and numbers! They will never go out of 
style, and will always serve as a source of and testing ground for new directions and new ideas.


1. Introduction. Mathematical proofs are normally presented in a step-by-step, “linear” fashion,
proceeding unidirectionally from hypotheses to conclusion. While this age-old and venerable
method may be well suited for securing the validity of proofs, it is nonetheless unsuitable for a 
second, highly important role of most presentations—that of mathematical communication. 

In this article an alternative method, called the “structural method,” is proposed. The method, 
triggered by recent ideas from computer science, is intended to increase the comprehensibility of 
mathematical presentations while retaining their rigor. The basic idea underlying the structural 
method is to arrange the proof in levels, proceeding from the top down; the levels themselves
consist of short autonomous “modules,” each embodying one major idea of the proof. 

The top level gives in very general (but precise) terms the main line of the proof. The second 
level elaborates on the generalities of the top level, supplying proofs for unsubstantiated 
statements, details for general descriptions, specific constructions for objects whose existence has 
been merely asserted, and so on. If some such subprocedure is itself complicated, we may choose 
to give it in the second level only a “top-level description,” pushing the details further down to
lower levels. And so we continue down the hierarchy of subprocedures, each supplying more
details to plug in holes in higher levels, until we reach the bottom where (to borrow W. W.
Sawyer’s metaphor [10, p. 222]) all the leaks are plugged and the proof is watertight. 

The top level is normally very short and free of technical (i.e., notational, computational, etc.)
details. Thus it can be grasped at one glance, yielding an overview of the proof. (Note that the 
very term “overview” suggests view from the top.) The bottom level is quite detailed, resembling 
in this respect the standard linear proof. However, these details now appear only after their role in 
the proof is determined. Furthermore, they are now organized into conceptual units (the modules), 
each clearly and explicitly connected to its appropriate place in the total hierarchy. The 
intermediate levels facilitate a smooth transition from the generalities of the top level to the details 
of the bottom, from the global to the local perspective. 

The two approaches are compared pictorially in Fig. 1. The linear method is represented by an 
oriented line segment (a), the structural method by a “structure diagram” (b). The structure 
diagram displays the levels, the modules and their interconnections. In each box, or module, the 
argument flows linearly, but it is very short and “flat” (no complex nesting patterns of sublevels); 
thus, again, it can be grasped at a glance. Incidentally, the description above applies not only to 
proofs, but to other mathematical procedures such as definitions, constructions, algorithms and 
examples as well. 


[Figure: (a) The linear method, an oriented line segment running from beginning (hypotheses) to end (conclusion). (b) The structural method, a structure diagram.]


FIG. 1. The two methods of presentation.


One may think of the structural approach as viewing the proof (which is at ground level) from 
a tall building. When viewing from the top we see the whole proof at a glance, but only in vague 
outline—no details can be discerned. As we descend the levels of the building, a zooming effect 
occurs: our view encompasses smaller and smaller segments of the proof, but these are seen with 
more and more clarity. 

I anticipate a few objections some readers may have at this point. The first two objections, if 
carried to extremes, are, respectively, that what I am saying is wrong and that what I am saying is 
trivially true. More specifically, the first one may be expressed by the statement that “there is 
nothing wrong with the linear method; after all it was good enough for me and for generations of 
mathematicians.” My answer is that we must look for insights in our students’ behavior, not
merely in ourselves. After all, most of us are successful survivors of the standard method, so we 
make up an extremely biased sample to draw conclusions from. (For a fierce attack on the linear 
method cf. [6, pp. 115-119].) 

The second objection is represented by the following statement: “But your structural method is
what I’ve been doing in my classes all along.” This statement, which has been expressed by some
acknowledged excellent teachers, actually supports my argument. If the structural method provides 
a coherent and explicit system of presentation, whereby many of the “options” of good teachers 
become standard—it will have achieved its purpose. Note again that these options (e.g., a short 
overview of a long and complicated proof) not only become standard; they actually become part 
of the “official,” formal proof. 

A third argument that has been raised actually grants that knowledge of the structure of a 
proof is essential for its understanding. However (it goes on), this kind of knowledge is best left to 
the students to discover for themselves. My response to this subtler variant of the “it was good 
enough for me” argument is that indeed students should discover the structure for themselves; 
only they don’t! And experience shows that this kind of task is beyond the capabilities of most 
undergraduates with standard mathematical training. In fact it may turn out to be an important 
teaching strategy within the structural method, to explicitly train students to structure linear 
proofs. 

Finally, it should be stressed that the proposed modification bears directly only on one part of 
the teaching-learning process: the formal presentation of mathematical procedures as they 
commonly occur in lectures, textbooks and journal articles. By no means is it meant to replace the 
informal and artful devices that good expositors have always been employing to enhance the full 
and active participation of the student in this process, such as intuition, heuristics, personal 
metaphors, humor and even acting. After all, if the student is not listening, it matters little what 
we are saying! 


2. Examples. I do not know of any way to prove (or disprove) my claims on the merits of the 
structural method. So in order to judge these claims we must rely on the powerful yet imprecise 
and subjective tools of our experience, intuition, reflections and observations; and a good way of 
activating these tools here is to carry out some mathematical “case studies.” 

On one level, these case studies are simply examples: they help us clarify and make more 
concrete the general concepts discussed previously. On a different level, they can function as a 
kind of mental experiment. That is, one can pretend to be a student reading these proofs for the 
first time (or an instructor teaching them to a class), and thus recapture mentally some of the real 
experience of learning (or teaching) by these methods. 

Ideally we should study some long and complicated proofs, where the structure is buried deeply 
in the linear presentation (e.g., Brouwer’s Fixed-Point Theorem), but we are limited here to 
theorems whose proofs are relatively short and elementary. Our examples will thus not be 
paradigmatic in the sense of exhibiting all the desired properties at once; rather, each will 
illuminate some of the issues only. 


2.1. The Infinite Number of Primes. Euclid’s original proof that there are infinitely many primes 
is so short and clear that it can be considered a one-level proof. So let us move further. 

The odd prime numbers fall naturally into two classes: those leaving a remainder of 1 upon 
division by 4, and those leaving a remainder of 3. I shall call them “monadic” and “triadic” for
short. Thus a number is monadic if it can be represented as 4k + 1, and triadic if it can be
represented as 4l + 3 for some integers k and l.

It is natural to ask how the prime numbers are distributed between these two classes, and it 
turns out that there are infinitely many in each of them. (These are special cases of the following 
theorem of Dirichlet: In any arithmetic progression {an + b|n = 1,2,3,...}, where a and b are 
relatively prime, there are infinitely many primes.) We shall consider here only the simpler case, 
that of the triadic numbers. 


THEOREM. There exist infinitely many triadic primes (i.e., numbers of the form 4k + 3). 


The first proof is patterned after a popular and well-known book.

Proof in the linear style. Consider a product of two monadic numbers:

$$(4k + 1)(4l + 1) = 4k \cdot 4l + 4k + 4l + 1 = 4(4kl + k + l) + 1,$$

which is again monadic. Similarly, the product of any number of monadic numbers is monadic.
Now assume the theorem is false, so there are only finitely many triadic primes, say
$p_1, p_2, \ldots, p_n$. Define $M = 4p_1 p_2 \cdots p_n - 1$.
If $p_i \mid M$, then $p_i \mid 1$ since $p_i \mid 4p_1 p_2 \cdots p_n$. Since this is impossible, we conclude that no $p_i$ divides
M. Also 2 does not divide M as M is odd. Thus all of M’s prime factors are monadic, hence M itself
must be monadic. But $M = 4p_1 p_2 \cdots p_n - 1 = 4(p_1 p_2 \cdots p_n - 1) + 3$ is clearly triadic, a
contradiction. Thus the theorem is proved.


REMARK 1. Note that the student is led blindly and passively through the sequence of steps, 
which he or she must follow without ever being told the general plan of the proof and for what 
purpose the various steps are taken. Why, for example, should we “consider a product of two 
monadic numbers?” Why should we define a number M and why this particular way? Why should 
we check whether $p_i$ and 2 divide M or not?

These questions should naturally arise in the mind of the alert student as he is going through 
the various steps of the proof, but the answers are not apparent in the presentation. And even the 
persistent student who does find the answers can do so only at the end of the proof. Worse yet, for 
many students, understanding (or learning) a proof has come to mean merely checking the validity 
of the deduction in each step. Thus they have stopped even asking these questions, let alone 
finding the answers. 


REMARK 2. I hope you have read the proof with the student’s eyes, not yours, since to a mature 
mathematician this proof is still very easy. However, similar frustration with the linear method can 
be experienced by mature mathematicians too, e.g., when reading a 15-page, strictly linear proof 
in a research journal. 


Proof in the Structural Style. 


Level 1. Suppose the theorem is false and let $p_1, p_2, \ldots, p_n$ be all the triadic primes. We
construct (in Level 2) a number M having the following two properties:

(a) M as well as all its factors are different from $p_1, p_2, \ldots, p_n$;

(b) M has a triadic prime factor.

These two properties clearly produce a contradiction, as we get a triadic prime which is not one
of $p_1, p_2, \ldots, p_n$. Thus the theorem is proved.


In the Elevator. (The elevator, as a metaphor for the intermediate process of descending in 
levels, offers a convenient place to discuss heuristic and other informal issues concerning the next 
level.) 

How shall we approach the definition of M? In light of Euclid’s classical proof, it is natural to
try $M = p_1 p_2 \cdots p_n + 1$. This indeed meets requirement (a), but not (b). In fact, since for all we
know M itself may turn out to be prime, it must be triadic to meet (b).

Thus a natural second guess is $M = 4p_1 p_2 \cdots p_n + 3$. However, this has another “bug”: since
one of the $p_i$’s is 3, M is divisible by 3, in violation of (a). But this bug, once discovered, is easy to
fix: simply eliminate 3 from the product in the bugged definition.


Level 2. Let $M = 4p_2 \cdots p_n + 3$ (we assume $p_1 = 3$). We show that M satisfies the two
requirements from Level 1.

Requirement (a) means that no $p_i$ should divide M. Indeed, $p_2, \ldots, p_n$ do not divide M as they
leave a remainder of 3; and 3 does not divide M as it does not divide $4p_2 \cdots p_n$.

As for requirement (b), suppose on the contrary that all of M’s prime factors were monadic.
Then M, as a product of monadic numbers, would itself be monadic (Lemma, Level 3), whereas
$M = 4p_2 \cdots p_n + 3$ is triadic: a contradiction. Thus (a) and (b) are satisfied.


Level 3: LEMMA. A product of monadic numbers is again a monadic number.


(The proof is as given above.) 
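
The debugging of M in the elevator and the two requirements checked in Level 2 can be replayed numerically on a pretended "complete" list of triadic primes. The following Python fragment is only an illustrative sketch, not part of the proof; the function names and the sample list [3, 7, 11] are assumptions made for the example.

```python
from math import prod


def prime_factors(n):
    """Return the prime factors of n (with multiplicity) by trial division."""
    factors = []
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors.append(d)
            n //= d
        d += 1
    if n > 1:
        factors.append(n)
    return factors


def is_triadic(n):
    """A number is triadic if it leaves a remainder of 3 upon division by 4."""
    return n % 4 == 3


def new_triadic_prime(triadic_primes):
    """Given a finite list of triadic primes starting with 3, build
    M = 4 * p_2 * ... * p_n + 3 and return (M, a triadic prime factor of M)."""
    assert triadic_primes[0] == 3
    m = 4 * prod(triadic_primes[1:]) + 3
    # Requirement (b): M is triadic, so not all of its prime factors are monadic.
    witness = next(q for q in prime_factors(m) if is_triadic(q))
    # Requirement (a): the witness is not one of the listed primes.
    assert witness not in triadic_primes
    return m, witness


# The "bugged" second guess keeps 3 in the product, so 3 divides M.
bugged = 4 * prod([3, 7, 11]) + 3
print(bugged % 3)  # 0, in violation of requirement (a)

# The debugged definition yields a triadic prime outside the list.
print(new_triadic_prime([3, 7, 11]))
```

For the sample list the debugged M is 4 · 7 · 11 + 3 = 311, which happens to be a triadic prime itself; in general M need not be prime, and the sketch then picks out one of its triadic prime factors.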


REMARK 3. Level 1 demonstrates two important features of the top level. First, it can be
grasped at one glance, and second, it gives the essence of the proof. Consider for example the 
introduction of M. What is stressed is its role in the proof, the properties it should enjoy, and how 
these are used to achieve the goal. The actual, detailed construction, as well as the proof that M 
satisfies the required properties, are pushed down to a lower level. Thus the top level indeed gives 
a global view of the proof. 


REMARK 4. The “elevator” trick is a side benefit of the structural method, not an “official” 
part of it. Authors and instructors often like to inject informal remarks (mainly of a heuristic 
character) into the ongoing formal proof. In the linear method, where the argument is presented as 
an unbroken sequence of steps, there is no natural place to do it. So authors have been using 
various display tricks such as footnotes, different font, brackets and so on, to distinguish these 
remarks from the formal proof. In lectures, students are often confused as to what constitutes the 
formal proof and what are the auxiliary remarks. 

In the structural approach, since the proof is divided anyhow into independent, relatively short 
modules, it is always possible to collect all the informal remarks and present them during the 
“elevator trip,” that is, while descending to the next level. 


REMARK 5. The successive debugging in the search for M illustrates an important point. I often 
ask my students to keep a protocol of the process of finding a proof, besides the final product. 
From these protocols it appears that no one reaches the correct M without going through some 
debugging, roughly of the kind given here. Since this is an essential part of the creative process of 
doing mathematics, I believe it should be made explicit and discussed with the students. 


REMARK 6. Compare the location of the lemma on the product of monadic numbers in the two 
approaches. In the linear approach it appears in the beginning of the proof (or sometimes as a 
separate lemma before the main proof). The reader can have no idea why it is needed, but has to 
follow its proof nonetheless, and then keep the unused lemma in memory until it is needed at the 
end of the proof. In the structural approach the lemma is only announced when the need for it 
arises in the main proof. The lemma is used to complete the main proof and only then, in a lower 
level, its proof is given. 


REMARK 7. The case of the monadic primes is more involved and will not be discussed here. 
However, it should be pointed out that in the structural method we have at least a clear beginning; 
in fact, the top level of the proof remains exactly the same, except for the exchange 
“triadic” ↔ “monadic.” (One says that the two proofs exhibit a “top-level similarity.”) The
difficulty, as well as the “meat” of the proof, lie in the lower levels: the number M, promised in 
the top level, is not easy to deliver. 


2.2. A Theorem on Limits.

THEOREM. If $\lim_{x \to a} f(x) = L$ and $\lim_{x \to a} g(x) = M$, then $\lim_{x \to a} f(x)g(x) = LM$.


The following proof is not a caricature of the linear style; it is written strictly according to the 
“official” principles of that style and, in fact, is taken from a real calculus textbook. 


Proof in the Linear Style. Let $\varepsilon > 0$ be given, and let $\eta$ be the smaller of $\sqrt{\varepsilon/3}$ and
$\varepsilon / (3(1 + |L| + |M|))$. Since $\lim_{x \to a} f(x) = L$, there exists a $\delta_1 > 0$ such that $|f(x) - L| < \eta$
whenever $0 < |x - a| < \delta_1$. Similarly, there exists a $\delta_2 > 0$ such that $|g(x) - M| < \eta$ whenever
$0 < |x - a| < \delta_2$. Let $\delta$ be the smaller of $\delta_1$ and $\delta_2$. Now if $0 < |x - a| < \delta$, then $0 < |x - a| < \delta_i$,
$i = 1, 2$, and so we have:

$$\begin{aligned}
|f(x)g(x) - LM| &= |L(g(x) - M) + M(f(x) - L) + (f(x) - L)(g(x) - M)| \\
&\le |L|\,|g(x) - M| + |M|\,|f(x) - L| + |f(x) - L|\,|g(x) - M| \\
&< \frac{|L|\,\varepsilon}{3(1 + |L| + |M|)} + \frac{|M|\,\varepsilon}{3(1 + |L| + |M|)} + \sqrt{\varepsilon/3} \cdot \sqrt{\varepsilon/3} \\
&\le \varepsilon/3 + \varepsilon/3 + \varepsilon/3 = \varepsilon. \qquad \text{q.e.d.}
\end{aligned}$$

Fortunately, many instructors know better. They let the student in on the secret of how these 
mysterious quantities $\eta$ and $\delta$ are actually discovered. But in so doing the direction of the
argument is reversed, and eventually they have to abandon this unorthodox discussion and recast 
the official proof in more-or-less the form above (or at least mention that this recasting should be 
done). 
The argument in the following structured proof resembles this informal discussion but at the 
same time it is quite formal and rigorous. Thus the structural approach brings closer the human 
process and the formal-deductive one. 


A Structured Proof. 


Level 1. Let $\varepsilon > 0$ be given. We find (in Level 2) a $\delta > 0$ such that $|f(x)g(x) - LM| < \varepsilon$
whenever $0 < |x - a| < \delta$. Thus the theorem is proved.


In the Elevator. We have to show that the expression $|f(x)g(x) - LM|$ can be made as small
as we please. To this end, we try to bound it by an expression we know can be made small. Such
expressions are $|f(x) - L|$, $|g(x) - M|$ and multiples of these by a constant and by each other.
After some trial and error the following expression emerges:

(*)   $f(x)g(x) - LM = L(g(x) - M) + M(f(x) - L) + (f(x) - L)(g(x) - M).$


Level 2. Using the equality (*) we have

$$\begin{aligned}
|f(x)g(x) - LM| &= |L(g(x) - M) + M(f(x) - L) + (f(x) - L)(g(x) - M)| \\
&\le |L|\,|g(x) - M| + |M|\,|f(x) - L| + |f(x) - L|\,|g(x) - M|.
\end{aligned}$$

We find a $\delta > 0$ (in Level 3) such that whenever $0 < |x - a| < \delta$, each of the terms on the
right-hand side is smaller than $\varepsilon/3$. Thus the left-hand side is smaller than $\varepsilon$, as required.


In the Elevator. To get $|L|\,|g(x) - M| < \varepsilon/3$, we try to make $|g(x) - M| < \varepsilon/(3|L|)$. However,
there is a bug here: this only works if $L \ne 0$. One way of correcting this bug is to replace $|L|$ by
$1 + |L|$. The case of $|M|\,|f(x) - L|$ is similar. Finally, to get $|f(x) - L|\,|g(x) - M| < \varepsilon/3$, we
make each of the factors smaller than $\sqrt{\varepsilon/3}$.


Level 3. We choose positive $\delta_1, \delta_2, \delta_3, \delta_4$ such that the following hold:

$$\begin{aligned}
|f(x) - L| &< \varepsilon / (3(1 + |M|)) && \text{whenever } 0 < |x - a| < \delta_1; \\
|g(x) - M| &< \varepsilon / (3(1 + |L|)) && \text{whenever } 0 < |x - a| < \delta_2; \\
|f(x) - L| &< \sqrt{\varepsilon/3} && \text{whenever } 0 < |x - a| < \delta_3; \\
|g(x) - M| &< \sqrt{\varepsilon/3} && \text{whenever } 0 < |x - a| < \delta_4.
\end{aligned}$$

(Such $\delta_i$’s exist since L and M are the limits of f(x) and g(x) respectively.) Now let $\delta$ be the
smallest of $\delta_1, \delta_2, \delta_3, \delta_4$, so that if $0 < |x - a| < \delta$ then $0 < |x - a| < \delta_i$, $i = 1, 2, 3, 4$. Then
whenever $0 < |x - a| < \delta$, the expressions $|L|\,|g(x) - M|$, $|M|\,|f(x) - L|$ and $|f(x) - L|\,|g(x) - M|$
all become smaller than $\varepsilon/3$. Thus $\delta$ satisfies the requirements of Level 2.
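
The bookkeeping of Levels 2 and 3 can also be checked numerically. The sketch below computes the tolerances that Level 3 demands of |f(x) − L| and |g(x) − M| and confirms that each of the three terms of Level 2 then stays within ε/3. The concrete instance f(x) = 2x, g(x) = x + 1, a = 1 (so L = M = 2), the function names, and the small floating-point slack are all assumptions introduced for illustration; they do not come from the proof itself.

```python
from math import sqrt


def level3_tolerances(eps, L, M):
    """Return (tol_f, tol_g): how close f(x) must be to L and g(x) to M.

    Each is the smaller of the two Level 3 demands on that function, which is
    what taking delta as the smallest of the four deltas enforces.
    """
    root = sqrt(eps / 3)
    tol_f = min(eps / (3 * (1 + abs(M))), root)
    tol_g = min(eps / (3 * (1 + abs(L))), root)
    return tol_f, tol_g


def level2_terms(eps, L, M):
    """Worst-case sizes of the three terms on the right-hand side in Level 2."""
    tol_f, tol_g = level3_tolerances(eps, L, M)
    return abs(L) * tol_g, abs(M) * tol_f, tol_f * tol_g


def terms_fit(eps, L, M, slack=1e-12):
    """True if every term is at most eps/3, up to floating-point slack."""
    return all(t <= (eps / 3) * (1 + slack) for t in level2_terms(eps, L, M))


def delta_for_example(eps):
    """A delta for the assumed instance f(x) = 2x, g(x) = x + 1, a = 1."""
    tol_f, tol_g = level3_tolerances(eps, 2, 2)
    # Here |f(x) - 2| = 2|x - 1| and |g(x) - 2| = |x - 1|.
    return min(tol_f / 2, tol_g)


def example_holds(eps, samples=1000):
    """Sample points with 0 < |x - 1| < delta and test |f(x)g(x) - LM| < eps."""
    delta = delta_for_example(eps)
    for k in range(1, samples):
        for sign in (-1, 1):
            x = 1 + sign * delta * k / samples
            if abs((2 * x) * (x + 1) - 4) >= eps:
                return False
    return True


print(terms_fit(0.3, 0, 5), example_holds(0.3))
```

Note that the case L = 0, which caused the bug discussed in the elevator, is handled without special treatment because the tolerances use 1 + |L| rather than |L|.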


REMARK. As seen from this example, structural proofs are longer to deliver, but (I believe) 
shorter to digest. In fact, they are longer because they contain more information (namely, the 
structure of the proof), and it is this very information which makes them more learnable,
illuminating and human. Thus in switching to structured proofs we simply agree to share with our
students (or readers, etc.) more of what we know about the proof. And it is my belief that the loss 
in economy is more than balanced by the gain in learning. 


2.3. A High-School Level Problem... 
We next take up the following ruler-and-compass construction problem: 


FIG. 2. A construction problem.


Construct a tangent to a given circle from a given point outside the circle. 
The linear description of the construction is as follows (refer to Fig. 3). 


FIG. 3. The construction.


Draw the line segment OA and find its midpoint C. Around C as center draw a circle with 
radius |OC|. Let M be one point of intersection of the two circles. Then AM is the required 
tangent. 

In general, the solution of construction problems in school usually proceeds in three stages. 
First, an analysis of the problem is carried out and the construction is discovered; second, the 
construction is described formally (in reverse order!); third, a proof is given that the construction 
indeed meets all the requirements. 

In the structural presentation, all three stages are combined in one coherent process, thus again 
uniting the human (the first stage) with the mathematical (the second and third). 


A Structured Proof. 


Level 1. We find (in Level 2) a point M on the circle so that the line AM is perpendicular to the
radius OM. Then (by a well-known theorem) AM is a tangent, as required.


In the Elevator. How shall we find the point M? We follow a well-known heuristic principle ([9,
Ch. 1]): Look at the two properties determining M and construct the two corresponding loci; then 
construct M as the intersection of these two loci. 


Level 2. Let M be a point of intersection of the given circle with the locus of points where the 
segment OA subtends a right angle. That is, M is a point of intersection of the given circle and the 
circle (constructed in Level 3) having OA as diameter.


Level 3. To construct the circle with segment OA as diameter, let C be the midpoint of segment
OA, then draw the circle with C as center and |OC| as radius. Clearly OA is a diameter in this
circle, as required. 
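
The three levels translate directly into coordinates. The following sketch computes the point M of Level 2 as an intersection of the given circle with the circle on OA as diameter, and then measures how far AM is from being perpendicular to OM, which is the property Level 1 relies on. The coordinate representation, the function names, and the sample data are assumptions made for illustration only.

```python
from math import hypot, sqrt


def tangent_point(O, r, A):
    """Return one intersection M of the given circle (center O, radius r)
    with the circle having OA as diameter; A must lie outside the circle."""
    ox, oy = O
    ax, ay = A
    d = hypot(ax - ox, ay - oy)  # |OA|
    if d <= r:
        raise ValueError("A must lie outside the given circle")
    ux, uy = (ax - ox) / d, (ay - oy) / d  # unit vector from O toward A
    # The two circles (radii r and |OC| = d/2, centers d/2 apart) meet on a
    # chord perpendicular to OA at distance r^2 / d from O.
    along = r * r / d
    across = sqrt(r * r - along * along)
    return (ox + along * ux - across * uy, oy + along * uy + across * ux)


def perpendicularity_defect(O, A, M):
    """Dot product of OM and AM; it vanishes when AM is perpendicular to OM."""
    return (M[0] - O[0]) * (M[0] - A[0]) + (M[1] - O[1]) * (M[1] - A[1])


O, r, A = (0.0, 0.0), 1.0, (2.0, 0.0)
M = tangent_point(O, r, A)
print(M, perpendicularity_defect(O, A, M))
```

The second point of intersection, and hence the second tangent, is obtained by reversing the sign of `across`.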


2.4. ... And a More Advanced One. The last example represented what seems, more or less, the 
lowest level of complexity in which arranging the proof in levels is still advantageous. While there 
is no corresponding upper limit (the more complex the proof, the more helpful is a structural 
presentation), we are nonetheless limited in our choice of case studies by the scope of this paper. 

Pushing towards this limit, we now take up a theorem of central importance in linear algebra, 
namely the theorem on canonical forms for Hermitian operators and matrices. The theorem has 
several variants (the best known of which being, perhaps, the one on orthogonal diagonalization 
of symmetric matrices), and is a close relative of the Spectral Theorem. To save space, we state it 
directly in its more technical, ready-to-prove form. 


THEOREM. Let V be a finite-dimensional complex inner-product space, and let T be a Hermitian 
operator on V. Then V has an orthonormal basis of characteristic vectors of T. 


(Recall that T is Hermitian if (Tu, v) = (u, Tv) for all vectors u and v in V, where (x, y) 
denotes the inner product of x and y.) 

The following proof is self-contained modulo the standard material on operators and inner- 
product spaces; that is, it assumes no knowledge of Hermitian operators beyond the definition. 
See, e.g., [2], [3], or [4] for the standard facts, as well as for standard proofs of the theorem. 
Usually, the contents of this proof are spread out over several lemmas and theorems, thus 
achieving a measure of structuring even in the standard presentations. The structuring, however, is 
incomplete and the presentation proceeds bottom-up; that is, the auxiliary results (lower levels) 
appear first and the main argument (top level) last. (Note, however, the top-down commentary in 
[3, p. 301].) 


A Structured Proof. 


Level 1. Let U be the subspace of V spanned by all the characteristic vectors of T. We prove 
two assertions about U: 

(a) U = V, i.e., the characteristic vectors of T span the whole space (Level 2.1);

(b) U has an orthonormal basis of characteristic vectors of T (Level 2.2). 

Clearly, these two assertions yield the conclusion of the theorem. 


REMARK. From here on the proof branches to two independent subproofs, rooted at 2.1 and 
2.2. Note that while the presentation proceeds strictly top-down, it does not force a top-down 
reading of the proof. Some readers may prefer, for example, to read through the first branch (2.1) 
all the way to the bottom, then return to Level 2 and start on the second branch (2.2). 

In what follows we have added a number (in parentheses) to the title of each module. This 
number is a back-reference to the “calling” module, i.e., the one where the present module is 
referenced. A more compact representation of the interconnections within the hierarchy is given in 
the structure diagram (FIG. 4). 


Level 2.1 (1). To prove $U = V$ we prove the equivalent statement $U^\perp = \{0\}$, where $U^\perp$ is the
orthogonal complement of U. This in turn will follow from the following two assertions:

(c) $U^\perp$ is T-invariant (Level 3.1).

(d) Every nonzero T-invariant subspace of V contains a characteristic vector of T (Level 3.2).

Since $U^\perp$ cannot contain a characteristic vector of T (this would contradict $U \cap U^\perp = \{0\}$), we
must have $U^\perp = \{0\}$, hence $U = V$.


Level 2.2 (1). Let $\lambda_1, \ldots, \lambda_k$ be the distinct characteristic values of T, and let $U_1, \ldots, U_k$ be
their respective characteristic spaces:

$$U_i = \{v \in V \mid Tv = \lambda_i v\}.$$

By the Gram-Schmidt orthogonalization process we can construct an orthonormal basis $B_i$ for each
subspace $U_i$, and it turns out that without any more care on our part, the set $B = B_1 \cup \cdots \cup B_k$
already forms an orthonormal basis for V. This is due to the following important lemma (proved
in Level 3.3):

Characteristic vectors of T that belong to distinct characteristic values are orthogonal.

Thus if $i \ne j$, $B_i$ and $B_j$ are orthogonal, hence B is indeed an orthonormal basis for U. Since B
consists of characteristic vectors of T, it satisfies all the requirements of Level 1 (b).


Level 3.1 (2.1). $U^\perp$ is T-invariant.


Level 3.2 (2.1). If W is a nonzero T-invariant subspace of V, then W contains a characteristic 
vector of T. 


Level 3.3 (2.2). LEMMA. Let T be a Hermitian operator. Then 
(a) All the (complex) characteristic values of T are real. 
(b) Characteristic vectors of T that belong to distinct characteristic values are orthogonal. 


We leave out the proofs of 3.1, 3.2 and 3.3, since Level 3 is the bottom level and the proofs
appear in it in their standard (linear) form (see, e.g., [4, pp. 312-313]). Note, however, that it is
only in 3.1 and 3.3 that the main hypothesis of the Theorem (namely that T is Hermitian) finally
enters the proof. In contrast, 3.2 is true for operators on complex spaces in general.
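
Although the proofs of Level 3 are omitted, the content of Lemma 3.3 is easy to observe in the smallest nontrivial case. The sketch below treats a 2 × 2 Hermitian matrix [[a, b], [conj(b), d]] with real a, d and complex b ≠ 0, using the closed-form roots of its characteristic polynomial; the helper names and the sample entries are assumptions for illustration, and the restriction to 2 × 2 matrices is a simplification, not the general argument.

```python
from math import sqrt


def inner(u, v):
    """Inner product on C^2, linear in the first argument."""
    return sum(x * y.conjugate() for x, y in zip(u, v))


def apply(a, b, d, v):
    """Apply T = [[a, b], [conj(b), d]] to the vector v."""
    return (a * v[0] + b * v[1], b.conjugate() * v[0] + d * v[1])


def eigenpairs(a, b, d):
    """Characteristic values and vectors of T, for real a, d and complex b != 0."""
    mean = (a + d) / 2
    # The quantity under the root is a sum of squares, hence nonnegative:
    # this is where the Hermitian hypothesis forces real characteristic values.
    radius = sqrt(((a - d) / 2) ** 2 + abs(b) ** 2)
    pairs = []
    for lam in (mean - radius, mean + radius):
        # First row of (T - lam I)v = 0 reads (a - lam) v1 + b v2 = 0.
        v = (b, complex(lam - a))
        pairs.append((lam, v))
    return pairs


a, b, d = 2.0, 1 + 1j, 3.0
(lam1, v1), (lam2, v2) = eigenpairs(a, b, d)
print(lam1, lam2)           # two distinct real characteristic values
print(abs(inner(v1, v2)))   # numerically zero: the vectors are orthogonal
```

Normalizing the two vectors then gives an orthonormal basis of characteristic vectors, as the Theorem asserts for this small case.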


REMARK. There is a similar theorem for normal operators, and the similarity is 3-level deep, 
counting the theorem itself as level 0 of the proof. Since 3.2 is true for operators on complex 
spaces in general, only 3.1 and 3.3 need to be adjusted. 

We conclude this subsection with a structure diagram of the proof (Fig. 4), displaying the 
hierarchy of modules and their interconnections. 
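
The back-references given in parentheses in the module titles already determine the hierarchy, so the diagram can be recovered mechanically from them. As a rough sketch (the dictionary layout and function names are assumptions for illustration, not the author's notation), one can record each module's "calling" module and print the resulting outline:

```python
# Each module of the proof in 2.4, mapped to the module that "calls" it,
# exactly as recorded by the numbers in parentheses in the module titles.
CALLER = {
    "1": None,
    "2.1": "1",
    "2.2": "1",
    "3.1": "2.1",
    "3.2": "2.1",
    "3.3": "2.2",
}


def children(module):
    """Modules referenced directly from the given module."""
    return [m for m, caller in CALLER.items() if caller == module]


def outline(module="1", depth=0):
    """Return the hierarchy below `module` as indented lines, top-down."""
    lines = ["  " * depth + "Level " + module]
    for child in children(module):
        lines.extend(outline(child, depth + 1))
    return lines


print("\n".join(outline()))
```

Reading the outline depth-first corresponds to following one branch to the bottom before returning to Level 2, while reading it level by level corresponds to the strictly top-down order of the presentation.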


3. Summary: Benefits of the Structural Method. 


3.1 More Communicative Proofs. The main benefit of presentations in the structural style is, 
hopefully, that ideas behind the formal proofs are better communicated. As we have seen, the 
main idea is given in the top level (or two), auxiliary ideas are packaged in autonomous modules, 
and the interconnections between the separate ideas are made explicit through the structure 
diagram. 

By analyzing the foregoing examples, we can be a bit more specific about what is meant by the 
“main idea” of a proof, and the different treatment given to it by the two methods. The main idea 
often lies in the construction of a new, intermediate object, the pivot, to mediate between the 
hypotheses and the conclusion. (In our examples the pivots are the number M in 2.1, the numbers 
$\eta$, $\delta$, $\delta_i$ in 2.2, the point M in 2.3, and the orthonormal basis of characteristic vectors, B, in 2.4.)
Since the pivot occupies a central position in the proof and is directly connected to all its parts, it 
offers a vantage point from which to view the global architecture of the proof; and precisely this 
view is given in the top level of the structured proof (cf. the examples). Here the pivot is 
introduced by a statement of its essential properties and is immediately used to derive the 
conclusion of the theorem. The detailed definition, as well as proof of the postulated properties 
and questions of existence, are all pushed down to lower levels. 

In the linear approach the pivot is treated poorly (from the learner’s point of view) and its 
potential for revealing the architecture of the proof is wasted. Rather to the contrary, it is here 
where the proof most resembles pulling a rabbit from the hat (cf., e.g., the introduction of the 
number $\eta$ in the linear proof of 2.2 or, for an extreme example, the element f(a, b, c) in [5, p.
274]). The pivot is usually introduced near the beginning of the proof by a bare statement of its 
definition, which often appears extremely bizarre and complicated. Such definitions have an 
intimidating, even paralyzing, effect on many students when introduced too abruptly. The words
of Courant and Robbins, taken from a slightly different context ([1, p. 292]), are appropriate here:
“There is an unfortunate, almost snobbish attitude on the part of some writers of textbooks, who 
present the reader with this definition without a thorough preparation, as though an explanation 
were beneath the dignity of a mathematician.” Note, in contrast, the “untricking” effect a 
structured presentation has on the definition of the pivot: the actual detailed definition is given 
only after its role in the proof and the reasons for its particular form are made clear in the upper 
levels. 

It should be remarked that the term “main idea” is used here more in the sense of an abstract 
or overview, and in no way is it meant to imply that mathematically it is the most important idea 
in the proof. For example, in the proof that there are infinitely many monadic primes (see Remark 
7 of 2.1) the main mathematical idea may well be to take $M = (2p_1 p_2 \cdots p_n)^2 + 1$, which is
relatively a low-level idea. Still, I would maintain that awareness of the main idea in the above 
sense is important for an insightful learning, and is more likely to lead the student away from 
learning proofs and definitions by rote. 


3.2. Learning Activities. 


Several new, structure-related activities are available to the learner, or the instructor, in view of 
the structural method. Here is a sample: 

—Given the higher levels of a proof, complete the lower levels. 

—Take any proof from a standard textbook and find its structure (i.e., arrange the proof in 
levels). This is not easy, but it is highly valuable and rewarding; and it results in a much deeper 
understanding of the proof. (For some examples of this activity, see Section 2.) 

—This is related to the previous one: Given a 10-page proof, describe it in one page. 

—Given two theorems that show some similarity, determine how deep this similarity is; that is,
to how many levels of the proof does the similarity extend, counting from the top. 

All these activities can be used as self-learning activities, or they can be assigned as homework 
or test. Not only do they penetrate deeply into the particular subject matter, but they also 
encourage the learner to reflect on the process of theorem-proving itself. 

An additional benefit of the structural method is that of partial proofs. The situation often 
arises, where presenting or studying a proof to the last detail is impossible or undesirable, due to 
insufficient level of mathematical preparation or motivation on the part of the audience, or simply 
due to time limitations. Consider, for example, the following cases: survey lectures or articles, 
mathematics courses for engineering students, an instructor scanning a pile of textbooks in 
preparation for a course, a mathematician scanning a pile of research articles. In all such cases the 
structural method allows one to conveniently choose a suitable level of detail for the particular 
situation, by simply ignoring (perhaps temporarily) some of the lower levels of the proof. This 
bears some similarity to Polya’s advice concerning the use of “incomplete proofs” ([8, “Why
proofs?”, pp. 219-221]), but note that our partial proofs can always be refined to a complete
proof. 


3.3 Conclusion. Iu. I. Manin has already referred to the human aspect of proofs when he said 
[7, p. 51]: “A good proof is one which makes us wiser.” Clearly, then, a good presentation of a 
proof is one which makes the listener (or reader) wiser. It is my hope that presentations in the 
structural style have an improved chance of passing this difficult test. 


Acknowledgment. I acknowledge gratefully the helpful comments from many colleagues on an earlier version of 
this paper.


WHO GAVE YOU THE EPSILON?
CAUCHY AND THE ORIGINS OF RIGOROUS CALCULUS


JUDITH V. GRABINER 
424 West 7th Street, Claremont, California 91711


Student: The car has a speed of 50 miles an hour. What does that mean?


Teacher: Given any $\varepsilon > 0$, there exists a $\delta$ such that if $|t_2 - t_1| < \delta$, then $\left| \frac{s_2 - s_1}{t_2 - t_1} - 50 \right| < \varepsilon$.


Student: How in the world did anybody ever think of such an answer? 


* * * * * * * * * * * * * 


Perhaps this exchange will remind us that the rigorous basis for the calculus is not at all 
intuitive—in fact, quite the contrary. The calculus is a subject dealing with speeds and distances, 
with tangents and areas—not inequalities. When Newton and Leibniz invented the calculus in the 
late seventeenth century, they did not use delta-epsilon proofs. It took a hundred and fifty years 
to develop them. This means that it was probably very hard, and it is no wonder that a modern 
student finds the rigorous basis of the calculus difficult. How, then, did the calculus get a rigorous 
basis in terms of the algebra of inequalities? 

Delta-epsilon proofs are first found in the works of Augustin-Louis Cauchy (1789-1867). This 
is not always recognized, since Cauchy gave a purely verbal definition of limit, which at first 
glance does not resemble modern definitions: “When the successively attributed values of the
same variable indefinitely approach a fixed value, so that finally they differ from it by as little as
desired, the last is called the limit of all the others” [1]. Cauchy also gave a purely verbal definition
of the derivative of f(x) as the limit, when it exists, of the quotient of differences (f(x + h) — 
f(x))/h when h goes to zero, a statement much like those that had already been made by Newton, 
Leibniz, d’Alembert, Maclaurin, and Euler. But what is significant is that Cauchy translated such 
verbal statements into the precise language of inequalities when he needed them in his proofs. For 
instance, for the derivative [2]: 


(1) Let $\delta$, $\varepsilon$ be two very small numbers; the first is chosen so that for all numerical 
[i.e., absolute] values of $h$ less than $\delta$, and for any value of $x$ included [in the 
interval of definition], the ratio $(f(x + h) - f(x))/h$ will always be greater than 
$f'(x) - \varepsilon$ and less than $f'(x) + \varepsilon$. 

This one example will be enough to indicate how Cauchy did the calculus, because the question 
to be answered in the present paper is not, “how is a rigorous delta-epsilon proof constructed?” 
As Cauchy’s intellectual heirs we all know this. The central question is, how and why was Cauchy 
able to put the calculus on a rigorous basis, when his predecessors were not? The answers to this 
historical question cannot be found by reflecting on the logical relations between the concepts, but 
by looking in detail at the past and seeing how the existing state of affairs in fact developed from 
that past. Thus we will examine the mathematical situation in the seventeenth and eighteenth 
centuries—the background against which we can appreciate Cauchy’s innovation. We will 
describe the powerful techniques of the calculus of this earlier period and the relatively unimpressive 
views put forth to justify them. We will then discuss how a sense of urgency about rigorizing 
analysis gradually developed in the eighteenth century. Most important, we will explain the 
development of the mathematical techniques necessary for the new rigor from the work of men 
like Euler, d’Alembert, Poisson, and especially Lagrange. Finally, we will show how these 
mathematical results, though often developed for purposes far removed from establishing foundations 
for the calculus, were used by Cauchy in constructing his new rigorous analysis. 


The Practice of Analysis: From Newton to Euler. In the late seventeenth century, Newton and 
Leibniz, almost simultaneously, independently invented the calculus. This invention involved three 
things. First, they invented the general concepts of differential quotient and integral (these are 
Leibniz’s terms; Newton called the concepts “fluxion” and “fluent”). Second, they devised a 
notation for these concepts which made the calculus an algorithm: the methods not only worked, 
but were easy to use. Their notations had great heuristic power, and we still use Leibniz’s $dy/dx$ 
and $\int y\,dx$, and Newton’s $\dot{x}$, today. Third, both men realized that the basic processes of finding 
tangents and areas, that is, differentiating and integrating, are mutually inverse—what we now 
call the Fundamental Theorem of Calculus. 

Once the calculus had been invented, mathematicians possessed an extremely powerful set of 
methods for solving problems in geometry, in physics, and in pure analysis. But what was the 
nature of the basic concepts? For Leibniz, the differential quotient was a ratio of infinitesimal 
differences, and the integral was a sum of infinitesimals. For Newton, the derivative, or fluxion, 
was described as a rate of change; the integral, or fluent, was its inverse. In fact, throughout the 
eighteenth century, the integral was generally thought of as the inverse of the differential. One 
might imagine asking Leibniz exactly what an infinitesimal was, or Newton what a rate of change 
might be. Newton’s answer, the best of the eighteenth century, is instructive. Consider a ratio of 
finite quantities (in modern notation, $(f(x + h) - f(x))/h$ as $h$ goes to zero). The ratio 
eventually becomes what Newton called an “ultimate ratio.” Ultimate ratios are “limits to which 
the ratios of quantities decreasing without limit do always converge, and to which they approach 
nearer than by any given difference, but never go beyond, nor ever reach until the quantities 
vanish” [3]. Except for “reaching” the limit when the quantities vanish, we can translate Newton’s 
words into our algebraic language. Newton himself, however, did not do this, nor did most of his 
followers in the eighteenth century. Moreover, “never go beyond” does not allow a variable to 
oscillate about its limit. Thus, though Newton’s is an intuitively pleasing picture, as it stands it 
was not and could not be used for proofs about limits. The definition sounds good, but it was not 
understood or applied in algebraic terms. 

But most eighteenth-century mathematicians would object, “Why worry about foundations?” 
In the eighteenth century, the calculus, intuitively understood and algorithmically executed, was 
applied to a wide range of problems. For instance, the partial differential equation for vibrating 
strings was solved; the equations of motion for the solar system were solved; the Laplace 
transform and the calculus of variations and the gamma function were invented and applied; all 
of mechanics was worked out in the language of the calculus. These were great achievements on 
the part of eighteenth-century mathematicians. Who would be greatly concerned about foundations 
when such important problems could be successfully treated by the calculus? Results were 
what counted. 

This point will be better appreciated by looking at an example which illustrates both the 
“uncritical” approach to concepts of the eighteenth century and the immense power of 
eighteenth-century techniques, from the work of the great master of such techniques: Leonhard Euler. 
The problem is to find the sum of the series 

\[ \frac{1}{1} + \frac{1}{4} + \frac{1}{9} + \cdots + \frac{1}{k^2} + \cdots. \]

It clearly has a finite sum since it is bounded above by the series 

\[ 1 + \frac{1}{1 \cdot 2} + \frac{1}{2 \cdot 3} + \frac{1}{3 \cdot 4} + \cdots + \frac{1}{(k-1) \cdot k} + \cdots, \]

whose sum was known to be 2; Johann Bernoulli had found this sum by treating 
$1/(1 \cdot 2) + 1/(2 \cdot 3) + 1/(3 \cdot 4) + \cdots$ as the difference between the series $1/1 + 1/2 + 1/3 + \cdots$ and the 
series $1/2 + 1/3 + 1/4 + \cdots$, and observing that this difference telescopes [4]. 

Euler’s summation of $\sum_{k=1}^{\infty} 1/k^2$ makes use of a lemma from the theory of equations: given a 
polynomial equation whose constant term is one, the coefficient of the linear term is the sum 
of the reciprocals of the roots with the signs changed. This result was both discovered and 
demonstrated by considering the equation $(x - a)(x - b) = 0$, having roots $a$ and $b$. Multiplying 
and then dividing out $ab$, we obtain 

\[ (1/ab)x^2 - (1/a + 1/b)x + 1 = 0; \]

the result is now obvious, as is the extension to equations of higher degree. 


Euler’s solution then considers the equation $\sin x = 0$. 
Expanding this as an infinite series, Euler obtained 

\[ x - x^3/3! + x^5/5! - \cdots = 0. \]

Dividing by $x$ yields 

\[ 1 - x^2/3! + x^4/5! - \cdots = 0. \]

Finally, substituting $x^2 = u$ produces 

\[ 1 - u/3! + u^2/5! - \cdots = 0. \]


But Euler thought that power series could be manipulated just like polynomials. Thus, we now 
have a polynomial equation in u, whose constant term is one. Applying the lemma to it, the 
coefficient of the linear term with the sign changed is 1/3! = 1/6. The roots of the equation in u 
are the roots of $\sin x = 0$ with the substitution $u = x^2$, namely $\pi^2, 4\pi^2, 9\pi^2, \ldots$. Thus the lemma 
implies 

\[ 1/6 = 1/\pi^2 + 1/(4\pi^2) + 1/(9\pi^2) + \cdots. \]

Multiplying by $\pi^2$ yields the sum of the original series [5]: 

\[ 1/1 + 1/4 + 1/9 + \cdots + 1/k^2 + \cdots = \pi^2/6. \]
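
Euler's value can be checked numerically, in the spirit of the remark that such results "could readily be verified." The following Python sketch is a modern illustration, not part of Euler's argument; the function names are chosen here for the example. It compares partial sums of the series with $\pi^2/6$ and with Bernoulli's telescoping bound.

```python
import math


def basel_partial_sum(n):
    """Sum of 1/k^2 for k = 1, ..., n."""
    return sum(1.0 / (k * k) for k in range(1, n + 1))


def bernoulli_bound_partial_sum(n):
    """1 + 1/(1*2) + 1/(2*3) + ... + 1/((n-1)*n), which telescopes to 2 - 1/n."""
    return 1.0 + sum(1.0 / ((k - 1) * k) for k in range(2, n + 1))


target = math.pi ** 2 / 6
for n in (10, 100, 1000, 10000):
    s = basel_partial_sum(n)
    # Columns: n, partial sum, gap to pi^2/6, partial sum of the bounding series.
    print(n, s, target - s, bernoulli_bound_partial_sum(n))
```

The gap to $\pi^2/6$ after n terms is smaller than 1/n, so the convergence is slow but visible.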


Though it is easy to criticize eighteenth-century arguments like this for their lack of rigor, it is 
also unfair. Foundations, precise specifications of the conditions under which such manipulations 
with infinites or infinitesimals were admissible, were not very important to men like Euler, 
because without such specifications they made important new discoveries, whose results in cases 
like this could readily be verified. When the foundations of the calculus were discussed in the 
eighteenth century, they were treated as secondary. Discussions of foundations appeared in the 
introductions to books, in popularizations, and in philosophical writings, and were not—as they 
are now and have been since Cauchy’s time—the subject of articles in research-oriented journals. 

Thus, where we once had one question to answer, we now have two. The first remains, where 
do Cauchy’s rigorous techniques come from? Second, one must now ask, why rigorize the calculus 
in the first place? If few mathematicians were very interested in foundations in the eighteenth 
century [6], then when, and why, were attitudes changed? 

Of course, to establish rigor, it is necessary (though not sufficient) to think rigor is significant. 
But more important, to establish rigor, it is necessary (though also not sufficient) to have a 
set of techniques in existence which are suitable for that purpose. In particular, if the calculus is to 
be made rigorous by being reduced to the algebra of inequalities, one must have both the algebra 
of inequalities, and facts about the concepts of the calculus that can be expressed in terms of the 
algebra of inequalities. 

In the early nineteenth century, three conditions held for the first time: Rigor was considered 
important; there was a well-developed algebra of inequalities; and, certain properties were known 
about the basic concepts of analysis (limits, convergence, continuity, derivatives, 
integrals), properties which could be expressed in the language of inequalities if desired. Cauchy, 
followed by Riemann and Weierstrass, gave the calculus a rigorous basis, using the already-existing 
algebra of inequalities, and built a logically-connected structure of theorems about the 
concepts of the calculus. It is our task to explain how these three conditions—the developed 
algebra of inequalities, the importance of rigor, the appropriate properties of the concepts of the 
calculus—came to be. 


The Algebra of Inequalities. Today, the algebra of inequalities is studied in calculus courses 
because of its use as a basis for the calculus, but why should it have been studied in the eighteenth 
century when this application was unknown? In the eighteenth century, inequalities were important 
in the study of a major class of results: approximations. For example, consider an 
equation such as $(x + 1)^{\mu} = a$, for $\mu$ not an integer. Usually $a$ cannot be found exactly, but it can 
be approximated by an infinite series. In general, given some number n of terms of such an 
approximating series, eighteenth-century mathematicians sought to compute an upper bound on 
the error in the approximation—that is, the difference between the sum of the series and the nth 
partial sum. This computation was a problem in the algebra of inequalities. Jean d’Alembert 
solved it for the important case of the binomial series; given the number of terms of the series n, 
and assuming implicitly that the series converges to its sum, he could find the bounds on the 
error—that is, on the remainder of the series after the nth term—by bounding the series above 
and below with convergent geometric progressions [7]. Similarly, Joseph-Louis Lagrange invented 
a new approximation method using continued fractions and, by extremely intricate inequality- 
calculations, gave necessary and sufficient conditions for a given iteration of the approximation to 
be closer to the result than the previous iteration [8]. Lagrange also derived the Lagrange 
remainder of the Taylor series [9], using an inequality which bounded the remainder above and 
below by the maximum and minimum values of the nth derivative and then applying the 
intermediate-value theorem for continuous functions. Thus through such eighteenth-century work 
[10], there was by the end of the eighteenth century a developed algebra of inequalities, and people 
used to working with it. Given an $n$, these people were used to finding an error, that is, an epsilon. 
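
The kind of error estimate described above can be sketched in modern terms. The Python example below is a simplified illustration and not d'Alembert's own computation: it bounds the remainder of the binomial series for $(1 + x)^p$ by a convergent geometric progression, assuming $|x| < 1$ and $n \ge p \ge -1$. The function names and the sample values of p, x, and n are assumptions made for the example.

```python
def binomial_terms(p, x, n):
    """Return the terms a_0, ..., a_n of the binomial series for (1 + x)**p."""
    terms = [1.0]
    for k in range(n):
        # Recurrence: a_{k+1} = a_k * (p - k) / (k + 1) * x
        terms.append(terms[-1] * (p - k) / (k + 1) * x)
    return terms


def partial_sum_with_bound(p, x, n):
    """Return (a_0 + ... + a_{n-1}, bound on the remainder after those n terms).

    Assumes |x| < 1 and n >= p >= -1. Under these assumptions each term from
    a_n onward is at most |x| times the previous one in absolute value, so the
    tail is dominated by the geometric progression |a_n| * (1 + |x| + |x|^2 + ...).
    """
    terms = binomial_terms(p, x, n)
    partial = sum(terms[:n])
    bound = abs(terms[n]) / (1.0 - abs(x))
    return partial, bound


partial, bound = partial_sum_with_bound(0.5, 0.5, 5)
# Columns: partial sum, guaranteed error bound, actual error.
print(partial, bound, abs(1.5 ** 0.5 - partial))
```

Given the number of terms n, the function returns an error bound: an epsilon computed from n, which is the direction of reasoning the text attributes to the eighteenth century.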


Changing Attitudes toward Rigor. Mathematicians were much more interested in finding 
rigorous foundations for the calculus in 1800 than they had been a hundred years before. There 
are many reasons for this: no one enough by itself, but apparently sufficient when acting together. 
Of course one might think that eighteenth-century mathematicians were always making errors 
because of the lack of an explicitly-formulated rigorous foundation. But this did not occur. They 
were usually right, and for two reasons. One is that if one deals with real variables, functions of 
one variable, series which are power series, and functions arising from physical problems, errors 
will not occur too often. A second reason is that mathematicians like Euler and Laplace had a 
deep insight into the basic properties of the concepts of the calculus, and were able to choose 
fruitful methods and evade pitfalls. The only “error” they committed was to use methods that 
shocked mathematicians of later ages who had grown up with the rigor of the nineteenth century. 

What then were the reasons for the deepened interest in rigor? One set of reasons was 
philosophical. In 1734, the British philosopher Bishop Berkeley had attacked the calculus on the 
ground that it was not rigorous. In The Analyst, or a Discourse Addressed to an Infidel 
Mathematician, he said that mathematicians had no business attacking the unreasonableness of 
religion, given the way they themselves reasoned. He ridiculed fluxions (“velocities of evanescent 
increments”), calling the evanescent increments “ghosts of departed quantities” [11]. Even more 
to the point, he correctly criticized a number of specific arguments from the writings of his 
mathematical contemporaries. For instance, he attacked the process of finding the fluxion (our 
derivative) by reviewing the steps of the process: if we consider $y = x^2$, taking the ratio of the 
differences $((x + h)^2 - x^2)/h$, then simplifying to $2x + h$, then letting $h$ vanish, we obtain $2x$. 
But is h zero? If it is, we cannot meaningfully divide by it; if it is not zero, we have no right to 
throw it away. As Berkeley put it, the quantity we have called h “might have signified either an 
increment or nothing. But then, which of these soever you make it signify, you must argue 
consistently with such its signification” [12]. 

Since an adequate response to Berkeley’s objections would have involved recognizing that an 
equation involving limits is a shorthand expression for a sequence of inequalities—a subtle and 
difficult idea—no eighteenth-century analyst gave a fully adequate answer to Berkeley. However, 
many tried. Maclaurin, d’Alembert, Lagrange, Lazare Carnot, and possibly Euler, all knew about 
Berkeley’s work, and all wrote something about foundations. So Berkeley did call attention to the 
question. However, except for Maclaurin, no leading mathematician spent much time on the 
question because of Berkeley’s work, and even Maclaurin’s influence lay in other fields. 

Another factor contributing to the new interest in rigor was that there was a limit to the 
number of results that could be reached by eighteenth-century methods. Near the end of the 
century, some leading mathematicians had begun to feel that this limit was at hand. D’Alembert 
and Lagrange indicate this in their correspondence, with Lagrange calling higher mathematics 
“decadent” [13]. The philosopher Diderot went so far as to claim that the mathematicians of the 
eighteenth century had “erected the pillars of Hercules” beyond which it was impossible to go 
[14]. Thus, there was a perceived need to consolidate the gains of the past century. 

Another “factor” was Lagrange, who became increasingly interested in foundations, and, 
through his activities, interested other mathematicians. In the eighteenth century, scientific 
academies offered prizes for solving major outstanding problems. In 1784, Lagrange and his 
colleagues posed the problem of foundations of the calculus as the Berlin Academy’s prize 
problem. Nobody solved it to Lagrange’s satisfaction, but two of the entries in the competition 
were later expanded into full-length books, the first on the Continent, on foundations: Simon 
L’Huilier’s Exposition élémentaire des principes des calculs supérieurs, Berlin, 1787, and Lazare 
Carnot’s Réflexions sur la métaphysique du calcul infinitésimal, Paris, 1797. Thus Lagrange clearly 
helped revive interest in the problem. 

Lagrange’s interest stemmed in part from his respect for the power and generality of algebra; 
he wanted to gain for the calculus the certainty he believed algebra to possess. But there was 
another factor increasing interest in foundations, not only for Lagrange, but for many other 
mathematicians by the end of the eighteenth century: the need to teach. Teaching forces one’s 
attention to basic questions. Yet before the mid-eighteenth century, mathematicians had often 
made their living by being attached to royal courts. But royal courts declined; the number of 
mathematicians increased; and mathematics began to look useful. First in military schools and 
later on at the Ecole Polytechnique in Paris, another line of work became available: teaching 
mathematics to students of science and engineering. The Ecole Polytechnique was founded by the 
French revolutionary government to train scientists, who, the government believed, might prove 
useful to a modern state. And it was as a lecturer in analysis at the Ecole Polytechnique that 
Lagrange wrote his two major works on the calculus which treated foundations; similarly, it was 
40 years earlier, teaching the calculus at the Military Academy at Turin, that Lagrange had first 
set out to work on the problem of foundations. Because teaching forces one to ask basic questions 
about the nature of the most important concepts, the change in the economic circumstances of 
mathematicians (the need to teach) provided a catalyst for the crystallization of the foundations 
of the calculus out of the historical and mathematical background. In fact, even well into the 
nineteenth century, much of foundations was born in the teaching situation; Weierstrass’s 
foundations come from his lectures at Berlin; Dedekind first thought of the problem of continuity 
while teaching at Zurich; Dini and Landau turned to foundations while teaching analysis; and, 
most important for our present purposes, so did Cauchy. Cauchy’s foundations of analysis appear 
in the books based on his lectures at the Ecole Polytechnique; his book of 1821 was the first 
example of the great French tradition of Cours d’analyse. 


The Concepts of the Calculus. Arising from algebra, the algebra of inequalities was now there 
for the calculus to be reduced to; the desire to make the calculus rigorous had arisen through 
consolidation, through philosophy, through teaching, through Lagrange. Now let us turn to the 
mathematical substance of eighteenth-century analysis, to see what was known about the concepts 
of the calculus before Cauchy, and what he had to work out for himself, in order to define, and 
prove theorems about, limit, convergence, continuity, derivatives, and integrals. 

First, consider the concept of limit. As we have already pointed out, since Newton the limit had 
been thought of as a bound which could be approached closer and closer, though not surpassed. 
By 1800, with the work of L’Huilier and Lacroix on alternating series, the restriction that the limit 
be one-sided had been abandoned. Cauchy systematically translated this refined limit-concept 
into the algebra of inequalities, and used it in proofs once it had been so translated; thus he gave 
reality to the oft-repeated eighteenth-century statement that the calculus could be based on limits. 

For example, consider the concept of convergence. Maclaurin had said already that the sum of 
a series was the limit of the partial sums. For Cauchy, this meant something precise. It meant that, 
given an $\varepsilon$, one could find $n$ such that, for more than $n$ terms, the sum of the infinite series is 
within $\varepsilon$ of the $n$th partial sum. That is the reverse of the error-estimating procedure that 
d’Alembert had used. From his definition of a series having a sum, Cauchy could prove that a 
geometric progression with ratio less in absolute value than 1 converged to its usual sum. As we 
have said, d’Alembert had shown that the binomial series for, say, $(1 + x)^{p/q}$ could be bounded 
above and below by convergent geometric progressions. Cauchy assumed that if a series of 
positive terms is bounded above, term-by-term, by a convergent geometric progression, then it 
converges; he then used such comparisons to prove a number of tests for convergence: the root 
test, the ratio test, the logarithm test. The treatment is quite elegant [15]. Taking a technique used 
a few times by men like d’Alembert and Lagrange on an ad hoc basis in approximations, and 
using the definition of the sum of a series based on the limit-concept, Cauchy created the first 
rigorous theory of convergence. 
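
The reversal described above (from "given n, find the error" to "given an epsilon, find n") can be made concrete for the geometric progression. The sketch below is a modern illustration under the assumption $|r| < 1$, using the closed form $|r|^n / |1 - r|$ for the distance between the nth partial sum and the sum $1/(1 - r)$; the function names are hypothetical.

```python
def geometric_partial_sum(r, n):
    """1 + r + r^2 + ... + r^(n-1)."""
    return sum(r ** k for k in range(n))


def terms_needed(r, eps):
    """Smallest n such that the n-th partial sum is within eps of 1/(1 - r).

    Since |r|^n / |1 - r| decreases as n grows, every later partial sum is
    within eps as well.
    """
    if not abs(r) < 1:
        raise ValueError("the geometric progression converges only for |r| < 1")
    n = 0
    gap = 1.0 / abs(1.0 - r)  # distance from the empty sum 0 to the limit
    while gap >= eps:
        n += 1
        gap *= abs(r)
    return n


r, eps = 0.5, 1e-3
n = terms_needed(r, eps)
print(n, abs(1.0 / (1.0 - r) - geometric_partial_sum(r, n)))
```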

Let us now turn to the concept of continuity. Cauchy gave essentially the modern definition of 
continuous function, saying that the function f(x) is continuous on a given interval if for each x 
in that interval “the numerical [i.e., absolute] value of the difference $f(x + \alpha) - f(x)$ decreases 
indefinitely with $\alpha$” [16]. He used this definition in proving the intermediate-value theorem for 
continuous functions [17]. The proof proceeds by examining a function $f(x)$ on an interval, say 
$[b, c]$, where $f(b)$ is negative, $f(c)$ positive, and dividing the interval $[b, c]$ into $m$ parts of width 
$h = (c - b)/m$. Cauchy considered the signs of the values $f(b), f(b + h), \ldots, 
f(b + (m - 1)h), f(c)$; unless one of the values of $f$ is zero, there are two values of $x$ differing by 
$h$ such that $f$ is negative at one, positive at the other. Repeating this process for new intervals of 
width $(c - b)/m, (c - b)/m^2, \ldots$, gives an increasing sequence of values of $x$: $b, b_1, b_2, \ldots$ for 
which $f$ is negative, and a decreasing sequence of values of $x$: $c, c_1, c_2, \ldots$ for which $f$ is positive, 
and such that the difference between $b_k$ and $c_k$ goes to zero. Cauchy asserted that these two 
sequences must have a common limit $a$. He then argued that since $f(x)$ is continuous, the 
sequence of negative values $f(b_k)$ and of positive values $f(c_k)$ both converge toward the common 
limit $f(a)$, which must therefore be zero. 
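
The subdivision in Cauchy's proof can be read as a procedure for locating the zero. The following Python sketch follows the steps as described above; it is a modern rendering, not Cauchy's text, and the function name, the default m = 10, the tolerance, and the round limit are assumptions made for the example.

```python
def cauchy_root(f, b, c, m=10, tol=1e-12, max_rounds=200):
    """Approximate a zero of a continuous f on [b, c], given f(b) < 0 < f(c).

    Each round divides [b, c] into m parts of width h = (c - b)/m and keeps a
    part on which f is negative at the left end and positive at the right end.
    """
    if not (f(b) < 0 < f(c)):
        raise ValueError("need f(b) < 0 < f(c)")
    for _ in range(max_rounds):
        if c - b <= tol:
            break
        h = (c - b) / m
        points = [b + i * h for i in range(m)] + [c]
        for left, right in zip(points, points[1:]):
            f_left, f_right = f(left), f(right)
            if f_left == 0:
                return left
            if f_right == 0:
                return right
            if f_left < 0 < f_right:
                # New, shorter interval with the same sign pattern at its ends.
                b, c = left, right
                break
    return (b + c) / 2


print(cauchy_root(lambda x: x * x - 2, 1.0, 2.0))
```

The successive left endpoints play the role of $b, b_1, b_2, \ldots$ and the right endpoints that of $c, c_1, c_2, \ldots$; the existence of their common limit is exactly what the proof takes for granted.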

Cauchy’s proof involves an already existing technique, which Lagrange had applied in 
approximating real roots of polynomial equations. If a polynomial was negative for one value of 
the variable, positive for another, there was a root in between, and the difference between those 
two values of the variable bounded the error made in taking either as an approximation to the 
root [18]. Thus again we have the algebra of inequalities providing a technique which Cauchy 
transformed from a tool of approximation to a tool of rigor. 

It is worth remarking at this point that Cauchy, in his treatment both of convergence and of 
continuity, implicitly assumed various forms of the completeness property for the real numbers. 
For instance, he treated as obvious that a series of positive terms, bounded above by a convergent 
geometric progression, converges; also, his proof of the intermediate-value theorem assumes that a 
bounded monotone sequence has a limit. While Cauchy was the first systematically to exploit 
inequality proof techniques to prove theorems in analysis, he did not identify all the implicit 
assumptions about the real numbers that such inequality techniques involve. Similarly, as the 
reader may have already noticed, Cauchy’s definition of continuous function does not distinguish 
between what we now call pointwise and uniform continuity; also, in treating series of functions, 
Cauchy did not distinguish between pointwise and uniform convergence. The verbal formulations 
like “for all” that are involved in choosing deltas did not distinguish between “for any epsilon and 
for all x” and “for any x, given any epsilon” [19]. Nor was it at all clear in the 1820’s how much 
depended on this distinction, since proofs about continuity and convergence were in themselves so 
novel. We shall see the same confusion between uniform and pointwise convergence as we turn 
now to Cauchy’s theory of the derivative. 

Again we begin with an approximation. Lagrange gave the following inequality about the 
derivative: 


(2)    $f(x + h) = f(x) + h f'(x) + hV,$ 


where $V$ goes to 0 with $h$. He interpreted this to mean that, given any $D$, one can find $h$ 
sufficiently small so that V is between — D and +D [20]. Clearly this is equivalent to (1) above, 
Cauchy’s delta-epsilon characterization of the derivative. But how did Lagrange obtain this result? 
The answer is surprising; for Lagrange, formula (2) was a consequence of Taylor’s theorem. 
Lagrange believed that any function (that is, any analytic expression, whether finite or infinite, 
involving the variable) had a unique power-series expansion (except possibly at a finite number of 
isolated points). This is because he believed that there was an “algebra of infinite series,” an 
algebra exemplified by work of Euler such as the example we gave above. And Lagrange said that 
the way to make the calculus rigorous was to reduce it to algebra. Although there is no “algebra” 
of infinite series that gives power-series expansions without any consideration of convergence and 
limits, this assumption led Lagrange to define f’(x) without reference to limits, as the coefficient 
of the linear term in h in the Taylor series expansion for f(x + h). Following Euler, Lagrange 
then said that, for any power series in $h$, one could take $h$ sufficiently small so that any given term 
of the series exceeded the sum of all the rest of the terms following it; this approximation, said 
Lagrange, is assumed in applications of the calculus to geometry and mechanics [21]. Applying 
this approximation to the linear term in the Taylor series produces (2), which I call the Lagrange 
property of the derivative. (Like Cauchy’s (1), the inequality-translation Lagrange gives for (2) 
assumes that, given any $D$, one finds $h$ sufficiently small so $|V| < D$ with no mention whatever of 
x.) 
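
The quantity V in property (2) can be computed directly for a particular function. The sketch below is a numerical illustration only; the choice of the exponential function and of the point x = 1 is an assumption made for the example. It solves (2) for V and shows it shrinking with h.

```python
import math


def lagrange_V(f, fprime, x, h):
    """Solve f(x + h) = f(x) + h*f'(x) + h*V for V."""
    return (f(x + h) - f(x)) / h - fprime(x)


x = 1.0
for h in (1e-1, 1e-2, 1e-3, 1e-4):
    # For f = exp, V is roughly e*h/2, so it falls below any given D.
    print(h, lagrange_V(math.exp, math.exp, x, h))
```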

Not only did Lagrange state property (2) and the associated inequalities, he used them as a 
basis for a number of proofs about derivatives: for instance, to prove that a function with positive 
derivative on an interval is increasing there, to prove the mean-value theorem for derivatives, and 
to obtain the Lagrange remainder for the Taylor series. (Details may be found in the works cited 
in [22].) Lagrange also applied his results to characterize the properties of maxima and minima, 
and orders of contact between curves. 

With a few modifications, Lagrange’s proofs are valid—provided that property (2) can be 
justified. Cauchy borrowed and simplified what are in effect Lagrange’s inequality proofs about 
derivatives, with a few improvements, basing them on his own (1). But Cauchy made these proofs 
legitimate because Cauchy defined the derivative precisely to satisfy the relevant inequalities. 
Once again, the key properties come from an approximation. For Lagrange, the derivative was 
exactly, with no epsilons needed, the coefficient of the linear term in the Taylor series; formula (2), 
and the corresponding inequality that $f(x + h) - f(x)$ lies between $h(f'(x) \pm D)$, were 
approximations. Cauchy brought Lagrange’s inequality properties and proofs together with a 
definition of derivative devised to make those techniques rigorously founded [22]. 

The last of the concepts we shall consider, the integral, followed an analogous development. In 
the eighteenth century, the integral was usually thought of as the inverse of the differential. But 
sometimes the inverse could not be computed exactly, so men like Euler remarked that the integral 
could be approximated as closely as one liked by a sum. Of course, the geometric picture of an 
area being approximated by rectangles, or the Leibnizian definition of the integral as a sum, 
suggests this immediately. But what is important for our purposes is that much work was done on 
approximating the values of definite integrals in the eighteenth century, including considerations 
of how small the subintervals used in the sums should be when the function oscillates to a greater 
or lesser extent. For instance, Euler treated sums of the form $\sum_{k=0}^{n-1} f(x_k)(x_{k+1} - x_k)$ as 
approximations to the integral $\int_{x_0}^{x_n} f(x)\,dx$ [23]. 

In 1820, S.-D. Poisson, who was interested in complex integration and therefore more 
concerned than most people about the existence and behavior of integrals, asked the following 
question. If the integral $F$ is defined as the antiderivative of $f$, and if $b - a = nh$, can it be proved 
that $F(b) - F(a) = \int_a^b f(x)\,dx$ is the limit of the sum 


\[ S = hf(a) + hf(a + h) + \cdots + hf(a + (n - 1)h) \]


as h gets small? (S is an approximating sum of the eighteenth-century sort.) Poisson called this 
result “the fundamental proposition of the theory of definite integrals.” He proved it by using 
another inequality-result: the Taylor series with remainder. First, he wrote F(b) — F(a) as the 
telescoping sum 


(3)    $F(a + h) - F(a) + F(a + 2h) - F(a + h) + \cdots + F(b) - F(a + (n - 1)h).$ 


Then, for each of the terms of the form $F(a + kh) - F(a + (k - 1)h)$, Taylor’s series with 
remainder gives, since by definition $F' = f$, 

\[ F(a + kh) - F(a + (k - 1)h) = hf(a + (k - 1)h) + R_k h^{1+w} \]

where $w > 0$, for some $R_k$. Thus the telescoping sum (3) becomes 

\[ hf(a) + hf(a + h) + \cdots + hf(a + (n - 1)h) + (R_1 + \cdots + R_n)h^{1+w}. \]

So $F(b) - F(a)$ and the sum $S$ differ by $(R_1 + \cdots + R_n)h^{1+w}$. Letting $R$ be the maximum value 
for the $R_k$, 

\[ (R_1 + \cdots + R_n)h^{1+w} \le n \cdot R(h^{1+w}) = R \cdot nh \cdot h^w = R(b - a)h^w. \]


Therefore, if h is taken sufficiently small, F(b) — F(a) differs from S by less than any given 
quantity [24]. 
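
Poisson's estimate can be checked on a simple case. The Python sketch below is an illustration under assumptions chosen for the example: f = cos with antiderivative F = sin on [0, 1], and w = 1, so that each $R_k$ is bounded by the maximum of $|F''|/2$, which is at most 1/2 there. It compares the actual difference between $F(b) - F(a)$ and S with the bound $R(b - a)h^w$.

```python
import math


def poisson_sum(f, a, b, n):
    """S = h f(a) + h f(a + h) + ... + h f(a + (n - 1)h), with h = (b - a)/n."""
    h = (b - a) / n
    return h * sum(f(a + k * h) for k in range(n))


a, b = 0.0, 1.0
exact = math.sin(b) - math.sin(a)  # F(b) - F(a) with F = sin, F' = cos
R = 0.5                            # assumed bound on |F''|/2 = |sin|/2 over [a, b]
for n in (10, 100, 1000):
    h = (b - a) / n
    error = abs(exact - poisson_sum(math.cos, a, b, n))
    # Columns: n, actual difference, Poisson-style bound R*(b - a)*h.
    print(n, error, R * (b - a) * h)
```

Both columns shrink in proportion to h, and the actual difference stays below the bound.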

Poisson’s was the first attempt to prove the equivalence of the antiderivative and limit-of-sums 
conceptions of the integral. However, besides the implicit assumptions of the existence of 
antiderivatives and bounded first derivatives for f on the given interval, the proof assumes that the 
subintervals on which the sum is taken are all equal. Should the result not hold for unequal 
divisions also? Poisson thought so, and justified it by saying, “If the integral is represented by the 
area of a curve, this area will be the same, if we divide the difference... into an infinite number of 
equal parts, or an infinite number of unequal parts following any law” [25]. This, however, is an 
assertion, not a proof. And Cauchy saw that a proof was needed. 

Cauchy did not like formalistic arguments in supposedly rigorous subjects, saying that most 
algebraic formulas hold “only under certain conditions, and for certain values of the quantities 
they contain” [26]. In particular, one could not assume that what worked for finite expressions 
automatically worked for infinite ones. Thus, Cauchy showed that the sum of the series 
$1/1 + 1/4 + 1/9 + \cdots$ was $\pi^2/6$ by actually calculating the difference between the $n$th partial 
sum and $\pi^2/6$ and showing that it was arbitrarily small [27]. Similarly, just because there was an 
operation called taking a derivative did not mean that the inverse of that operation always 
produced a result. The existence of the definite integral had to be proved. And how does one 
prove existence in the 1820’s? One constructs the mathematical object in question by using an 
eighteenth-century approximation that converges to it. Cauchy defined the integral as the limit of 
Euler-style sums $\sum f(x_k)(x_{k+1} - x_k)$ for $x_{k+1} - x_k$ sufficiently small. Assuming explicitly that 
f(x) was continuous on the given interval (and implicitly that it was uniformly continuous), 
Cauchy was able to show that all sums of that form approach a fixed value, called by definition 
the integral of the function on that interval. This is an extremely hard proof [28]. Finally, 
borrowing from Lagrange the mean-value theorem for integrals, Cauchy proved the Fundamental 
Theorem of Calculus [29]. 
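
To illustrate the kind of sum Cauchy worked with, here is a minimal Python sketch that evaluates sums of the form f(x_k)(x_{k+1} - x_k), added over a deliberately unequal partition. The function names, the quadratically spaced partition, and the test function cos on [0, 1] are all assumptions chosen for illustration. The sketch only observes convergence numerically for one function and is no substitute for Cauchy's proof.

```python
import math

def cauchy_sum(f, xs):
    """Sum of f(x_k) * (x_{k+1} - x_k) over a partition xs[0] < xs[1] < ... < xs[-1]."""
    return sum(f(xs[k]) * (xs[k + 1] - xs[k]) for k in range(len(xs) - 1))

def uneven_partition(a, b, n):
    """Hypothetical unequal partition: the points a + (b - a) * (k/n)**2 for k = 0..n."""
    return [a + (b - a) * (k / n) ** 2 for k in range(n + 1)]

def mesh(xs):
    """Length of the longest subinterval x_{k+1} - x_k."""
    return max(xs[k + 1] - xs[k] for k in range(len(xs) - 1))

# Illustrative assumption: f = cos on [0, 1], whose integral is sin(1).
exact = math.sin(1.0)
for n in (10, 100, 1000):
    xs = uneven_partition(0.0, 1.0, n)
    print(n, mesh(xs), abs(cauchy_sum(math.cos, xs) - exact))
```

The subintervals here are far from equal (those near 0 are much shorter than those near 1), yet the sums still settle toward the same value as the longest subinterval shrinks, which is the point on which Poisson offered only an assertion.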


Conclusion. Here are all the pieces of the puzzle we originally set out to solve. Algebraic 
approximations produced the algebra of inequalities; eighteenth-century approximations in the 
calculus produced the useful properties of the concepts of analysis: d’Alembert’s error-bounds for 
series, Lagrange’s inequalities about derivatives, Euler’s approximations to integrals. There was a 
new interest in foundations. All that was needed was a sufficiently great genius to build the new 
foundation. 

Two men came close. In 1816, Carl Friedrich Gauss gave a rigorous treatment of the 
convergence of the hypergeometric series, using the technique of comparing a series with 
convergent geometric progressions; however, Gauss did not give a general foundation for all of 
analysis. Bernhard Bolzano, whose work was little known until the 1860’s, echoing Lagrange’s call 
to reduce the calculus to algebra, gave in 1817 a definition of continuous function like Cauchy’s 
and then proved—by a different technique from Cauchy’s—the intermediate-value theorem [30]. 
But it was Cauchy who gave rigorous definitions and proofs for all the basic concepts; it was he 
who realized the far-reaching power of the inequality-based limit concept; and it was he who gave 
us, except for a few implicit assumptions about uniformity and about completeness, the 
modern rigorous approach to calculus. 

Mathematicians are used to taking the rigorous foundations of the calculus as a completed 
whole. What I have tried to do as a historian is to reveal what went into making up that great 
achievement. This needs to be done, because completed wholes by their nature do not reveal the 
separate strands that go into weaving them—especially when the strands have been considerably 
transformed. In Cauchy’s work, though, one trace indeed was left of the origin of rigorous calculus 
in approximations: the letter epsilon. The ε corresponds to the initial letter in the word “erreur” 
(or “error”), and Cauchy in fact used ε for “error” in some of his work on probability [31]. It is 
both amusing and historically appropriate that the “ε,” once used to designate the “error” in 
approximations, has become transformed into the characteristic symbol of precision and rigor in 
the calculus. As Cauchy transformed the algebra of inequalities from a tool of approximation to a 
tool of rigor, so he transformed the calculus from a powerful method of generating results to the 
rigorous subject we know today. 


References 


1. A.-L. Cauchy, Cours d’analyse, Paris, 1821; in Oeuvres completes d’Augustin Cauchy, series 2, vol. 3, Paris, 
Gauthier-Villars, 1899, p. 19. 

2. A.-L. Cauchy, Resume des lecons donnees a l’ecole royale polytechnique sur le calcul infinitesimal, Paris, 
1823; in Oeuvres, series 2, vol. 4, p. 44. Cauchy used i for the increment; otherwise the notation is his. 

3. Isaac Newton, Mathematical Principles of Natural Philosophy, 3rd ed., 1726, tr. A. Motte, revised by Florian 
Cajori, University of California Press, Berkeley, 1934, Scholium to Lemma XI, p. 39. 

4. Johann Bernoulli, Opera Omnia, IV, 8; section entitled “De seriebus varia, Corollarium III,” cited by D. J. 
Struik, A Source Book in Mathematics, 1200-1800, Harvard, Cambridge, 1969, p. 321. 

5. Boyer, History of Mathematics, p. 487; Euler’s paper is in Comm. Acad. Sci. Petrop., 7, 1734-5, pp. 123-34; 
in Leonhard Euler, Opera omnia, series 1, vol. 14, pp. 73-86. 

6. J. V. Grabiner, The Origins of Cauchy’s Rigorous Calculus, M. I. T. Press, Cambridge and London, 1981, 
chapter 2. 

7. J. d'Alembert, Reflexions sur les suites et sur les racines imaginaires, in Opuscules mathematiques, vol. 5, 
Briasson, Paris, 1768, pp. 171-215; see especially pp. 175-178. 

8. J.-L. Lagrange, Traite de la resolution des equations numeriques de tous les degres, 2nd ed., Courcier, Paris, 
1808; in Oeuvres de Lagrange, Gauthier-Villars, Paris, 1867-1892, vol. 8, pp. 162-163. 

9. Lagrange, Theorie des fonctions analytiques, 2nd ed., Paris, 1813, in Oeuvres, vol. 9, pp. 80-85; compare 
Lagrange, Lecons sur le calcul des fonctions, Paris, 1806, in Oeuvres, vol. 10, pp. 91-95. 

10. Grabiner, Origins of Cauchy’s Rigorous Calculus, pp. 56-68; compare H. Goldstine, A History of 
Numerical Analysis from the 16th through the 19th Century, Springer-Verlag, New York, Heidelberg, Berlin, 1977, 
chapters 2-4. 

11. George Berkeley, The Analyst, section 35. 

12. Analyst, section 15. Berkeley used the function $x^n$ where we have used $x^2$, and a Newtonian notation, 
lower-case o, for the increment. 

13. Letter from Lagrange to d’Alembert, 24 February 1772, in Oeuvres de Lagrange, vol. 13, p. 229. 

14. D. Diderot, De l’interpretation de la nature, in Oeuvres philosophiques, ed. P. Verniere, Garnier, Paris, 
1961, pp. 180-181. 

15. Cauchy, Cours d’analyse, Oeuvres, series 2, vol. 3; for real-valued series, see especially pp. 114-138. 

16. Cauchy, op. cit., p. 43. So did Bolzano; see below, and note 30. 

17. Cauchy, op. cit., pp. 378-380. For an English translation of this proof, see Grabiner, Origins, pp. 167-168. 
For clarity, I have substituted $b, b_1, b_2, \ldots$ and $c, c_1, c_2, \ldots$ for Cauchy’s $x_0, x_1, x_2, \ldots$ and $X, X', X'', \ldots$ in the 
present version. 

18. Lagrange, Equations numeriques, sections 2 and 6, in Oeuvres, vol. 8; also in Lagrange, Lecons elementaires 
sur les mathematiques données a l’ecole normale en 1795, Seances des Ecoles Normales, Paris, 1794-1795; in 
Oeuvres, vol. 7, pp. 181-288; this method is on pp. 260-261. 

19. I. Grattan-Guinness, Development of the Foundations of Mathematical Analysis from Euler to Riemann, 
M. I. T. Press, Cambridge and London, 1970, p. 123, puts it well: “Uniform convergence was tucked away in the 
word “always,” with no reference to the variable at all.” 

20. Lagrange, Lecons sur le calcul des fonctions, Oeuvres 10, p. 87; compare Lagrange, Theorie des fonctions 
analytiques, Oeuvres 9, p. 77. I have substituted h for the i Lagrange used for the increment. 

21. Lagrange, Theorie des fonctions analytiques, Oeuvres 9, p. 29. Compare Lecons sur le calcul des fonctions, 
Oeuvres 10, p. 101. For Euler, see his Institutiones calculi differentialis, St. Petersburg, 1755; in Opera, series 1, vol. 
10, section 122. 

22. Grabiner, Origins of Cauchy’s Rigorous Calculus, chapter 5; also J. V. Grabiner, The origins of Cauchy’s 
theory of the derivative, Hist. Math., 5, 1978, pp. 379-409. 

23. The notation is modernized. For Euler, see Institutiones calculi integralis, St. Petersburg, 1768-1770, 3 vols; 
in Opera, series 1, vol. 11, p. 184. Eighteenth-century summations approximating integrals are treated in A. P. 
Iushkevich, O vozniknoveniya poiyatiya ob opredelennom integrale Koshi, Trudy Instituta Istorii Estestvoznaniya, 
Akademia Nauk SSSR, vol. 1, 1947, pp. 373-411. 

24. S. D. Poisson, Suite du memoire sur les integrales definies, Journ. de l’Ecole polytechnique, Cah. 18, 11, 
1820, pp. 295-341, 319-323. I have substituted h, w for Poisson’s a, k, and have used $R_1$ for his $R_0$. 

25. Poisson, op. cit., pp. 329-330. 

26. Cauchy, Cours d’analyse, Introduction, Oeuvres, Series 2, vol. 3, p. iii. 

27. Cauchy, Cours d’analyse, Note VII, Oeuvres, series 2, vol. 3, pp. 456-457. 

28. Cauchy, Calcul infinitesimal, Oeuvres, series 2, vol. 4, pp. 122-125; in Grabiner, Origins of Cauchy’s Rigorous 
Calculus, pp. 171-175 in English translation. 

29. Cauchy, op. cit., pp. 151-152. 

30. B. Bolzano, Rein analytischer Beweis des Lehrsatzes dass zwischen je zwey Werthen, die ein 
entgegengesetztes Resultat gewaehren, wenigstens eine reelle Wurzel der Gleichung liege, Prague, 1817. English version, S. B. 
Russ, A translation of Bolzano’s paper on the intermediate value theorem, Hist. Math., 7, 1980, pp. 156-185. The 
contention by Grattan-Guinness, Foundations, p. 54, that Cauchy took his program of rigorizing analysis, definition 
of continuity, Cauchy criterion, and proof of the intermediate-value theorem, from Bolzano’s paper without 
acknowledgement is not, in my opinion, valid; the similarities are better explained by common prior influences, 
especially that of Lagrange. For a documented argument to this effect, see J. V. Grabiner, Cauchy and Bolzano: 
Tradition and transformation in the history of mathematics, to appear in E. Mendelsohn, Transformation and 
Tradition in the Sciences, Cambridge University Press, Cambridge, forthcoming; see also Grabiner, Origins of 
Cauchy’s Rigorous Calculus, pp. 69-75, 102-105, 94-96, 52-53. 

31. Cauchy, Sur la plus grande erreur a craindre dans un resultat moyen, et sur le systeme de facteurs qui rend 
cette plus grande erreur un minimum, Comptes rendus 37, 1853; in Oeuvres, series 1, vol. 12, pp. 114-124. 


CENTER SECTION 
(Vol. 90, No. 3, March 1983) 


Telegraphic Reviews 


Edited by Lynn Arthur Steen, with the assistance of the Mathematics Departments 
of Carleton, Macalester, and St. Olaf Colleges. Books submitted for review 
should be sent to Book Review Editor, American Mathematical Monthly, St. Olaf 
College, Northfield, Minnesota 55057. 


Telegraphic reviews are designed to give prompt notice of all new books in the mathematical 
sciences. Certain of these books will be selected for more extensive review in the Reviews section 
of the Monthly. 


Special Codes: 


T: Textbook                 13-18: Grade Level 
S: Supplementary Reading    1-4: Time in Semesters 
P: Professional Reading     **: Special Emphasis 
L: Undergraduate Library    ??: Questionable 


General, P, L. Faces of Science. V.V. Nalimov. ISI Pr, 1981, xv + 297 pp, $22.50. [ISBN: 0-89495- 
010-X] A prominent Soviet statistician contemplates the social nature of science, viewing it as a 
large self-organized information system, a cybernetic system. This strongly humanistic analysis 
focusses especially on the role of language and of uncertainty in shaping scientific theory and 
experimental practice. GHM 


General. Practical Math Handbook for the Building Trades. Paul Calter. Prentice-Hall, 1983, x + 
294 pp, $8.95 (P). [ISBN: 0-13-692228-7] Very much a practical handbook at the precalculus level. 
Contains a general mathematical formula and table section (166 pages), including an outline of 
pocket calculator use, and a nice (110 page) section on construction including very complete and 
accessible tables and formulas for plumbing, heating, and financial computations. Handy for 
householders and also as a source of "practical" problems for high school algebra. JAS 


General, S, L*. Tools for Teaching. Birkhauser Boston, 1981. UMAP Modules 1977-1979. xii + 727 
pp, $35 [ISBN: 3-7643-3049-X]; UMAP Modules for 1980. xii + 690 pp, $35. [ISBN: 3-7643-3059-7] 
Annual collections of field-tested modules from project UMAP. These lecture-sized supplements offer 
diverse opportunities for classroom enrichment: cardiac output, Monte Carlo experiments, digestion 
in sheep, scheduling prison guards, et al. LAS 


General, S, L*. Handbook of Cubik Math. Alexander H. Frey, Jr., David Singmaster. Enslow Pub, 
1982, viii + 193 pp, $9.95 (P). [ISBN: 0-89490-058-7] A well-written elaboration of Singmaster’s 
Notes on Rubik’s Magic Cube (TR, February 1982), containing much the same material in more 
accessible form: notation, solution methods, analysis of basic moves, subgroup structures, advanced 
methods. Deals only with the 3x3 cube. LAS 


General, S, L*. Inside Rubik’s Cube and Beyond. Christoph Bandelow. Birkhauser Boston, 1982, 125 
pp, $3.95 (P). [ISBN: 3-7643-3078-3] A systematic, mathematical treatment of Rubik’s cube and its 
associated group theory, complete with definitions, theorems and proofs. The notation is similar 
but not identical to the now-standard notation introduced by Singmaster. Includes discussion of 
other similar puzzles (tetrahedron, dodecahedron, 2x2 cube, but not the 4x4 cube), flowchart for a 
computer program and a catalogue of maneuvers. LAS 


General, P, L**. An Assessment of Research-Doctorate Programs in the United States: Mathematical & 
Physical Sciences. Ed: Lyle V. Jones, Gardner Lindzey, Porter E. Coggeshall. National Academy Pr, 
1982, xii + 243 pp, $10.50 (P). [ISBN: 0-309-03299-7] Full report on the recent, widely-publicized 
survey of characteristics and quality in Ph.D. programs. Data from 16 different measures (of size, 
reputation, library, graduates, publications, research support) are presented in raw, 
standardized, and graphical forms. LAS 


Mathematics Appreciation, S*(13-15), L. Mathematical Fallacies and Paradoxes. Bryan H. Bunch. Van 
Nostrand Reinhold, 1982, xi + 216 pp, $16.95. [ISBN: 0-442-24905-5] A well-written elementary intro- 
duction to paradoxes of logic, geometry, and algebra, assuming only the rudiments of high school 
algebra and geometry. Moves carefully but swiftly from traditional fallacies (e.g., division by 
zero) through liar paradoxes to more sophisticated things such as Gödel’s theorem, Banach-Tarski 
paradox and Zeno’s paradoxes. LAS 


Education, S*(13). Studying Mathematics. Mary Catharine Hudspeth, Lewis R. Hirsch. Kendall/Hunt, 
1982, xi + 51 pp, $4.95 (P). [ISBN: 0-8403-2768-4] Nearly everything you want students to know about 
preparing for and being successful in elementary courses. Covers topics such as: how to read a math 
text, preparing for exams, organized note-taking. Responds well to standard student 
complaints/excuses, e.g., "I would have passed if I didn’t make so many dumb mistakes." JRG 




Education, P. Pushbutton Mathematics: Calculator Math Problems, Examples, and Activities. Kenneth 
P. Goldberg. Prentice-Hall, 1982, x + 197 pp, $8.95 (P). [ISBN: 0-13-743302-6] A guide to the use 
of calculators in the mathematics classroom. Gives detailed examples of some topics and then 
additional activities in seven areas: elementary algebra, trigonometry, intermediate/advanced algebra, 
geometry, business/consumer mathematics, probability and statistics, and elementary calculus. RSK 


Education, P*, L. The Future of College Mathematics. Ed: Anthony Ralston, Gail S. Young. 
Springer-Verlag, 1983, ix + 278 pp. [ISBN: 0-387-90813-7] Proceedings of a 1982 summer conference at 
Williams College devoted to an analysis of the proposed need for a freshman/sophomore discrete 
mathematics alternative to calculus. Includes both papers--suggested course outlines, interface 
with client disciplines, problems of implementation--and summaries of discussion. LAS 


History. Gabor Szegö: Collected Papers. Ed: Richard Askey. Birkhauser Boston, 1982, $180 set. 
Volume 1: 1915-1927, xx + 857 pp [ISBN: 3-7643-3056-2]; Volume 2: 1927-1943, x + 869 pp [ISBN: 3- 
7643-3060-0]; Volume 3: 1945-1972, x + 880 pp. [ISBN: 3-7643-3061-9] All of Szegö’s papers, arranged 
in chronological order, introduced by a short biography; commentary on Szegö’s life and work by G. 
Polya, P.C. Rosenbloom, and R. Askey; reviews of three of Szegö’s books; and two mathematical 
commentaries on Szegö’s work. LAS 


History, S*(14-18), P*, L***. Neyman--from life. Constance Reid. Springer-Verlag, 1982, 298 pp, 
$19.80. [ISBN: 0-387-90747-5] Fascinating non-technical account of the life of Jerzy Neyman, one of 
the great pioneers of mathematical statistics, by the author of Hilbert and Courant in Göttingen 
and New York. Written while Neyman (1894-1981) was still alive, it contains his personal recollec- 
tions as well as those of his colleagues, including his best-known collaborator, Egon Pearson. Must 
reading for anyone interested in the history of statistics. RSK 


History, L*. The Collected Letters of Colin MacLaurin. Ed: Stella Mills. Shiva Pub, 1982, xx + 
496 pp, $35. [ISBN: 0-906812-08-9] Colin MacLaurin, one of Scotland’s greatest mathematicians, 
helped interpret and defend Newton in, e.g., his 1742 Treatise of Fluxions, an answer to Bishop 
Berkeley’s 1734 The Analyst. This volume of letters, both general and scientific, traces the 
development of his ideas in exchanges with Newton and Stirling and many less well known eighteenth 
century scientists and mathematicians. LAS 


Foundations, P. Mathematical Logic, Revised Edition. Willard Van Orman Quine. Harvard U Pr, 1976, 
xii + 346 pp, $25; $9.95 (P). [ISBN: 0-674-55450-7] Unaltered reprinting of 1951 revised edition. A 
classic of the field, but now considerably dated as to notation and emphasis. GHM 


Foundations, T*(15-16: 1), S, L*. Formal Number Theory and Computability: A Workbook. Alec Fisher. 
Clarendon Pr, 1982, xiii + 190 pp, $12.50 (P); $37. [ISBN: 0-19-853188-5; 0-19-853178-8] Interesting 
text for first undergraduate course in mathematical logic. Focuses on Gödel’s incompleteness 
theorem and related consistency and decidability results. Covers general propositional and predi- 
cate calculus briefly. Introduces computability using register machines. Numerous exercises; com- 
plete solutions in appendix. KS 


Graph Theory, P. Zero-Symmetric Graphs: Trivalent Graphical Regular Representations of Groups. 
H.S.M. Coxeter, Roberto Frucht, David L. Powers. Academic Pr, 1981, ix + 170 pp, $15. [ISBN: 0-12- 
194580-4] Description of all known zero-symmetric graphs with not more than 120 vertices. Zero- 
symmetric graphs are finite trivalent graphs whose automorphism group acts regularly on the 
vertices. KS 


Combinatorics, P. Lecture Notes in Mathematics-952: Combinatorial Mathematics IX. Ed: Elizabeth J. 
Billington, Sheila Oates-Williams, Anne Penfold Street. Springer-Verlag, 1982, xi + 443 pp, $23 
(P). [ISBN: 0-387-11601-X] Proceedings of the Ninth Australian Conference on Combinatorial 
Mathematics held at the University of Brisbane, Australia, August 1981. Texts of seven of the nine invited 
talks, plus 20 contributed papers. LAS 


Discrete Mathematics, T(17-18: 1, 2), S, L. Introduction to Coding Theory. J.H. van Lint. Grad. 
Texts in Math., No. 86. Springer-Verlag, 1982, ix + 171 pp, $24. [ISBN: 0-387-11284-7] Graduate 
level text emphasizing construction and analysis of codes with some attention to mathematical prob- 
lems of decoding. Assumes knowledge of abstract and linear algebra, combinatorics, and probability 
theory. KS 


Number Theory, S*(13), P, L*. Solving Equations in Integers. A.O. Gelfond. Trans: O.B. Sheinin. 
Little Math. Lib. MIR Pub, 1981, 56 pp, $2 (P). Based on a lecture the author delivered in 1951 
for participants in a Mathematical Olympiad. Focuses on classical results concerning equations in 
two or three unknowns of low degree. CEC 


Number Theory, T(17: 1), S, P, L. Adeles and Algebraic Groups. A. Weil. Progress in Math., V. 23. 
Birkhauser Boston, 1982, 126 pp, $10. [ISBN: 3-7643-3092-9] This volume contains the original lec- 
ture notes (from 1959-60) in which the concept of adeles was first introduced. The notes have been 
supplemented by an extended bibliography and a brief survey of subsequent research. CEC 


Number Theory, S(18), P. Periods of Hilbert Modular Surfaces. Takayuki Oda. Progress in Math., V. 
19. Birkhauser Boston, 1982, xvi + 123 pp, $10. [ISBN: 3-7643-3084-8] The author investigates Hodge 
structures of Hilbert modular surfaces and shows that the Hodge structure attached to a Hilbert 
modular surface is a direct sum of tensor products of two Hodge structures of weight 1. Includes a 
bibliography. CEC 


Number Theory, P. Ternary Quadratic Forms and Norms. Ed: Olga Taussky. Lect. Notes in Pure & 
Appl. Math., V. 79. Dekker, 1982, vii + 135 pp, $19.75 (P). [ISBN: 0-8247-1651-5] Presents six 
papers given at a Cal Tech seminar during the 1974-75 academic year. Of special interest is 
Zassenhaus’s exposition of Gauss’s theory of ternary forms. Other contributors are Kisilevsky, 
Plesken, Rehm, Taussky, Weiss, and via letter, van der Waerden. SG 


Number Theory, T(15: 1), S, P, L. The Higher Arithmetic: An Introduction to the Theory of Numbers, 
Fifth Edition. H. Davenport. Cambridge U Pr, 1982, 189 pp, $22.95; $9.95 (P). [ISBN: 0-521-24422- 
6; 0-521-28678-6] A new edition of this classic work, which was first published in 1952 and last 
reissued in 1970. This edition includes problems and solutions and occasional updates. CEC 


Linear Algebra, P. Recent Applications of Generalized Inverses. Ed: S.L. Campbell. Research Notes 
in Math., No. 66. Pitman Pub, 1982, 274 pp, $23.95 (P). [ISBN: 0-273-08550-6] Collection of papers 
designed to reflect current trends in research done on and with generalized inverses of matrices. 
Includes applications to electrical networks and Markov chains, use in numerical analysis, and 
extension of theory to matrices over rings and infinite dimensional spaces. KS 


Linear Algebra, T(16-17: 1, 2), S, P, L. Cartesian Tensors: With Applications to Mechanics, Fluid 
Mechanics and Elasticity. A.M. Goodbody. Math. & Its Appl. Halsted Pr, 1982, 298 pp, $75. [ISBN: 
0-470-27254-6] Multilinear algebra developed with physical applications--especially mechanics--in 
mind. Emphasis on computation; many explicit calculations, examples. A readable text for 
engineering students with some mathematical sophistication. Exercises, many with solutions. PZ 


Linear Algebra, T(13: 1), S, L. Matrix Algebra Useful for Statistics. Shayle R. Searle. Wiley, 
1982, xxii + 438 pp, $29.95. [ISBN: 0-471-86681-4] The final two chapters on regression and linear 
models in statistics are described using matrix terminology. The objective of the book is to lead 
even the minimally prepared student (high school algebra) to a familiarity with the notation and an 
understanding of the relevant theory (including generalized inverses, partitioned matrices, eigen- 
values and eigenvectors). The theoretical development is profusely illustrated with numerical exam- 
ples and applications from the applied sciences. LCL 


Algebra, P. Lecture Notes in Mathematics-924: Séminaire d’Algèbre Paul Dubreil et Marie-Paule 
Malliavin. Ed: M.-P. Malliavin. Springer-Verlag, 1982, v + 461 pp, $21.60 (P). [ISBN: 0-387-11496-3] 
Proceedings of a 1981 conference which took place in Paris. CEC 


Algebra, T(18), S, P. A Concrete Approach to Division Rings. John Dauns. Res. & Educ. in Math., 
No. 2. Heldermann Verlag, 1982, xx + 417 pp, 78 DM (P). [ISBN: 3-88538-202-4] This is the first 
book to treat all types of division rings. By constructing and drawing attention to concrete exam- 
ples, the author is able to lead even the non-expert easily and quickly to the frontiers of the 
subject--in marked contrast to the classical references on division rings in which the relevant 
facts are often given as corollaries to long chains of previous results. LCL 


Algebra, T(18), P. Cohomology of Groups. Kenneth S. Brown. Grad. Texts in Math., No. 87. 
Springer-Verlag, 1982, x + 306 pp, $28. [ISBN: 0-387-90688-6] An introduction to a subject which 
draws on a rich interplay between algebra and topology. The author builds the subject from the bot- 
tom, assuming only a knowledge of the first year graduate courses in algebra and algebraic topology. 
The final four chapters are more specialized. LCL 


Algebra, P. Lecture Notes in Mathematics-944: Representations of Algebras. Ed: M. Auslander, E. 
Lluis. Springer-Verlag, 1982, w + 258 pp, $14 (P). [ISBN: 0-387-11577-3] Notes of four series of 
lectures given at the workshop held the week before the Third International Conference on 
Representations of Algebras held August 8-16, 1980 in Puebla, Mexico. The proceedings of this conference 
appeared as Lecture Notes 903. JAS 


Algebra, P. Lecture Notes in Mathematics-951: Advances in Non-Commutative Ring Theory. Ed: P.J. 
Fleury. Springer-Verlag, 1982, 142 pp, $8.50 (P). [ISBN: 0-387-11597-8] Proceedings of the Twelfth 
George H. Hudson Symposium held at the State University College of Arts and Science at Plattsburgh, 
New York on April 23-25, 1981. JAS 


Algebra, T(14-16). A First Undergraduate Course in Abstract Algebra, Third Edition. Abraham P. 
Hillman, Gerald L. Alexanderson. Wadsworth Pub, 1983, ix + 501 pp. [ISBN: 0-534-01195-0] The major 
changes from previous editions include several sections on discrete mathematics and a new chapter on 
coding theory. However, the book remains a solid introduction to modern algebra suitable for any 
student of mathematics, computer science, or statistics. (First Edition, Extended Review, August- 
September 1974; Second Edition, TR, June-July 1978.) SG 


Algebra, P. Lecture Notes in Computer Science-144: Computer Algebra. Ed: Jacques Calmet. 
Springer-Verlag, 1982, xiv + 301 pp, $14 (P). [ISBN: 0-387-11607-9] Over thirty papers from the 
European Computer Algebra Conference held in Marseille, France in April 1982, dealing with the 
design, analysis, and implementation of constructive methods for solving various problems (e.g., 
root finding, factorization) in different algebraic structures. LCL 


Algebra, P. Lecture Notes in Mathematics-933: Lie Algebras and Related Topics. Ed: D. Winter. 
Springer-Verlag, 1982, vi + 236 pp, $11 (P). [ISBN: 0-387-11563-3] Proceedings of a conference held 
at Rutgers University at New Brunswick, New Jersey, May 29-31, 1981. JAS 


Calculus, S(13). Areas and Logarithms. A.I. Markushevich. Trans: I. Aleksanova. MIR Pub, 1981, 
70 pp, $2 (P). Integral as area. Develops logarithm as integral and derives properties, including 
series approximation for ln(1+x). Brief discussion of distribution of primes and relationship to 
logarithm function. Supplementary treatment of Simpson’s Rule. JRG 


Real Analysis, P. Lecture Notes in Mathematics-945: Measure Theory, Oberwolfach 1981. Ed: D. 
Kölzow, D. Maharam-Stone. Springer-Verlag, 1982, xv + 431 pp, $20.50 (P). [ISBN: 0-387-11580-3] 
Proceedings of a June 1981 conference at Oberwolfach. Concludes with a short list of open problems. 
LAS 


Complex Analysis, S(18), P. Approximation Uniforme Qualitative sur des Ensembles non Bornés. Paul 
M. Gauthier, Walter Hengartner. Pr U Montreal, 1982, 89 pp, $11 (P). [ISBN: 2-7606-0574-4] Notes 
from lectures given at the August 1981 NATO seminar at Montreal. Surveys results and methods of 
locally uniform approximation theory for holomorphic functions on closed domains in Riemann 
surfaces, and for harmonic functions on domains in $R^n$. Approximation theorems of Runge and Mergelyan 
are generalized. PZ 


Complex Analysis, P. Lecture Notes in Mathematics-950: Complex Analysis. Ed: J. Eells. Springer- 
Verlag, 1982, 428 pp, $20.50 (P). [ISBN: 0-387-11596-X] Proceedings of the 1980 international summer 
school in theoretical physics in Trieste: seven expositions on such topics as twistor theory, holo- 
morphic vector bundles, and holomorphic foliations. LAS 


Differential Equations, T**(15: 1), S, L*. A First Course in Differential Equations with 
Applications, Second Edition. Dennis G. Zill. Prindle, Weber & Schmidt, 1982, ix + 527 pp. [ISBN: 0-534- 
98011-2] Several revisions have been made in this edition, including over 600 new problems and many 
new examples and figures. All of the standard topics are covered, along with an introduction to 
partial differential equations, in this readable text. (TR, First Edition, August-September 1980.) 
CEC 


Differential Equations, S(17-18), P. Methods of Bifurcation Theory. Shui-Nee Chow, Jack K. Hale. 
Grund. der math. Wissenschaften, B. 251. Springer-Verlag, 1982, xv + 515 pp, $48. [ISBN: 0-387- 
90664-9] The primary objective of this monograph is a presentation of those aspects of static and 
dynamic bifurcation theory which are of particular value in the study of differential equations. 
Introductory material on nonlinear functional analysis and the qualitative theory of differential 
equations makes the book accessible to nonspecialists. AO 


Differential Equations, T(16-17: 1, 2), L. Ordinary Differential Equations, Second Edition. Philip 
Hartman. Birkhauser Boston, 1982, xv + 612 pp, $29.95. [ISBN: 3-7643-3068-6] Reprint of the 1973 
corrected reprint (TR, June-July 1974) of the 1964 Wiley original edition. AO 


Differential Equations, T(18: 1), S, P. Variational Principles and Free-Boundary Problems. Avner 
Friedman. Pure & Appl. Math. Wiley, 1982, ix + 710 pp, $52.50. [ISBN: 0-471-86849-3] A major 
effort to present in "a systematic and self-contained manner" many of the recent developments in the 
theory of free boundary problems. The first two chapters are used to assemble the necessary theory 
of partial differential operators with the remaining three aimed toward applications: jets and 
cavities, variational problems, and non-variational problems. Bibliographical remarks, references, index. 
JS 


Numerical Analysis, P, L. Applied Iterative Methods. Louis A. Hageman, David M. Young. Comp. Sci. 
& Appl. Math. Academic Pr, 1981, xvii + 386 pp, $39.50. [ISBN: 0-12-313340-8] Presents a number of 
iterative methods for solving large, sparse systems of linear algebraic equations including both 
computational and theoretical aspects. Also includes a discussion of the use of iterative methods 
in the solution of multidimensional boundary-value problems. AO 


Numerical Analysis, S(16-18), P. Implementation of Finite Element Methods for Navier-Stokes 
Equations. Francois Thomasset. Ser. in Computational Physics. Springer-Verlag, 1981, vii + 159 pp, 
$28. [ISBN: 0-387-10771-1] A concise introduction to the use of finite element methods in fluid 
mechanics. The emphasis is on implementation issues rather than on the underlying mathematics. AO 


Numerical Analysis, P. Numerical Integration. Ed: G. Hämmerlin. ISNM 57. Birkhauser Boston, 1982, 
275 pp, $29.95. [ISBN: 3-7643-1254-8] The proceedings of a conference held in Oberwolfach in 
October, 1981. Among the topics discussed in the 25 papers are the theory of quadrature, multidi- 
mensional numerical integration schemes, and the quadrature of functions with special properties 
(e.g., convexity). AO 


Numerical Analysis, P. Méthodes de lagrangien augmenté. M. Fortin, R. Glowinski. Dunod, 1982, xvi 
+ 320 pp, 170 FF. [ISBN: 2-04-015465-5] The object of the book is to show that augmented Lagrangian 
methods provide highly effective algorithms for the numerical solution of boundary value problems 
for partial differential equations. LCL 


Numerical Analysis, T(17-18: 1), P. Finite Elements: Mathematical Aspects, Volume IV. J. Tinsley 
Oden, Graham F. Carey. Prentice-Hall, 1983, viii + 195 pp, $29. [ISBN: 0-13-317081-0] This work 
presents the basic mathematics needed to understand the convergence and stability properties of 
finite element methods for linear elliptic boundary-value problems. Some experience with finite 
element methods and some knowledge of real and functional analysis are presumed. AO 


Numerical Analysis, P. Lecture Notes in Mathematics-953: Iterative Solution of Nonlinear Systems of 
Equations. Ed: R. Ansorge, Th. Meis, W. Törnig. Springer-Verlag, 1982, vii + 202 pp, $12 (P). 
[ISBN: 0-387-11602-8] Proceedings representing 13 of 24 lectures presented at a meeting held at 
Oberwolfach, Germany, January 31 to February 5, 1982. JAS 


Numerical Analysis, P. Numerical Grid Generation. Ed: Joe F. Thompson. Elsevier North-Holland, 
1982, xxv + 909 pp, $95. [ISBN: 0-444-00757-1] Proceedings of a symposium on the numerical genera- 
tion of curvilinear coordinate systems and their use in the numerical solution of partial differen- 
tial equations, held in Nashville in April, 1982. 45 papers, many of which are expository. RWN 


Numerical Analysis, S(13-15). Numerical Analysis Demonstrations on the HP-33E. Peter and Marie
Louise Henrici. Wiley, 1982, 234 pp, $11.95 (P). [ISBN: 0-471-05943-9] A machine-dependent supple- 
ment to the first author’s Essentials of Numerical Analysis, With Pocket Calculator Demonstrations 
(TR, January 1983). Nonetheless, an interesting, useful set of examples. RWN 


Numerical Analysis, P. Nonlinear Equivalence, Reduction of PDEs to ODEs and Fast Convergent Numeri- 
cal Methods. E.E. Rosinger. Research Notes in Math., No. 77. Pitman Pub, 247 pp, $21.95 (P). 
[ISBN: 0-273-08570-0] Integrates several of the author’s works on several numerical methods for 
solving nonlinear systems of evolution partial differential equations. RWN 


Functional Analysis, P. From A to Z: Proceedings of a Symposium in Honour of A.C. Zaanen. Ed: C.B. 
Huijsmans, et al. Math. Centre Tracts, No. 149. Math Centrum, 1982, vii + 130 pp, Dfl. 16,80 (P). 
[ISBN: 90-6196-241-2] Collection of papers by former Ph.D. students of Zaanen, primarily in func- 
tional analysis, integration theory and Riesz space theory. JRG 


Functional Analysis, P. Schrödinger-type Operators with Continuous Spectra. M.S.P. Eastham, H.
Kalf. Research Notes in Math., No. 65. Pitman Pub, 1982, 281 pp, $24.95 (P). [ISBN: 0-273-08526-3]
An account of recent results concerning the existence and non-existence of eigenvalues embedded in
the continuous spectrum of Sturm-Liouville and Schrödinger-type operators. AO


Functional Analysis, P. Spectral Theory and Wave Operators for the Schrödinger Equation. A.M.
Berthier. Research Notes in Math., No. 71. Pitman Pub, 1982, 306 pp, $24.95 (P). [ISBN: 0-273-
08562-X] This work gives a nearly complete spectral analysis of self-adjoint Schrödinger operators
with short range potentials. AO


Functional Analysis, T(18: 1, 2), S, P. Lecture Notes in Mathematics-918: Schauder Bases in Banach
Spaces of Continuous Functions. Zbigniew Semadeni. Springer-Verlag, 1982, v + 136 pp, $9.80 (P).
[ISBN: 0-387-11481-5] The objects of study are countable subsets whose linear span is dense. Gen-
eral properties of Schauder bases of C(X), construction of spline and other special bases, and
nonexistence results for certain classes of bases are developed. In textbook style, with exercises.
PZ


Functional Analysis, P. Toeplitz Centennial. Ed: I. Gohberg. Operator Theory, V. 4. Birkhauser 
Verlag, 1982, 588 pp, $48.40. [ISBN: 3-7643-1331-1] Proceedings of a May 1981 memorial conference in 
Tel Aviv on the centennial of Otto Toeplitz’ birth: 29 research papers on aspects of operator
theory, plus four memorial papers (by Köthe, Dieudonné, and Toeplitz’ son) describing Toeplitz’
mathematical work as well as the personal and social pressures on a leading Jewish scholar during 
the rise of the National Socialists. LAS 


Functional Analysis, P. Lecture Notes in Mathematics-948: Functional Analysis. Ed: D. Butković, H.
Kraljević, S. Kurepa. Springer-Verlag, 1982, x + 239 pp, $12 (P). [ISBN: 0-387-11594-3] Lecture
notes (excluding those of P.R. Halmos which have been published elsewhere) from a November 1981
conference at Dubrovnik, Yugoslavia. LAS


Functional Analysis, T(17-18: 1), S, P, L. Notes on Real and Complex C*-Algebras. K.R. Goodearl. 
Birkhauser Boston, 1982, 211 pp, $28.95. [ISBN: 0-906812-16-X] Intended as a "leisurely" introduc-
tion to C*-algebras, three major results are developed: the complex and real Gelfand-Naimark theory
for both the commutative and non-commutative cases and the classification of approximately finite- 
dimensional C*-algebras. Exercises, references, index. JS 


Functional Analysis, T(17-18: 1), S, P, L. A Short Introduction to Perturbation Theory for Linear 
Operators. Tosio Kato. Springer-Verlag, 1982, xiii + 161 pp, $19.80. [ISBN: 0-387-90666-5] With 
relatively minor changes and a modest expansion, this is essentially the first two chapters of the 
author’s longer work on the subject (Perturbation Theory for Linear Operators, Springer-Verlag, 
1966; TR’s, August-September 1967; January 1978). It deals with the finite-dimensional theory and,
as such, is considerably simpler and self-contained. JS


Functional Analysis, P. "Ultra"-Techniques in Banach Space Theory. Brailey Sims. Papers in Pure & 
Appl. Math., No. 60. Queen’s U, 1982, v + 117 pp, (P). The Banach space ultraproduct has led to 
new developments and improved understanding in a variety of areas in Banach space theory. Written 
for Banach space theorists; assumes no prior knowledge of ultrapowers, and makes no explicit use of 
model theory. LCL


Analysis, P. Lecture Notes in Mathematics-919: Séminaire Pierre Lelong-Henri Skoda (Analyse) Années
1980/81 et Colloque de Wimereux, Mai 1981. Ed: Pierre Lelong, Henri Skoda. Springer-Verlag, 1982,
vii + 386 pp, $17.40 (P). [ISBN: 0-387-11482-3]


Analysis, P. Lecture Notes in Mathematics-906: Séminaire de Théorie du Potentiel Paris, No. 6. F.
Hirsch, G. Mokobodzki. Springer-Verlag, 1982, iv + 328 pp, $15.40 (P). [ISBN: 0-387-11185-9]


Analysis, P. Lecture Notes in Mathematics-939: Martingale Theory in Harmonic Analysis and Banach
Spaces. Ed: J.-A. Chao, W.A. Woyczynski. Springer-Verlag, 1982, viii + 225 pp, $12 (P). [ISBN: 0- 
387-11569-2] The proceedings, less ten lectures by Donald L. Burkholder, of the NSF-CBMS conference 
held at the Cleveland State University, Cleveland, Ohio, July 13-17, 1981. JAS 


Analysis, P, L***. Handbook of Applicable Mathematics, Volume IV: Analysis. Ed: Walter Ledermann,
Steven Vajda. Wiley, 1982, xxiii + 865 pp, $85. [ISBN: 0-471-10141-9] Fourth of six volumes on core
mathematics designed to set forth mathematical tools for practitioners in a self-contained, direct
manner. This volume covers calculus, real and complex analysis, ordinary, partial and stochastic
differential equations, integral transforms, nonlinear optimization, functional analysis, and much
more. LAS 


Analysis, P. Nonlinear Analysis and Applications. Ed: S.P. Singh, J.H. Burry. Lect. Notes in Pure 
& Appl. Math., V. 80. Dekker, 1982, x + 468 pp, $49.75 (P). [ISBN: 0-8247-1790-2] The proceedings 
of an international conference held at Memorial University, St. John’s, Newfoundland, Canada from 
June 1-3, 1981. JAS 


Analysis, P, L. Conference on Harmonic Analysis in Honor of Antoni Zygmund. Ed: William Beckner, et 
al. Wadsworth Pub, 1983, $79.75 set [ISBN: 0-534-98043-0]. Volume I, xvi + 345 pp; Volume II, xvi 
+ 488 pp. Papers on trigonometric series, Fourier analysis, singular integrals, Hardy spaces, dif- 
ferentiation theory, partial differential equations, and harmonic analysis, from a March 1981 
conference at the University of Chicago on the occasion of Zygmund’s eightieth birthday. LAS 


Algebraic Geometry, P. Lecture Notes in Mathematics-947: Algebraic Threefolds. Ed: Alberto Conte.
Springer-Verlag, 1982, vii + 315 pp, $16.50 (P). [ISBN: 0-387-11587-0] Proceedings of the CIME 
conference held at the "Villa Monastero" in Varenna (Como), Italy, June 14-23, 1981. JAS 


Algebraic Geometry, P. Lecture Notes in Mathematics-943: Modifications Analytiques. Vincenzo 
Ancona, Giuseppe Tomassini. Springer-Verlag, 1982, 120 pp, $8 (P). [ISBN: 0-387-11570-6] 


Algebraic Geometry, P. Introduction to Algebraic and Abelian Functions, Second Edition. Serge Lang. 
Grad. Texts in Math., No. 89. Springer-Verlag, 1982, ix + 169 pp, $32. [ISBN: 0-387-90710-6] Among 
the changes and additions from the First Edition (TR, January 1973) are Weil’s proof of Riemann-
Roch, Rohrlich’s work on the Fermat curve, and an expanded discussion of theta functions. SG 


Differential Geometry, T(18), S, P. Singularities of Smooth Functions and Maps. Jean Martinet. 
Transl: Carl P. Simon. London Math. Soc. Lect. Note Ser., No. 58. Cambridge U Pr, 1982, xiv + 256 
pp, $19.95 (P). [ISBN: 0-521-23398-4] A geometric treatment using only the basic notions of algebra 
(modules, ideals), topology (function spaces), and analysis (Taylor’s theorem, implicit function 
theorem). The emphasis is on the local theory and the important role of unfolding and deformation. 
Among ideas discussed in detail are Morse theory, the division theorem for smooth functions, and 
several classification theorems, including Thom’s theory of the seven elementary catastrophes. LCL 


Differential Geometry, P. Lecture Notes in Mathematics-949: Harmonic Maps. Ed: R.J. Knill, M. 
Kalka, H.C.J. Sealey. Springer-Verlag, 1982, 158 pp, $10 (P). [ISBN: 0-387-11595-1] Contributed 
papers from the December 1980 CBMS Regional Conference at Tulane, intended as a companion volume to 
the CBMS volume (Springer Lecture Note No. 950) containing the principal lectures of James Eells. 
LAS 


Differential Geometry, P. Selected Papers of Kentaro Yano. Ed: Morio Obata. Math. Stud., No. 70. 
Elsevier North-Holland, 1982, liii + 145 pp, $58.25 (P). [ISBN: 0-444-86495-4] A selection (by Yano) 
of his research papers in differential geometry, introduced by S.S. Chern and by a mathematical 
autobiography by Yano. LAS 


Geometry, T*(15-16: 1, 2), S, L**. The Non-Euclidean, Hyperbolic Plane: Its Structure and Con-
sistency. Paul Kelly, Gordon Matthews. Universitext. Springer-Verlag, 1981, xiii + 333 pp, $24
(P). [ISBN: 0-387-90552-9] With a detailed chapter on absolute geometry serving as foundation,
authors present a careful and thorough--yet elementary--treatment of hyperbolic geometry. Use of
transformations contributes novelty, efficiency and elegance. The Poincaré model, which is used to
establish relative consistency, is studied in great detail. Many exercises and an appendix with 
brief treatment of other geometries. Excellent textbook for prospective high school teachers. SS 


Geometry, P. Lecture Notes in Mathematics-926: Geometric Techniques in Gauge Theories. Ed: R. Mar- 
tini, E.M. de Jager. Springer-Verlag, 1982, ix + 219 pp, $12.50 (P). [ISBN: 0-387-11497-1] Proceed- 
ings of the Fifth Scheveningen Conference on Differential Equations, The Netherlands, August 23-28,
1981. JAS


Algebraic Topology, P. Etale Homotopy of Simplicial Schemes. Eric M. Friedlander. Annals of Math. 
Stud., No. 104. Princeton U Pr, 1982, vii + 190 pp, $26.50; $11.50 (P). [ISBN: 0-691-08288-X] An
exposition of the current state of étale homotopy theory, with applications to algebraic topology,
algebraic geometry, and the cohomology of groups. SG 


Topology, P. Topologia. J. Margalef Roig, E. Outerelo Dominguez, J.L. Pinilla Ferrando. Editorial 
Alhambra, 1982, ix + 382 pp, (P). [ISBN: 84-205-0879-9] Chapters 12 and 13 of an encyclopedia of
topology. Covers connectedness and spaces of functions. In Spanish. KS


Operations Research, S(15-17), L. Systems Modelling and Optimization. Ed: Peter Nash. Control 
Eng. Ser., V. 16. Peter Peregrinus, 1981, xii + 201 pp, $43 (P). [ISBN: 0-906048-63-X] The arti-
cles in this volume are drawn from material presented at a summer school for control engineers in 
July, 1980. Approximately one-half of the book is an introductory survey of optimization theory. 
The remainder is made up of five case studies and a short chapter on model evaluation. AO 


Probability, P. Lecture Notes in Mathematics-921: Séminaire de Probabilités XVI, 1980/81 
Supplément: Géométrie Différentielle Stochastique. Ed: J. Azéma, M. Yor. Springer-Verlag, 1982, 285 
pp, $12.90 (P). [ISBN: 0-387-11486-6] A supplement to Lecture Notes 920 which covers the special
lectures on stochastic differential geometry. JAS 


Probability, P. Lecture Notes in Mathematics-928: Probability Measures on Groups. Ed: H. Heyer. 
Springer-Verlag, 1982, x + 477 pp, $23 (P). [ISBN: 0-387-11501-3] Proceedings of the Sixth Confer- 
ence held at Oberwolfach, Germany, June 28 to July 4, 1981. JAS 


Statistics, T(16-18: 1, 2), P*. Bayesian Reliability Analysis. Harry F. Martz, Ray A. Waller.
Wiley, 1982, xix + 745 pp, $44.95. [ISBN: 0-471-86425-0] In the Wiley Series in Probability and
Mathematical Statistics. Comprehensive, self-contained presentation. First quarter provides an
introduction to probability, statistics, and classical reliability. Last three quarters deal
exclusively with Bayesian methods for reliability analysis. Good sets of references. RSK 


Statistics, P. Rates of Convergence in the Central Limit Theorem. Peter Hall. Research Notes in 
Math., No. 62. Pitman Pub, 1982, 251 pp, $22 (P). [ISBN: 0-273-08565-4] Research monograph unifying
two disparate traditional approaches: upper bounds on rates of convergence, and characterizations of
the rate of convergence. Presumes background in measure theory and Fourier analysis. RSK


Statistics, P. Sampling and Statistics Handbook for Research. Chester H. McCall, Jr. Iowa St U Pr,
1982, xi + 340 pp, $25.95 (P). [ISBN: 0-8138-1628-9] Introduction to basic statistical techniques 
and sampling theory for researchers in education. More concerned with sample surveys than experi- 
ments, it has chapters on stratified, cluster, and combination sampling plans. No exercises. RSK 


Statistics, P. Edgeworth Expansions for Linear Combinations of Order Statistics. R. Helmers. Math. 
Centre Tracts, No. 105. Math Centrum, 1982, iv + 137 pp, Dfl. 17,85 (P). [ISBN: 90-6196-174-2] 
Technical monograph providing more precise estimates of linear combinations of order statistics
which have limiting standard normal distributions. RSK


Computer Literacy, S(13). CBM Professional Computer Guide. Adam Osborne, Jim Strasma, Ellen
Strasma. Osborne/McGraw-Hill, 1982, xi + 512 pp, $15 (P). [ISBN: 0-931988-75-6] This book is sim- 
ply an extended user’s manual for the Commodore CBM 8032 computer system. The entire book is highly 
specific for the hardware and software of that particular system. For teaching purposes it would 
probably need to be supplemented by an introductory programming text and/or an introductory text on 
microcomputers. For its intended audience, however (owners of CBM 8032 systems), it seems quite 
thorough. It takes them through CBM-Basic, screen editing, peripherals, mass storage and advanced 
features of the hardware. This would be a useful book to purchase at the time that you buy a CBM 
8032 personal computer. MS 


Computer Programming, S(13), P. Program Your Microcomputer in BASIC. P.E. Gosling. Dilithium Pr,
1981, viii + 91 pp, $5.95 (P). [ISBN: 0-918398-52-5] An introduction to Basic primarily for micro- 
computers. The text consists largely of examples of programs. CEC 


Computer Programming, T*(13: 1), L. An Introduction to Programming and Problem Solving with Pascal, 
Second Edition. G. Michael Schneider. Wiley, 1982, xi + 468 pp, $21.95. [ISBN: 0-471-08216-3] This 
textbook is designed for a first course in computer programming similar to the course CS1 outlined
in the ACM Curriculum ’78 report. It covers all aspects of programming from problem specification
through documentation and maintenance. AO 


Computer Programming, S. Practical BASIC Programs: IBM Personal Computer Edition. Ed: Lon Poole. 
Osborne/McGraw-Hill, 1982, x + 162 pp, $15.99 (P). [ISBN: 0-931988-80-2] Collection of forty pro- 
grams from areas of finance, management decision making, statistics, and mathematics. Each program 
includes brief introduction to subject matter, description of input and output, sample run, practice 
problems, and complete listing. KS 


Computer Programming, T(13: 1), S. Learning BASIC Step by Step, Teacher’s Guide. Vern McDermott, 
Diana Fisher. Computer Sci Pr, 1982, vii + 257 pp, $17.95 (P). [ISBN: 0-914894-33-1] Another Basic 
programming text. Introduces elements of language through demonstration problems. Extensive use of 
flowcharts. Teacher’s guide contains more detailed explanations, solutions, comments on common 
errors and other pedagogical hints. KS


Computer Programming, T*(13: 1), L. Advanced Programming and Problem Solving with Pascal. G.
Michael Schneider, Steven C. Bruell. Wiley, 1981, xiii + 506 pp, $26.95. [ISBN: 0-471-07876-X] A
textbook for a second course in computer programming similar to the course CS2 of the ACM Curriculum
’78 report. A special feature of this book is the inclusion of relatively large, complex case stu-
dies to illustrate key points. AO


Computer Programming. Timex Sinclair 1000: Programs, Games and Graphics. Robin Jones, Ian Stewart. 
Birkhauser Boston, 1982, vii + 156 pp, $10.95 (P). [ISBN: 3-7643-3080-5] A non-threatening, witty 
but yet sound introduction to the Timex-Sinclair 1000. Includes sections on both hardware and Basic 
along with several program listings of applications. CEC 


Software Systems, S(16-18), P. Lecture Notes in Computer Science-143: Operating Systems Engineer-
ing. Ed: M. Maekawa, L.A. Belady. Springer-Verlag, 1982, vii + 465 pp, $19 (P). [ISBN: 0-387- 
11604-4] The proceedings of an IBM-sponsored conference held in Amagi, Japan, in October, 1980. The 
papers present recent work on the engineering of operating systems organized under the headings: 
concurrency and access control, program behavior and performance models, operating system evalua- 
tion, user interfaces, distributed operating systems, network operating systems, development
processes and tools, and data flow machines. AO 


Computer Science, P, L. The ILLIAC IV: The First Supercomputer. R. Michael Hord. Computer Sci Pr, 
1982, xii + 350 pp, $29.95. [ISBN: 0-914894-71-4] The Illiac IV (operational in 1975) was the first 
large scale array computer, incorporating both pipelining and a high level of parallelism. Intended 
for computer professionals, this book details the history, structure, applications, and impact of
the Illiac IV, providing a perspective on its strengths and weaknesses. GHM


Computer Science, S(15-17), L. Computer Architecture: A Structured Approach. R.W. Doran. APIC Stu-
dies in Data Processing, No. 15. Academic Pr, 1979, xi + 233 pp, $42. [ISBN: 0-12-220850-1] An 
introduction to the general principles of computer architecture and a detailed examination of their 
implementation in the Burroughs B6700 and B7700 computers. AO 


Computer Science, T(14-15: 1, 2). Mathematical Structures for Computer Science. Judith L. Gerst- 
ing. WH Freeman, 1982, xi + 452 pp, $22.50. [ISBN: 0-7167-1305-5] A broad overview of some basic 
structures and techniques involving logic, sets, graphs, Boolean algebras, groups, morphisms, simple 
coding, finite state and Turing machines, and formal languages. Informal style with many applica- 
tions, but also an appreciable introduction to the rigorous theoretical framework for expressing 
ideas in computer science. GHM 


Computer Science, P. An Analysis of Sparse Matrix Storage Schemes. M. Veldhorst. Math. Centre
Tracts, No. 150. Math Centrum, 1982, vii + 237 pp, Dfl. 30,45 (P). [ISBN: 90-6196-242-0] A number 
of storage schemes for sparse matrices are proposed and algorithms for manipulating them are 
analyzed in terms of their computational complexity. AO 


Computer Science, P. Lecture Notes in Computer Science-142: Problems and Methodologies in Mathemat- 
ical Software Production. Ed: P.C. Messina, A. Murli. Springer-Verlag, 1982, 271 pp, $12.50 (P). 
[ISBN: 0-387-11603-6] The presentations from an international seminar held at Sorrento, Italy, 
November 3-8, 1980. JAS 


Computer Science, T(14-15), S*. Data Base Management Systems: A Guide to Microcomputer Software. 
David Kruglinski. Osborne/McGraw-Hill, 1983, ix + 260 pp, $16.95 (P). [ISBN: 0-931988-84-5] Three 
chapters of elementary but substantial theory are followed by detailed discussions of four popular 
microcomputer systems: Condor, dBASE II, FMS-80, and MDBS III. The book closes with some remarks 
about the future, other microcomputer-based data base systems, and some sample programs for dBASE II 
and FMS-80. Its biggest weakness as an elementary text is the lack of exercises. A well-written 
approach that provides substantial insight about what’s going on--much more than a user’s manual.
JAS 


Computer Science, P, L*. Campus Computing Strategies. Ed: John W. McCredie. Digital Pr, 1983, xi 
+ 316 pp, $21. [ISBN: 0-932376-20-7] 10 chapters by directors of college and university computer
centers providing case studies in organization, planning and management of campus computer services. 
The book evolved from a 1981-82 study by EDUCOM, a national consortium specializing in academic com- 
puter services. LAS 


Control Theory, P. Contrôle impulsionnel et inéquations quasi variationnelles. A. Bensoussan, J.L.
Lions. Dunod, 1982, xv + 596 pp, 280 FF. [ISBN: 2-04-015428-0] Uses dynamic programming to estab- 
lish a relationship between stochastic control and quasi-variational inequalities in a way that 
leads to numerical applications. LAS 


Control Theory, T(18), P. Stochastic Control by Functional Analysis Methods. Alain Bensoussan. 
Stud. in Math. & Its Appl., V. 11. Elsevier North-Holland, 1982, xv + 410 pp, $58.25. [ISBN: 0-
444-86329-X] An advanced textbook on stochastic control, containing basic results, relying on func- 
tional analysis and the theory of partial differential equations. LCL 


Systems Theory, T(17-18: 1, 2), P. System Theory: A Hilbert Space Approach. Avraham Feintuch, 
Richard Saeks. Pure & Appl. Math., V. 102. Academic Pr, 1982, xiii + 310 pp. [ISBN: 0-12-251750-4] 
A unified theory of linear systems using Hilbert space techniques. Also presents the theory of 
operators defined on a Hilbert resolution space (essentially a Hilbert space with a time structure
axiomatically adjoined). AO


Applications, T(16-17: 1, 2), L. Applied Mathematics: Principles, Techniques, and Applications. 
James A. Cochran. Wadsworth Pub, 1982, x + 399 pp, $34.95. [ISBN: 0-534-98026-0] A textbook for an 
advanced course in modern applied mathematics. The topics covered include the calculus of varia- 
tions, stability theory, conformal mappings, generalized functions, integral equations, and asymp-
totics. AO


Applications, P. Nonlinear Problems: Present and Future. Ed: Alan Bishop, David Campbell, Basil 
Nicolaenko. Math. Stud., No. 61. Elsevier North-Holland, 1982, xi + 483 pp, $69.75 (P). [ISBN: 0- 
444-86395-8] Proceedings of an interdisciplinary conference inaugurating the Los Alamos Center for 
Nonlinear Studies: turbulence, reaction-diffusion processes, nonlinear analysis, and low dimensional
solids. LAS


Applications, S, P, L*. Iterated Maps on the Interval as Dynamical Systems. Pierre Collet, Jean-
Pierre Eckmann. Progress in Physics, No. 1. Birkhauser Boston, 1980, vii + 248 pp, $17.50. [ISBN: 
3-7643-3026-0] An exposition for mathematicians and theoretical physicists of the fascinating (and 
often perplexing) connection among iterated continuous maps of the interval, dissipative dynamical 
systems (e.g., physical systems with friction) when driven by an outside force, and "chaotic," "tur- 
bulent" or "strange" phenomena in biological, chemical, physical and economic systems. LAS 


Applications (Economics), P. Lecture Notes in Economics and Mathematical Systems-202: SMS--A Pro-
gram Package for Simulation and Gaming of Stochastic Market Processes and Learning Behavior. Ulrich
Witt, Joachim Perske. Springer-Verlag, 1982, vii + 266 pp, $18.50 (P). [ISBN: 0-387-11551-X] SMS is
a package of Fortran programs for Stochastic Market Simulation. This work describes the use of SMS
for experimental programming and Monte Carlo simulation studies. AO


Applications (Economics), P. Philosophy of Economics. Ed: W. Stegmüller, W. Balzer, W. Spohn.
Stud. in Contemp. Econ. Springer-Verlag, 1982, viii + 306 pp, $21 (P). [ISBN: 0-387-11927-2] 
Proceedings of a 1981 conference in Munich, the beginnings of a new philosophical enterprise. In 
three parts: structuralist theories (the use of model-theoretic tools in the analysis of theories); 
applications of structuralist theories to Marxian theory; and decision theory as a basis for 
economic activity. LAS 


Applications (Economics), P, L. Lecture Notes in Economics and Mathematical Systems-207: Modern 
Analysis of Value Theory. Y. Fujimori. Springer-Verlag, 1982, 165 pp, $12 (P). [ISBN: 0-387- 
11949-3] An exposition in terms of Leontief and von Neumann models of Marx’s theory of value that 
explains economic relationships among goods in terms of the value of human labor which is assumed 
invariant. LAS 


Applications (Economics), P. Essays in Game Theory and Mathematical Economics in Honor of Oskar 
Morgenstern. Robert J. Aumann, et al. Bibliographisches Institut, 1981, 196 pp, 42 DM (P). [ISBN: 
3-411-01609-4] Nine papers presented at a June 1979 memorial symposium at the University of
Mannheim. LAS 


Applications (Fluid Flow), T(17-18: 1), S. Computational Methods for Fluid Flow. Roger Peyret, 
Thomas D. Taylor. Ser. in Computational Physics. Springer-Verlag, 1983, x + 358 pp, $42.50. 
[ISBN: 0-387-11147-6] Numerical methods for solving Navier-Stokes equations, including finite 
difference, finite element, spectral and special methods. Applied to incompressible and inviscid 
and viscid compressible flows. A useful introduction to this subject. RWN 


Applications (Game Theory), T(16-18), S, P*, L. Game Theory in the Social Sciences: Concepts and 
Solutions. Martin Shubik. MIT Pr, 1982, ix + 514 pp, $35. [ISBN: 0-262-19195-4] A pluralistic use of
n-person game theory to reshape the methodology of theoretical economics: different concepts of 
"solution" reflect various economic models in ad hoc yet pragmatic arguments. Begins with several 
chapters devoted to mathematical modelling in the social sciences; then introduces the variety of 
solution concepts provided by the theory of games. A second volume will be devoted to specific 
economic applications. LAS 


Applications (Humanities), P*, L. Computing in the Humanities. Ed: Richard W. Bailey. Elsevier 
North-Holland, 1981, viii + 191 pp, $46.50. [ISBN: 0-444-86423-7] Papers selected from the Fifth 
International Conference on Computers in the Humanities held in May 1981 at Ann Arbor. Topics range 
from morphology to typography (TEX for humanists), from remedial teaching of foreign languages and 
UNIX aids for composition courses to data bases for biblical texts. "The extension of humanistic 
inquiry through data processing is merely scholarship carried out by other means." LAS 


Applications (Management), S, P*, L****. The Art and Science of Negotiation. Howard Raiffa. Harvard
U Pr, 1982, x + 373 pp, $18.50. [ISBN: 0-674-04812-1] Case studies in negotiation--political, civil,
economics--ranging from the fair division of estates to the law of the sea. Organized along game-
theoretic lines (two parties, one issue; two parties, many issues; several parties, many issues),
the analysis of cases employs an informal mix of elementary algebra, game theory, linear program-
ming, and other heuristic optimization techniques. LAS


Applications (Modelling), T*(14-15: 1), S, L. Applying Mathematics: A Course in Mathematical Model- 
ling. David Burghes, Ian Huntley, John McDonald. Math. & Its Applic. Ellis Horwood, 1982, 194 pp, 
$44.95. [ISBN: 0-85312-417-5] Appealing yet simple models (aircraft separation, broad jump at Mexico
City, communication satellites) introduce realistic modelling situations. Brief discussion of stra-
tegies and philosophy of modelling then leads to a rich collection of suggestions for modelling pro-
jects. An excellent resource for a modelling seminar, using little more than elementary calculus. 
LAS 


Applications (Modelling). Air Pollution: Assessment Methodology and Modeling. Ed: Eric Weber. 
NATO: Challenges of Mod. Soc., V. 2. Plenum Pr, 1982, viii + 329 pp, $39.50. [ISBN: 0-306-40997-6]
A comprehensive, state-of-the-art survey of air quality control modelling prepared for NATO as part
of a German pilot study, in cooperation with Belgium and the U.S. EPA. Concludes with a 60 page
glossary of modelling and air quality control terminology. LAS 


Applications (Physics), L. Formulas, Facts and Constants for Students and Professionals in 
Engineering, Chemistry and Physics. H.J. and K.H. Fischbeck. Springer-Verlag, 1982, xii + 251 pp, 
$14 (P). [ISBN: 0-387-11315-0] Includes mathematical formulae (tables of integrals, etc.), conver-
sion factors and values of fundamental constants for the SI system, the basic terms of spectroscopy,
atomic structure, and wave mechanics, and physical data of use in the laboratory. AO


Applications (Physics), T(16-18: 1), P*, L. Quantum Mechanics of Atoms and Molecules. Walter Thir-
ring. Course in Math. Physics, No. 3. Transl: Evans M. Harrell. Springer-Verlag, 1981, viii + 300
pp, $28. [ISBN: 0-387-81620-8] An axiomatic introduction to quantum mechanics which emphasizes con-
crete results that can be compared with experimental data. A good illustration of the use of modern
functional analysis in mathematical physics. AO


Applications (Physics), P. Quantum Mechanics in Mathematics, Chemistry, and Physics. Ed: Karl E.
Gustafson, William P. Reinhardt. Plenum Pr, 1981, ix + 506 pp, $59.50. [ISBN: 0-306-40737-X] The 37 
papers in this volume were originally presented at the March, 1980 AMS meeting in Boulder, Colorado 
at a special session on mathematical physics. AO 


Applications (Physics), P. Iterative Methods for Calculating Static Fields and Wave Scattering by 
Small Bodies. Alexander G. Ramm. Springer-Verlag, 1982, xii + 122 pp, $24 (P). [ISBN: 0-387- 
90682-7] This monograph presents iterative methods for solving exterior and interior static 
boundary-value problems. Most of the problems discussed are three-dimensional. AO


Applications (Physics), S(17-18), P. Spacetime and Geometry: The Alfred Schild Lectures. Ed:
Richard A. Matzner, L.C. Shepley. U of Texas Pr, 1982, x + 189 pp, $37.50. [ISBN: 0-292-77567-9]
Seven expository lectures given over a period of several years starting in 1977. The lectures
presented here are: "Why Is the Universe So Symmetrical?" by Dennis Sciama; "Null Congruences and 
Plebanski-Schild Spaces," by Ivor Robinson; "Linearization Stability," by Dieter Brill; "Nonlinear 
Model Field Theories Based on Harmonic Mappings," by Charles W. Misner; "Gravitational Fields in 
General Relativity," by Roy P. Kerr; "On the Potential Barriers Surrounding the Schwarzschild Black 
Hole," by Subrahmanyan Chandrasekhar; "The Initial Value Problem and Beyond," by James W. York, Jr. 
and Tsvi Piran. JAS 


Applications (Physics), P. Lecture Notes in Physics-162: Relativistic Action at a Distance: Classi- 
cal and Quantum Aspects. Ed: J. Llosa. Springer-Verlag, 1982, x + 263 pp, $15 (P). [ISBN: 0-387- 
11573-0] Proceedings of a workshop held in Barcelona, Spain, June 15-21, 1981. JAS 


Applications (Physics), P. General Relativity: An Introduction to the Theory of the Gravitational 
Field. Hans Stephani. Ed: John Stewart. Transl: Martin Pollock, John Stewart. Cambridge U Pr, 
1982, xvi + 298 pp, $49.50. [ISBN: 0-521-24008-5] An introduction to the general theory of rela- 
tivity for readers familiar with theoretical mechanics, electrodynamics, and special relativity. 
The essential concepts and formulae of Riemannian geometry are presented in the first few chapters. 
AO 


Applications (Physics), T. Introduction to Special Relativity. Wolfgang Rindler. Clarendon Pr,
1982, x + 185 pp, $13.95 (P); $39. [ISBN: 0-19-853182-6; 0-19-853181-8] A nicely written text. Lots
of exercises and a good index. Tensors are taken classically and the author has no qualms about
working in three-space rather than four-space when it’s advantageous to clarifying his point. How-
ever, the essence is space-time geometry and tensors. JAS 


Applications (Physics), T(17-18: 1, 2), P*. Classical Fields: General Relativity and Gauge Theory. 
Moshe Carmeli. Wiley, 1982, xvii + 650 pp, $44.95. [ISBN: 0-471-86437-4] This text presents a uni- 
fied treatment of gauge fields and gravitational fields. Approximately one-half of the work is 
devoted to each topic. AO 


Applications (Social Science), P, L. Power, Voting, and Voting Power. Ed: Manfred J. Holler. 
Physica-Verlag, 1982, 338 pp, $69.95 (P). [ISBN: 3-7908-0266-2] A normative and descriptive theory 
of democracy subsumed under two themes--theory of social choice and theory of games. A carefully 
planned, international volume intended to address the issues outlined in Arrow’s 1951 Nobel-prize 
winning monograph on social choice. LAS 


Reviewers 


RJA: Richard J. Allen, St. Olaf; RB: Richard Brown, Carleton; JNC: Judith N. Cederberg, St. Olaf; 
CEC: Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Carleton; JRG: Jennifer R. Galovich, St. 
Olaf; SG: Steven Galovich, Carleton; JG: Jack Goldfeather, Carleton; PH: Paul Humke, St. Olaf; RBK: 
Roger B. Kirchner, Carleton; RSK: Richard S. Kleber, St. Olaf; JK: Joseph Konhauser, Macalester; 
LCL: Loren C. Larson, St. Olaf; GHM: George H. Mills, Carleton; RWN: Richard W. Nau, Carleton; AO: 
Arnold Ostebee, St. Olaf; AWR: A. Wayne Roberts, Macalester; MS: Michael Schneider, Macalester; JS: 
John Schue, Macalester; SS: Seymour Schuster, Carleton; JAS: J. Arthur Seebach, Jr., St. Olaf; KS: 
Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; TAV: Theodore A. Vessey, St. Olaf; MW: Martha 
Wallace, St. Olaf; FLW: Frank L. Wolf, Carleton; PZ: Paul Zorn, St. Olaf. 


Section Reports 


An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
the author. Papers presented under special sponsorship as part of joint meetings are so noted in 
parentheses. 


New Jersey Section 


The fall meeting of the New Jersey section was held on October 31, 1982 at Rutgers University’s 
Newark campus. Thirty people registered for the meeting. 


Invited Addresses: 
"Time Warps, String Edits, and Macromolecules: The Mathematics of Sequence Comparisons," by Joseph 
Kruskal, Bell Labs. 
"A Mathematician's London," by Dorothy Goldberg, Kean College. 
Panel Discussion: 
"On the Retraining of Mathematicians in Computer Science," by Susan Marchand, Kean College 
(Moderator); Robert Gillman, Stevens Institute of Technology; Anita La Salle and James McHugh, 
New Jersey Institute of Technology; Richard Pirchner, Monmouth College. 


Student Paper: 


"Non-Associative Simple Rings," by Sin Min Lee, Stevens Institute of Technology. 


Eastern Pennsylvania and Delaware Section 


The annual fall meeting of the Eastern Pennsylvania and Delaware Section was held on November 
20, 1982 at West Chester State College. There were approximately 125 persons in attendance. 


Invited Lectures: 


"Some Bijective Proofs in Combinatorics," by Herbert S. Wilf, University of Pennsylvania. 
"Descartes, Euler, and Polyhedra," by Peter J. Hilton, State University of New York at Binghamton. 
"Mathematical Science Curricula," by Alan C. Tucker, State University of New York at Stony Brook. 


Panel Discussion: 


"Mathematical Science Curricula," by Gerald J. Porter, University of Pennsylvania (Moderator); 
Jeorg Mayer, Lebanon Valley College, and Larry E. Sigler, Bucknell University. 


Seaway Section 


The Seaway Section held its fall meeting at St. Lawrence University, Canton, New York on 
November 12-13, 1982 with a registration of 61 persons. 


Invited Address: 
"Retraining Mathematicians to Teach Computer Science," by Ed Dubinsky, Clarkson College. 
Papers Presented: 


"What Makes a Ring Finite?" by Howard E. Bell, Brock University. 

"Understanding and Teaching Mathematical Problem Solving," by Alan H. Schoenfeld, University of 
Rochester. 

"A Follow-Up Study of the Winners of the U.S.A. Mathematical Olympiad," by Nura D. Turner, State 
University of New York at Albany. 

"Graphical Solutions of Systems of n Linear Equations with n Equal to or Greater Than Three," by 
Robert D. Larsson, North Country Community College. 

"Teaching Statistics with Computers," by Chanchal Singh, St. Lawrence University. 

"Some Interesting Properties of the Integer Function," by Robert W. Sloan, Alfred University. 
"A Simulation Approach to Evaluating Experimental Designs," by Robin H. Lock, Clarkson College. 
"The Role of Monotone Method in the Study of Differential Equations," by Srinivasa Leela, State 
University College at Geneseo. 


Northeastern Section 


The annual meeting of the Northeastern Section was held at Worcester State College on November 
19-20, 1982. Approximately 140 attended the meeting. 


Invited Lectures: 


* "CUPM’s Recommendation for a Mathematics Curriculum in the 1980's," by Alan Tucker, SUNY at Stony 
Brook. 
"How to Succeed in Business With and Without Mathematics," by Richard Franklin, Director of 
Corporate Development, Instrumentation Labs. 
"The Maximum Principle and Analytic Functions (Christie Lecture)," by John Wermer, Brown 
University. 
"A Guided Discovery Approach to Calculus," by Donald Small, Colby College. 


Panel Discussions: 


"Textbook and Curriculum Needs for the 1980's," Bodh Gulati (Moderator), Southern Connecticut State 
College; Gary Ostedt, Wiley Publishers; Peter Divine, McGraw-Hill Publishers; Gary Folvin, 
Allyn and Bacon Publishers. 

"Why Math, or Why and How Should We Teach Mathematics in the 1980's," Carroll McMahon (Moderator), 
Bentley College; Richard Goller, Sanders Associates; Alan Suchat, Wellesley. 


Contributed Papers: 


* "IRA’s, the Higher Mathematics of 2 + 2, and the Fundamental Theorem of Inflation," by Robert 
Pease, Jr., Senior Editor, Science. 
"Stereographic Projection," by Ernest Manfred, U.S. Coast Guard Academy. 
* "Using the Computer in the Classroom," by John D. McKenzie, Jr., Babson College. 
"Mathematical Systems Analysis--A Course," by John Goulet, Colby College. 


Maryland-District of Columbia-Virginia Section 


The Maryland-District of Columbia-Virginia Section met on November 12-13, 1982 at Gallaudet 
College in Washington, D.C. 


Invited Addresses: 


"The Mathematical Sciences Curriculum K-12: What is Still Fundamental and What is Not?" by Marcia 
Sward, Associate Director of the Mathematical Association of America. 

"Applications of Statistics in the Telecommunications Industry," by James Maher, Bell Telephone 
Labs. 


Short Presentations: 


"On the Zeros of Riemann’s Continuous Non-differentiable Function," by Lee Whitt, Daniel H. Wagner 
Associates. 

"Self-inverse Integer Matrices," by Robert Hanson, James Madison University. 

"Measuring the Effectiveness of Computer Networks," by Robert Lewand, Goucher College. 

"Jeeps Are Not Scalars," by Carmen Castells, student at The Johns Hopkins University. 

"Algorithm for Singular Value Decomposition," by Hoy Booker, American University. 

"Math Anxiety in Minorities," by Queen E. Wiggs, University of the District of Columbia. 

"Markov Chains and Work Histories of the Disabled," by John Hennessey, Loyola College and the 
Social Security Administration. 

"Group Theory and Crystallography," by George Mackiw, Loyola College. 

"Buckled Polyhedra and Other Developable Surfaces," by Michael Goldberg. 

"Optimization of Cancer Chemotherapy and Radiotherapy," by Paul Massell, U.S. Naval Academy. 

"Ancient and Modern Mathematics in the St. John’s College Program," by Samuel Kutler, St. John’s 
College. 

"Rotationally Invariant Bergman Measures and Subharmonic Functions," by Richard B. Tucker, Mary 
Baldwin College. 

"A Computer Graphics Tutorial," by Stefan Shrier, Systems Planning Corporation. 

"Helping Students Improve Their Mathematics Learning Skills," by Lewis Hirsch, Rutgers University. 

"Security Algebras for Databases, etc.," by John Hays, Naval Research Laboratory. 

"Graphical Analysis Methods Applied to Examining the Relationship of Ability and Achievement Vari- 
ables to Successful Problem Solving," by Carolyn Maher, Rutgers University. 

Four faces with two mathematically famous names. The classical book bearing those names 
was for a time co-extensive with the whole of its field. See p. 211. 

If, for example, our integral involves the product of a logarithmic function and an algebraic 
function, we choose u to be the log part and, keeping criterion (1) in mind, the algebraic part is 
chosen for dv. By making this selection, in $\int v\,du$, the logarithmic character has disappeared by 
virtue of the differentiation. The antidifferentiation of the algebraic part results (most of the time) 
in an algebraic v. Consequently, the integral $\int v\,du$ is strictly algebraic in its integrand and 
hopefully easier than our original integral. 

Similarly, if we have any combination of two of these types of functions in our original 
integral, we choose for u that type that appears first in LIATE and dv is whatever is left, that part 
that appears second in LIATE. 


Rationale: Differentiating logs and inverse trig functions changes their character to algebraic, 
while algebraic functions run the “middle of the road”; differentiating them yields algebraic, 
while integration sometimes yields nonalgebraic, so we would probably differentiate them before 
integrating. Trig and exponential functions “bring up the rear” since their differentiation or 
antidifferentiation results in similar functions. 


As an example, consider $\int x \ln(x)\,dx$. Using LIATE, we pick u = ln(x) and dv = x dx. We 
obtain $du = x^{-1}\,dx$ and $v = x^2/2$. Consequently, 

$$\int x \ln(x)\,dx = \frac{x^2}{2}\ln(x) - \int \frac{x}{2}\,dx = \frac{x^2}{2}\ln(x) - \frac{x^2}{4} + C.$$
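
The worked example can be checked numerically. The sketch below is only an illustration (the function names are made up for this purpose): it differentiates the antiderivative found above by a central difference and compares the result with the integrand x ln(x).

```python
import math


def integrand(x):
    # The function being integrated in the example: x * ln(x).
    return x * math.log(x)


def antiderivative(x):
    # Result of integrating by parts with u = ln(x), dv = x dx (taking C = 0).
    return 0.5 * x * x * math.log(x) - 0.25 * x * x


def central_difference(f, x, h=1e-5):
    # Symmetric finite-difference approximation to f'(x).
    return (f(x + h) - f(x - h)) / (2.0 * h)


def max_error(points):
    # Largest gap between the derivative of the antiderivative and the integrand.
    return max(abs(central_difference(antiderivative, x) - integrand(x))
               for x in points)


if __name__ == "__main__":
    print(max_error([0.5, 1.0, 2.0, 5.0]))
```

A small error at every sample point is consistent with the choice u = ln(x), dv = x dx having produced a correct antiderivative.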


Students have liked using LIATE and find that choosing u and dv is much less frightening. The 
author has yet to see an integral done by integration by parts to which LIATE cannot be 
successfully applied. 

To be able to use integration by parts, perhaps it is better LIATE than never. 


Reference 


1. Howard Anton, Calculus with Analytic Geometry, Wiley, New York, 1980. 


MISCELLANEA 


95. For the most part [Glaisher] accepted or rejected contributions on his own responsibility. 
The best editors have of course a wonderful power of judging the quality of a manuscript without 
attempting to read it; but the task becomes steadily more difficult as mathematics grows and 
specializes... A private journal, controlled autocratically by a kindly and discriminating editor, 
can serve many useful purposes, and particularly that of giving early encouragement to beginners, 
on whom societies, with councils and referees, are sometimes very severe. 


—G. H. Hardy, “Dr. Glaisher and the ‘Messenger of Mathematics,’” Messenger of 
Mathematics, 58 (1929) 159-160. 


ANSWER TO PHOTOS ON PAGE 195. 


Top: P. S. Alexandroff and H. Hopf (topology). Bottom: A. D. Alexandroff and E. Hopf 
(analysis, of two different but overlapping kinds). 


UNSOLVED PROBLEMS 
EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions ordinarily 
encountered in undergraduate mathematics. Each problem should be accompanied by relevant references (if any 
are known to the author) and by a brief description of known partial results. Manuscripts should be sent to 
Richard Guy, Department of Mathematics and Statistics, The University of Calgary, Calgary, Alberta, Canada 
T2N 1N4. 


AN OLLA-PODRIDA OF OPEN PROBLEMS, OFTEN ODDLY POSED 


RICHARD K. GUY 


Many problems are submitted to the MONTHLY, but a large proportion don’t appear in print, 
for a variety of reasons. Some problems submitted to this section are brief and with no known 
connexions with the literature, so that they don’t fit the format we’ve been using. But this isn’t a 
good reason for not airing them. 

Some problems may not be well enough posed, so that the real problem is to decide what is the 
problem. This may have several solutions, and the resulting problems may vary from the trivial to 
the impossible, so that, once existence and lack of uniqueness have been established, an 
intermediate problem may be to select the solution which is the best problem! For example, 


José M. Bayod, University of Santander, Santander, Spain, 


notes that in some parts of his country, a farmer owns a part of the mountains that surround his 
farm, according to the “flowing water law” which can be stated thus: 


A spot of rain lands at A on the mountain and flows downhill until it reaches 
a point B on the farm. Then the owner of point B in the valley also owns the 
point A on the mountain. 


The main problem is to build a mathematical model for the problem: given an area in the 
valley, what part of the mountains is associated with it? A rough formulation would be: given a set 
of contour lines, is there a (unique) set of orthogonal trajectories (lines of greatest slope)? Joel 
Brenner warns about the many pitfalls. It may be necessary to know how the law is interpreted. A 
peak, or even a ridge, can be neglected as being of measure zero, but what about a plateau, 
whether horizontal or sloping? In practice, contour lines often have a discontinuous tangent at a 
stream bed, while the stream beds are likely to be of critical importance. Finally, contours and 
stream beds are often time-dependent. How can mathematicians and lawyers help one another? 
R. B. Kusner suggests that one may wish to assume that the contour lines are the level curves of a 
Morse function f (i.e., a smooth function with isolated nondegenerate critical points) since Morse 
functions are dense in the continuous functions and their complement in the smooth functions has 
measure zero. Unique orthogonal trajectories to the contour lines are generated by the gradient 
field Df. 
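
One way to experiment with the “flowing water law” is to follow the downhill gradient numerically. The sketch below assumes a made-up terrain with two valleys (near x = −1 and x = +1) separated by a ridge along x = 0; the height function, the step size, and the farm names are all hypothetical and chosen only for illustration.

```python
def height(x, y):
    # Hypothetical terrain: valley floors near (-1, 0) and (1, 0), ridge along x = 0.
    return (x * x - 1.0) ** 2 + y * y


def gradient(x, y):
    # Exact gradient of the hypothetical height function above.
    return 4.0 * x * (x * x - 1.0), 2.0 * y


def flow_downhill(x, y, step=0.01, max_steps=100000, tol=1e-6):
    # Follow the line of greatest slope by small steps against the gradient.
    for _ in range(max_steps):
        gx, gy = gradient(x, y)
        if gx * gx + gy * gy < tol * tol:
            break
        x, y = x - step * gx, y - step * gy
    return x, y


def owner(x, y):
    # The owner of the valley point B where the water ends up also owns A = (x, y).
    bx, _ = flow_downhill(x, y)
    return "east farm" if bx > 0 else "west farm"


if __name__ == "__main__":
    print(owner(0.3, 0.5), owner(-0.2, 1.0))
```

Points exactly on the ridge x = 0 never leave it in this model, which is the measure-zero case mentioned above; plateaus and stream beds would need a more careful treatment than this sketch attempts.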

Bayod further writes that there are remarkably few difficulties in real life; perhaps because so 
far neither mathematicians nor lawyers have been around! It was only curiosity that led him to 
formulate the problem, not the need to fill a legal gap. Yet the mountain often has pines, very 
good for wood, and is very valuable. 


Robert B. Kusner, University of California, Berkeley, CA 94720 


sends a less controversial problem on r-equilateral sets which still leaves something to the reader’s 
imagination, however. 

Suppose r is a positive real number and M is a metric space with distance d. Then a subset E of 
M is called r-equilateral, and we write d(E) = r, if d(x, y) = r for every pair of distinct points 
x, y in E. Also define 


e(r, M) = max{#E | d(E) = r} 
where #E is the cardinality of the set E, and call e(M), the maximum of e(r, M) taken over all 
positive real numbers, the equilateral dimension of M. 


General Problem. Compute the equilateral dimension for some familiar metric spaces. 
The most familiar examples of all are the normed linear spaces $L_p(n)$ defined on $R^n$ for 
integers $p \ge 1$ by the norm 

$$\|x\|_p = \Bigl(\sum_{1 \le i \le n} |x_i|^p\Bigr)^{1/p}.$$


FIG. 1 


Figure 1 is a picture of the unit “spheres” when n = 2. For p = 1 we have the inside square; 
p = 2 is a circle; p = 3 and p = 7 are also shown, and the dotted square is the limit as $p \to \infty$, the 
unit “sphere” in the space $L_\infty(n)$ with norm 

$$\|x\|_\infty = \max_{1 \le i \le n} |x_i|.$$


If you look at pictures, you should be able to convince yourself that $e(L_p(n)) \ge n + 1$ for all 
p, $1 \le p \le \infty$, and all n > 0. On the other hand, you will find that $e(L_p(n)) \le 2^n$, and Figure 2 
may help you see that equality holds when $p = \infty$. When p = 1, the lower bound on $e(L_1(n))$ is 
2n. 
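
The two lower bounds just mentioned can be confirmed by brute force for small n. The sketch below uses two standard candidate sets, which are an assumption of this illustration rather than something taken from the figures: the 2^n vertices of the cube {0, 1}^n under the max norm, and the 2n points ±e_i under the L_1 norm. All helper names are hypothetical.

```python
from itertools import combinations, product


def dist_l1(x, y):
    # L_1 distance: sum of the coordinate differences.
    return sum(abs(a - b) for a, b in zip(x, y))


def dist_linf(x, y):
    # L_infinity distance: largest coordinate difference.
    return max(abs(a - b) for a, b in zip(x, y))


def is_equilateral(points, dist):
    # True if all pairs of distinct points lie at one common distance.
    return len({dist(p, q) for p, q in combinations(points, 2)}) == 1


def cube_vertices(n):
    # The 2**n points of {0, 1}^n.
    return list(product((0, 1), repeat=n))


def signed_unit_vectors(n):
    # The 2n points +e_i and -e_i.
    points = []
    for i in range(n):
        for sign in (1, -1):
            p = [0] * n
            p[i] = sign
            points.append(tuple(p))
    return points


if __name__ == "__main__":
    n = 4
    print(len(cube_vertices(n)), is_equilateral(cube_vertices(n), dist_linf))
    print(len(signed_unit_vectors(n)), is_equilateral(signed_unit_vectors(n), dist_l1))
```

This only exhibits equilateral sets of the stated sizes; it says nothing about whether larger ones exist, which is the content of Problem 0.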

Also the case p = 2 is quite special: every isometry (distance-preserving map) is affine, and all 
the affine maps generated by rotation and translation are isometries. Check that if d(E) = r and 
#E = k + 1, then the affine span of E has dimension k as an affine space. Therefore $e(L_2(n)) = n + 1$. 


Problem 0. Does $e(L_1(n)) = 2n$? 
Problem 1. Compute $e(L_p(n))$ for $2 < p < \infty$. 


Kusner’s guess is n + 1, but he notes that there are lots of numbers between n + 1 and $2^n$. 
Here is a “solution sketch.” In the range $1 < p < \infty$ the r-spheres are all smooth (n − 1)-manifolds. 
If d(E) = r and #E = k, then a point x such that d(E, x) = r must lie on the k 
r-spheres about the points of E. Since the dimension of the intersection decreases by 1 “each 
time” (“general position”) the biggest #E is n + 1. Certainly the nonsmoothness for p = 1 and 
$p = \infty$ accounts for the difference. 

Riemannian manifolds are metric spaces which are “locally like $R^n$.” To see that $e(M) \ge n + 1$ 
for such an n-dimensional manifold $M^n$, let $x \in M^n$ and pick a point y on the (smooth) r-sphere 
about x. The r-sphere about y intersects the one about x in a set of dimension $\ge n - 2$. Now 
proceed inductively as in the above “solution sketch.” 


FIG. 3 


Observe that $e(S^n) = n + 2$ for the round sphere $S^n \subset R^{n+1} = L_2(n + 1)$. It may be that the 
following example shows that $e(M^2)$ is arbitrarily large. For any k, pick k points on a 
2-dimensional (diffeomorphic but not isometric to the round) sphere $M^2$. Think of $M^2$ as made of 

1983] UNSOLVED PROBLEMS 199 


putty and pull out long fingers at these k points as in Figure 3. If the metric on $M^2$ is perturbed to 
correspond to this stretching, we get k points which are arbitrarily close to being equilateral. The 
open question is: can we go all the way to equilateral? 

If yes, define u(M) as the minimum of e(M, r) taken over all positive reals. Then $u(M^2) = 3$ 
for these examples. 


Problem 2. Compute an upper bound for $u(M^n)$ and a lower bound for $e(M^n)$ where $M^n$ is a 
Riemannian manifold of dimension n. (Guess: n + 1.) 
If no, then the problem is 


Problem 2′. Compute an upper bound for $e(M^n)$. (Guess: n + 2.) 


The behavior of $e(M^n, r)$ for various r should give some rough idea of the “shape” of a 
Riemannian manifold $M^n$. For example, if $e(M^n, r) = n + 2$ we expect $M^n$ to have a “spherical 
protrusion” of radius comparable to $r/\pi$ as in Figure 4. Can you make this more precise? 


FIG. 4 


Problem 3. Describe those Riemannian manifolds $M^n$ for which 


(a) $e(M^n) = n + 1, n + 2, \ldots$, or 
(b) $u(M^n) = 1, 2, \ldots, n + 1$, or 
(c) $e(M^n, r) = 1, 2, \ldots, n + 1, n + 2, \ldots$ for all r in D, some preassigned domain. 

In particular, what are the complete Riemannian manifolds $M^n$ such that e(M, r) = n + 1 for all r? 


Problem 4. Classify all complete metric spaces M with e(M, r) = n + 1 for all r. Or to what 
extent does e(M, r) = n + 1 characterize closed submetric spaces of the Riemannian examples 
found above? 


Another good example of a problem where the problem is to decide what the problem is starts 
from R. P. Agnew, Differential Equations, McGraw-Hill, NY, 2nd ed., 1960, pp. 39-40: 


One day it started snowing at a heavy and steady rate. A snowplow started 
out at noon, going at 2 miles the first hour and 1 mile the second hour. What 
time did it start snowing? 


To solve this, it is assumed that the plow clears a constant volume of snow per unit of time. 
This is unrealistic since the plow would move infinitely fast when there’s no snow (though that 
doesn’t arise in this problem). Perhaps there should also be an upper bound on the depth of snow 
the plow can cope with. 
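
Under the stated assumption (constant volume cleared per unit time, so that speed is inversely proportional to depth), the original problem reduces to one equation in one unknown. If the snow began T hours before noon, the distance covered between times a and b after noon is proportional to ln((b + T)/(a + T)), so the data “2 miles, then 1 mile” give ln((1 + T)/T) = 2 ln((2 + T)/(1 + T)). The sketch below solves this by bisection; the function names and the bracketing interval are choices made for this illustration.

```python
import math


def hour_gap(T):
    # Zero exactly when the first-hour distance is twice the second-hour distance.
    first = math.log((1.0 + T) / T)
    second = math.log((2.0 + T) / (1.0 + T))
    return first - 2.0 * second


def bisect(f, lo, hi, tol=1e-12):
    # Plain bisection on a bracketing interval [lo, hi].
    f_lo = f(lo)
    if f_lo * f(hi) > 0:
        raise ValueError("root is not bracketed")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


def hours_before_noon():
    # Time, in hours before noon, at which the snow started under this model.
    return bisect(hour_gap, 0.01, 10.0)


if __name__ == "__main__":
    T = hours_before_noon()
    print(T, "hours, i.e. about", round(60 * T), "minutes before noon")
```

The equation simplifies to T² + T − 1 = 0, so T = (√5 − 1)/2, roughly 37 minutes before noon. Replacing `hour_gap` is the natural place to try the alternative assumptions suggested here, such as a cap on speed or on depth.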


C. M. Bender, A. Toomre and Frederic Y. M. Wan [The snowplow problems, Applied Math. 
Notes, Vancouver, B.C. 1(1975) #1, 7-11] ask the additional question: 


Suppose a second identical snowplow starts sometime after noon, say at 
t = T, in the wake of the first. If it ever catches up with the first plow, when, 
where and how? 


The answer to “how” is “with infinite speed”! They also ask the question: 


Three identical snowplows started at noon, 1 p.m. and 2 p.m., respectively. 
All three collided some time later. When did it start snowing? 


The answer is given as 11:30 a.m. Answer all these questions under alternative assumptions. 
Also answer the following variant asked by M. G. Stone: 


The snowplow starts out at noon. It turns around (instantaneously!) at 2 p.m. 
and returns along its own path at 3 p.m. What time did it start snowing? At 
what time should the plow have turned around to return by 2 p.m.? 


IS A DISTANCE ONE PRESERVING MAPPING BETWEEN METRIC SPACES ALWAYS AN ISOMETRY? 


THEMISTOCLES M. RASSIAS 
4, Zagoras Street, Paradissos, Amaroussion, Athens, Greece 


1. Preliminaries. We begin with the definition of an isometry: Let X, Y be two metric spaces, 
$d_1$, $d_2$ the distances on X and Y. A bijective mapping $f: X \to Y$, of X onto Y, is defined to be an 
isometry if $d_2(f(x), f(y)) = d_1(x, y)$ for all elements x, y of X. If $f: X \to Y$ is an isometry, then 
the inverse mapping $f^{-1}: Y \to X$ is an isometry of Y onto X. Two metric spaces X and Y are 
defined to be isometric if there exists an isometry of X onto Y. It thus follows that an isometry is 
an isomorphism for the metric space structures. Some of the properties of an isometry are 
mentioned in [3] and in [4]. We now state in which sense an incomplete space can be fattened out 
to be complete: If $(X, d_1)$ is an incomplete metric space, then there exists a complete metric space 
$\tilde{X}$ so that X is isometric to a dense subset of $\tilde{X}$. Mazur and Ulam [5] have proved that every 
isometry of a normed real vector space onto a normed real vector space is a linear mapping up to 
translation. Consider then the following condition (distance one preserving property), for 
$f: X \to Y$. 


(DOPP) Given $x, y \in X$ with $d_1(x, y) = 1$. Then $d_2(f(x), f(y)) = 1$. 


2. The Problem. Let $f: X \to Y$ be a mapping (not necessarily continuous) satisfying condition 
(DOPP). Is $f: X \to f(X) \subset Y$ an isometry? 

The problem still remains open even for the case where $X = R^n$ and $Y = R^m$ with $2 \le n < m$ 
(see for example [6, p. 277]). Beadle [1] has covered a number of cases for mappings $f: R^n \to R^m$ 
that preserve some distance. For these mappings it is clear that $n \le m$ because $R^m$ has equilateral 
n-simplices if and only if $n \le m$. Beckman and Quarles [2] proved that f is an isometry if 
$1 < n = m < \infty$. If n < m, f might not be an isometry if m is too large. It is not yet known if 
there is a unit distance preserving mapping $f: R^2 \to R^3$ which is not an isometry. 


NOTES 
EDITED BY SABRA S. ANDERSON, SHELDON AXLER, AND J. ARTHUR SEEBACH, JR. 


Material for this department should be sent to Professor J. Arthur Seebach, Jr., Department of Mathematics, St. 
Olaf College, Northfield, MN 55057. 


AN ELEMENTARY PROOF OF THE ISOMORPHISM $C^* \cong S^1$ 


L. RICHARD DUFFY 
33 Fellsway East, Malden, MA 02148 


In his introductory algebra textbook [1], Louis Shapiro makes the following remark at the end 
of the section on group isomorphisms: 


Thus isomorphism is a simple and important concept. It is also complicated. For instance, only 
in the last few years were the nonzero complex numbers shown to be isomorphic to the 
complex numbers of absolute value 1. 


This fact, which may seem a bit surprising (at least to beginning students of algebra), appears to 
have had its first published proof in [2]. 

Each of these two groups is a “divisible” group, meaning that the equation $x^n = g$ has a 
solution x in the group for every positive integer n and every g in the group. Using this fact, the 
proof cited above shows that the isomorphism in question is a simple corollary of the more 
general Structure Theorem for Divisible Abelian Groups (see [3], for example). While not 
excessively difficult, this latter theorem does take a while to establish from first principles, and its 
full power is by no means necessary to demonstrate the isomorphism claimed above. The 
following proof requires only the rudiments of group theory, together with Zorn’s Lemma. 

Let C* denote the multiplicative group of nonzero complex numbers, $S^1$ its unit circle 
subgroup, R the additive group of reals, and Z the integers. The map sending $re^{i\theta}$ to the pair 
$(\log r, \theta/2\pi + Z)$ immediately establishes the isomorphisms $C^* \cong R \times R/Z$ and $S^1 \cong R/Z$. 
Thus it will be sufficient to show that $R \times R/Z \cong R/Z$. To do this, we’ll need three basic facts: 

I. In a vector space, any independent set of elements can be enlarged to a basis. (Easily 
proved by means of Zorn’s Lemma.) 

II. If $\varphi: G \to H$ is an isomorphism of groups and $A \trianglelefteq G$, then $\varphi(A) \trianglelefteq H$ and the two quotient 
groups $G/A$ and $H/\varphi(A)$ are isomorphic. 

III. If G, H are two groups with normal subgroups A, B, respectively, then $A \times B \trianglelefteq G \times H$, and 
$(G \times H)/(A \times B) \cong G/A \times H/B$. 

To begin with, R is a vector space over the rational field Q, and the singleton set {1} is 
obviously independent over Q. So by I, there is a basis $\mathcal{B}$ of R over Q with $1 \in \mathcal{B}$. Then the set 
$\mathcal{B}^* = \mathcal{B} \times \{0\} \cup \{0\} \times \mathcal{B}$ is a basis of the vector space R × R over Q (in which the scalar 
multiplication is the obvious one given by q · (r, s) = (qr, qs)). Furthermore, $\mathcal{B}^*$ is a union of 
two disjoint sets, each equinumerous to the infinite set $\mathcal{B}$, whence $\mathcal{B}^*$ is equinumerous to $\mathcal{B}$ (by a 
standard fact in set theory which is also proved by Zorn’s Lemma). 

Now let $\varphi: \mathcal{B}^* \to \mathcal{B}$ be any one-one correspondence which maps the ordered pair (0, 1) to 1. 
Then it is a simple matter to show that $\varphi$ extends uniquely to a rational-vector-space isomorphism 
$\Phi: R \times R \to R$, which is also a group isomorphism, of course. If n is any integer, then 

$$\Phi((0, n)) = \Phi(n \cdot (0, 1)) = n \cdot \Phi((0, 1)) = n \cdot \varphi((0, 1)) = n \cdot 1 = n.$$

Thus $\Phi$ restricts to an isomorphism of the subgroup {0} × Z onto Z. 
Now we're finished, because II gives $(R \times R)/(\{0\} \times Z) \cong R/Z$, and III gives 
$(R \times R)/(\{0\} \times Z) \cong R/\{0\} \times R/Z \cong R \times R/Z$, whence $R \times R/Z \cong R/Z$, as desired. 


MATRICES WITH INTEGER ENTRIES AND INTEGER EIGENVALUES 


J.-C. RENAUD 
Mathematics Department, University of Papua New Guinea 
Waigani, Papua New Guinea 


When eigenvalues are first covered in linear algebra, it is often convenient to give numerically 
simple exercises—thus it may be desirable to provide problems which have integer solutions. It is 
reasonably easy to find examples of 2 X 2 and 3 X 3 matrices with integer entries and integer 
eigenvalues, but this leads naturally to the question: exactly how can one construct, in general, an 
n X n matrix with such a property? The theorem below gives a complete answer, in the sense that 
all such matrices can be built up in the same elementary way. Remarkably, its proof is based on 
only three well-known results. 


THEOREM. The n × n matrix A with integer entries has integer eigenvalues if and only if it is 
expressible in the form 

$$A = \sum_{i=1}^{n-1} u_i' v_i + k I_n,$$

where the $u_i$ and $v_i$ are row vectors with n integer components such that $u_i \cdot v_j = 0$ for $1 \le i < j \le n - 1$, 
k is an integer, and $I_n$ is the unit n × n matrix. 
The eigenvalues are $k, u_1 \cdot v_1 + k, \ldots, u_{n-1} \cdot v_{n-1} + k$. 


Discussion and Proof. Consider the n × n matrix A with integer entries and eigenvalues. We can 
choose one eigenvalue, say k, and form $B = A - kI_n$. Clearly B also has integer entries and 
eigenvalues, one of which is 0. It is now sufficient to show that B is expressible in the form of the 
summation term in A above. The first step is to show: 


LEMMA 1. Let B be an n X n matrix with integer entries whose determinant is zero. Then B is 
expressible in the form B = XY where X is n X (n— 1), Y is (n — 1) Xn, and all entries are 
integers. 


Proof. This is based on a well-known result (see for example Theorem 7.10 in [2]): since the 
determinant of B is zero, B is expressible in the form $B = M_1 D M_2$ where $M_1$ and $M_2$ are invertible 
and D has the form diagonal $(d_1, \ldots, d_{n-1}, 0)$, all entries of $M_1$, $M_2$, and D being integers. Now 
$D = D_1 D$ where $D_1$ = diagonal (1,..., 1, 0) is n × n, and $B = (M_1 D_1)(D M_2)$. But the nth 
column of $M_1 D_1$ and the nth row of $D M_2$ are null: delete these, and call the remainder X and Y, 
respectively. Then B = XY as required. 

Note that the converse also holds. 

Two other fundamental lemmas are required: Lemma 2 may be found in the exercises of 
Chapter 22 in [1], attributed to Barton, while Lemma 3 is Theorem III.12 in [3]. These are: 


LEMMA 2. Let B = XY where X is n × (n − 1), Y is (n − 1) × n. Then the characteristic 
polynomial of B, $|B - \lambda I_n|$, is equal to the polynomial $-\lambda|YX - \lambda I_{n-1}|$. 

LEMMA 3. Let C be an m × m matrix with integer entries and integer eigenvalues. Then there 
exists T, m × m and invertible, |T| = ±1, with integer entries such that $T^{-1}CT$ is upper triangular. 


These three lemmas are sufficient to complete the proof of the theorem. We have B = XY, and 
there exists T, (n − 1) × (n − 1), such that $TYXT^{-1}$ is upper triangular. Moreover, the eigenvalues 
of XY are just 0 and those of YX, or those of $T(YX)T^{-1}$. 


202 J.- C. RENAUD [March 


2. J. R. Clay, The punctured plane is isomorphic to the unit circle, J. Number Theory, 1 (1969) 500-501. 
3. J. J. Rotman, The Theory of Groups: an Introduction, 2nd ed., Allyn and Bacon, Boston, 1973, p. 186. 


Now rewrite $B = (XT^{-1})(TY)$ and set $U = XT^{-1}$, $V = TY$. That is, B = UV and VU is upper 
triangular. Letting $u_i'$ be the ith column of U and $v_i$ be the ith row of V gives 

$$B = \sum_{i=1}^{n-1} u_i' v_i, \quad\text{and}\quad u_i \cdot v_j = 0 \ \text{for } i < j.$$

Since the eigenvalues of B are those of VU, together with 0, these are 0 and $u_i \cdot v_i$, i = 1,..., n − 1. 
Hence A is of the requisite form and has the stated property. 
This is really only a proof of the “only if’ case; however, the “if” case is merely a direct 
retracement of the above steps. 
We give an example of this construction for a 4 X 4 case: 


2 3 
y {fh -2 1 +f jfl-) 1b do ud+] Fit 2-1 
0 2 
1 0 -3 -10 6 0 
1 4 3 3 
l- = 
, 0 -2 3 4 
0 1 4 6 0 1 


has eigenvalues $1,\ 1 + 2,\ 1 + (-3),\ 1 + 2$, or $1, 3, -2, 3$.
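The construction in the theorem can be checked mechanically. The sketch below is only an illustration: the vectors `us`, `vs` and the helper names are hypothetical choices made here (they are not the vectors of the printed example), picked so that $u_i \cdot v_j = 0$ for $i < j$ and so that the eigenvalues again come out as $1, 3, -2, 3$. The characteristic polynomial is compared with $\prod(\lambda_i - x)$ at seven integer points, which is enough to identify a polynomial of degree 4.

```python
from math import prod

def dot(u, v):
    """Integer dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def outer_sum_matrix(us, vs, k):
    """Return A = sum_i u_i^t v_i + k*I as a list of integer rows."""
    n = len(us[0])
    a = [[k if r == c else 0 for c in range(n)] for r in range(n)]
    for u, v in zip(us, vs):
        for r in range(n):
            for c in range(n):
                a[r][c] += u[r] * v[c]
    return a

def det(m):
    """Exact determinant by Laplace expansion (fine for small matrices)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for c, entry in enumerate(m[0]):
        minor = [row[:c] + row[c + 1:] for row in m[1:]]
        total += (-1) ** c * entry * det(minor)
    return total

def char_poly_at(a, x):
    """Value of det(A - x*I) at the integer x."""
    n = len(a)
    return det([[a[r][c] - (x if r == c else 0) for c in range(n)] for r in range(n)])

# Hypothetical data (assumption): u_i . v_j = 0 whenever i < j.
us = [(1, 1, 0, 0), (0, 1, 1, 0), (0, 0, 1, 1)]
vs = [(2, 0, 1, -1), (1, -1, -2, 0), (1, -1, 1, 1)]
k = 1

orthogonality_ok = all(dot(us[i], vs[j]) == 0
                       for i in range(len(us)) for j in range(i + 1, len(vs)))
A = outer_sum_matrix(us, vs, k)
eigenvalues = [k] + [dot(u, v) + k for u, v in zip(us, vs)]
print(A)
print(eigenvalues)
```

Changing `us`, `vs`, and `k` (while keeping the orthogonality condition) produces other integer matrices with integer eigenvalues for classroom use.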


REMARK. The author has used this technique in constructing exercises for students, and found
it quite useful for $3 \times 3$ and $4 \times 4$ cases. However, the restriction $u_i \cdot v_j = 0$ for $i < j$ makes it
clumsy at higher levels (which are in any case beyond classroom work).


Problem. It would be pleasant to be able also to give a nice formula for the eigenvectors. For
the $3 \times 3$ case, we obtain

eigenvalue               eigenvector
$k$                      $v_1 \times v_2$
$k + u_1 \cdot v_1$      $u_1$
$k + u_2 \cdot v_2$      $v_1 \times (u_1 \times u_2) + (u_2 \cdot v_2)u_2$
                         $= (u_2 \cdot v_2 - u_1 \cdot v_1)u_2 + (u_2 \cdot v_1)u_1$

This is almost neat, but probably will not generalize easily.
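The $3 \times 3$ eigenvector formulas can be tested numerically. In the sketch below the vectors `u1`, `u2`, `v1`, `v2` and the value `k` are hypothetical (chosen here so that $u_1 \cdot v_2 = 0$); for other data a formula may return the zero vector, in which case it gives no eigenvector.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Cross product in three dimensions."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def scale(c, a):
    return tuple(c * x for x in a)

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def apply(matrix, x):
    """Matrix times column vector, returned as a tuple."""
    return tuple(dot(row, x) for row in matrix)

# Hypothetical data (assumption): u1 . v2 = 0 as the theorem requires.
u1, u2 = (1, 2, 0), (0, 1, 1)
v1, v2 = (1, 1, 1), (2, -1, 3)
k = 1

# A = u1^t v1 + u2^t v2 + k*I
A = [[u1[r] * v1[c] + u2[r] * v2[c] + (k if r == c else 0) for c in range(3)]
     for r in range(3)]

# (eigenvalue, eigenvector) pairs taken from the table above.
pairs = [
    (k, cross(v1, v2)),
    (k + dot(u1, v1), u1),
    (k + dot(u2, v2), add(cross(v1, cross(u1, u2)), scale(dot(u2, v2), u2))),
]

for lam, x in pairs:
    print(lam, x, apply(A, x) == scale(lam, x))
```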
Throughout this note, $\Omega$ will denote a nonempty set, $\Sigma$ a $\sigma$-algebra of subsets of $\Omega$, and $\mu$ a
positive measure on the measurable space $(\Omega, \Sigma)$. We derive characterizations of the measure
spaces $(\Omega, \Sigma, \mu)$ such that $0 < p < q$ implies either $L^p(\mu) \subset L^q(\mu)$ or $L^q(\mu) \subset L^p(\mu)$, thus
completing and improving some results of B. Subramanian [1].
DEFINITION. A $\mu$-measurable set $E \subset \Omega$ such that $0 < \mu(E) \le +\infty$ is called an atom whenever
for any $\mu$-measurable subset $E_1$ of $E$ we have either $\mu(E_1) = 0$ or $\mu(E_1) = \mu(E)$.


PROPOSITION 0.I [2, pg. 67]. If $E$ is a set of positive measure such that neither $E$ nor any of its
subsets is an atom, and $0 < a < \mu(E)$, then there exists a subset $E_1$ of $E$ such that $\mu(E_1) = a$.


PROPOSITION 0.II [2, pg. 67]. If $E$ is a set of $\sigma$-finite $\mu$-measure, then $E = E_1 \cup E_2$ uniquely,
where neither $E_1$ nor any of its measurable subsets is an atom and $E_2$ is a union of an at most
countable number of atoms of finite measure.

With these preliminaries, we have: 


THEOREM 1. The following conditions on the measure space $(\Omega, \Sigma, \mu)$ are equivalent:

(i) $L^p(\mu) \subset L^q(\mu)$ for some pair $p, q$ of positive numbers such that $p < q$.

(ii) There does not exist a disjoint sequence $\{E_n\}$ of sets of positive measure in $\Omega$ such that the
sequence $\{\mu(E_n)\}$ tends to zero.

(iii) Every set of finite measure in $\Omega$ is a finite pairwise disjoint union of atoms, and the measures
of the atoms in $\Omega$ are bounded away from zero.

(iv) $L^p(\mu) \subset L^q(\mu)$ for all positive real numbers $p$ and $q$ such that $p < q$.


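Proof. (i) $\Rightarrow$ (ii). Suppose that there exists a disjoint sequence $\{E_n\}$ of sets of positive measure
in $\Omega$ such that the sequence $\{\mu(E_n)\}$ tends to zero. It can be assumed that the sequence $\{\mu(E_n)\}$ is
strictly decreasing and that $\mu(E_n) < 1$ for every $n \in \mathbb{N}$. Denote by $I_k$ the interval $[k^{-\alpha}, (k-1)^{-\alpha})$,
where $\alpha = (3/2)q/(q - p)$, $k = 2, 3, \ldots$. One can construct, inductively, two strictly increasing
sequences of positive integers $\{h_n\}$ and $\{k_n\}$ such that $\mu(E_{h_n}) \in I_{k_n}$ for every $n \in \mathbb{N}$. Let
$h: \Omega \to \mathbb{R}$ be the function defined as follows:

$$h(x) = k_n^{\beta} \ \text{when } x \in E_{h_n}, \qquad h(x) = 0 \ \text{when } x \in \Omega \setminus \bigcup_{n=1}^{\infty} E_{h_n},$$

where $\beta = (3/2)/(q - p)$.
We have:

$$\int_\Omega |h|^p \, d\mu = \sum_{n=1}^{\infty} k_n^{\beta p} \mu(E_{h_n}) < \sum_{n=1}^{\infty} k_n^{\beta p} (k_n - 1)^{-\alpha} < +\infty,$$

$$\int_\Omega |h|^q \, d\mu = \sum_{n=1}^{\infty} k_n^{\beta q} \mu(E_{h_n}) \ge \sum_{n=1}^{\infty} k_n^{\beta q} k_n^{-\alpha} = +\infty.$$

This contradiction proves (ii).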
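The convergence and divergence claimed in this step follow from the choice of exponents. The short computation below is a worked check added for the reader, using only the definitions of $\alpha$ and $\beta$ given above and the fact that $k_n \ge 2$:

```latex
\begin{align*}
\beta q - \alpha &= \frac{3}{2}\,\frac{q}{q-p} - \frac{3}{2}\,\frac{q}{q-p} = 0,
  &&\text{so } \sum_{n} k_n^{\beta q} k_n^{-\alpha} = \sum_{n} 1 = +\infty, \\
\beta p - \alpha &= \frac{3}{2}\,\frac{p-q}{q-p} = -\frac{3}{2},
  &&\text{so } k_n^{\beta p}(k_n-1)^{-\alpha}
     = k_n^{-3/2}\Bigl(\frac{k_n}{k_n-1}\Bigr)^{\alpha} \le 2^{\alpha} k_n^{-3/2},
\end{align*}
```

and $\sum_n k_n^{-3/2}$ converges because the $k_n$ are distinct positive integers.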

(ii) $\Rightarrow$ (iii). According to Proposition 0.II, if $E \subset \Omega$ is a set of finite measure, one can write
$E = E_1 \cup E_2$, where $E_1$ is atom free and $E_2$ is purely atomic. If $\mu(E_1) > 0$, we can construct, by
Proposition 0.I, a sequence $\{F_k\}$ of measurable sets such that $F_1 = E_1 \supset F_2 \supset \cdots \supset F_k \supset \cdots$
and $\mu(F_k) = \mu(E_1)/2^{k-1}$ ($k = 1, 2, \ldots$). For $n \in \mathbb{N}$, define $G_n = F_n \setminus F_{n+1}$. The sequence $\{G_n\}$ is
a disjoint sequence of sets of positive measure in $\Omega$ such that $\mu(G_n) \to 0$ as $n \to \infty$. This
contradicts (ii). Hence, one can write $E = \{\bigcup_{i \in J} A_i\} \cup E_1$, where $A_i \in \Sigma$ is, for every $i \in J$, an
atom and $E_1$ is such that $\mu(E_1) = 0$. Since $\mu(E) < +\infty$, the set $J$ is, at most, countable. If the
collection of positive real numbers $\{\mu(A_i), i \in J\}$ is not bounded away from zero, it would
contradict (ii). Hence $J$ is a finite set. If there is no number $m > 0$ such that the set of values of $\mu$
is contained in $\{0\} \cup [m, \infty]$, we could find atoms in $\Omega$ whose measure is arbitrarily small, which
contradicts (ii). This proves (iii).

(iii) $\Rightarrow$ (iv). Assume $f: \Omega \to \mathbb{R}$ is a function such that $a = \int_\Omega |f|^p \, d\mu < +\infty$. For every $n \in \mathbb{N}$,
we set $E_n = \{x \in \Omega : |f(x)| > n\}$. Now

$$n^p \mu(E_n) \le \int_{E_n} |f|^p \, d\mu \le \int_\Omega |f|^p \, d\mu = a < +\infty,$$

so $\mu(E_n) < +\infty$ for every $n \in \mathbb{N}$, and $\mu(E_n) \to 0$ as $n \to \infty$. Condition (iii) implies that there
exists an $n_0 \in \mathbb{N}$ such that $\mu(E_n) = 0$ for $n \ge n_0$. This proves that $|f(x)| \le n_0$ for all $x \in \Omega$
except a set of $\mu$-measure zero. Hence

$$\int_\Omega |f|^q \, d\mu = \int_\Omega |f|^p |f|^{q-p} \, d\mu \le n_0^{q-p} \int_\Omega |f|^p \, d\mu < +\infty.$$

This proves (iv).
The proof of (iv) $\Rightarrow$ (i) is obvious. Q.E.D.


THEOREM 2. The following conditions on the measure space $(\Omega, \Sigma, \mu)$ are equivalent:

(I) $L^q(\mu) \subset L^p(\mu)$ for some pair $p, q$ of positive numbers such that $p < q$.

(II) Every disjoint sequence $\{E_n\}$ of sets of positive finite measure in $\Omega$ is such that the sequence
$\{\mu(E_n)\}$ is bounded.

(III) $\Omega = E_1 \cup E_2$, where $E_1 \subset \Omega$ is a measurable set such that $\mu(E_1) < +\infty$ and $E_2$ is either
empty or an atom of infinite measure.

(IV) $L^q(\mu) \subset L^p(\mu)$ for all positive numbers $p$ and $q$ such that $p < q$.


Proof. (I) $\Rightarrow$ (II). Suppose that there exists a disjoint sequence $\{E_n\}$ of sets of positive measure
such that the sequence $\{\mu(E_n)\}$ diverges to $+\infty$. One can assume that the sequence $\{\mu(E_n)\}$ is
strictly increasing and that $\mu(E_n) > 1$ for every $n \in \mathbb{N}$. Denote by $J_k$ the interval $(2^k, 2^{k+1}]$, for
every $k \in \mathbb{N}$. One can construct, inductively, two strictly increasing sequences of positive integers
$\{h_n\}$ and $\{k_n\}$ such that $\mu(E_{h_n}) \in J_{k_n}$, for every $n \in \mathbb{N}$. Let $g: \Omega \to \mathbb{R}$ be the function defined as
follows:

$$g(x) = 2^{-k_n/p} \ \text{when } x \in E_{h_n}, \qquad g(x) = 0 \ \text{when } x \in \Omega \setminus \bigcup_{n=1}^{\infty} E_{h_n}.$$

We have that $\int |g|^q \, d\mu < +\infty$ and $\int |g|^p \, d\mu = +\infty$. This contradiction proves (II).
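The two integrals can be estimated directly. The computation below is a worked check added for the reader; it assumes the reading $g = 2^{-k_n/p}$ on $E_{h_n}$ with $2^{k_n} < \mu(E_{h_n}) \le 2^{k_n+1}$, and uses $q > p$ together with the fact that the $k_n$ are distinct positive integers:

```latex
\begin{align*}
\int_\Omega |g|^p \, d\mu &= \sum_{n=1}^{\infty} 2^{-k_n}\,\mu(E_{h_n})
   > \sum_{n=1}^{\infty} 2^{-k_n}\, 2^{k_n} = +\infty, \\
\int_\Omega |g|^q \, d\mu &= \sum_{n=1}^{\infty} 2^{-k_n q/p}\,\mu(E_{h_n})
   \le \sum_{n=1}^{\infty} 2^{-k_n q/p}\, 2^{k_n+1}
   = 2\sum_{n=1}^{\infty} 2^{-k_n (q-p)/p} < +\infty .
\end{align*}
```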

(II) $\Rightarrow$ (III). Let $B = \sup \mu(F)$, where the sup is taken over all measurable subsets $F$ of $\Omega$
satisfying $\mu(F) < \infty$. There exists a sequence $\{F_n\}$ of measurable subsets of $\Omega$ such that
$\lim_{n \to \infty} \mu(F_n) = B$. Let $E_1 = \bigcup_{n=1}^{\infty} F_n$, so $\mu(E_1) = B$. Condition (II) implies that $B < \infty$. Let
$E_2 = \Omega \setminus E_1$. If $E \subset E_2$ is such that $\mu(E) < \infty$, the construction of $E_1$ proves that $\mu(E) = 0$. This
shows that $E_2$ is either empty or an infinite atom and (III) is proved.

(III) $\Rightarrow$ (IV). Assume $f: \Omega \to \mathbb{R}$ is a measurable function such that $\int_\Omega |f|^q \, d\mu < +\infty$. If we set

$$A_n = \{x \in E_2 : |f(x)| > 1/n\},$$

we have either $\mu(A_n) = 0$ or $\mu(A_n) = +\infty$. Since

$$\frac{1}{n^q}\,\mu(A_n) \le \int_{A_n} |f|^q \, d\mu \le \int_\Omega |f|^q \, d\mu < +\infty,$$

it follows that $\mu(A_n) = 0$ for every $n \in \mathbb{N}$. Let $A$ be the set $\bigcup_{n=1}^{\infty} A_n = \{x \in E_2 : f(x) \ne 0\}$. We
have $\mu(A) = 0$ and

$$\int_\Omega |f|^q \, d\mu = \int_{E_1} |f|^q \, d\mu < +\infty.$$

Since $\mu(E_1) < +\infty$, Hölder's inequality implies that

$$\int_\Omega |f|^p \, d\mu = \int_{E_1} |f|^p \, d\mu \le \Bigl(\int_{E_1} 1 \, d\mu\Bigr)^{1-p/q} \Bigl(\int_{E_1} |f|^q \, d\mu\Bigr)^{p/q} = \mu(E_1)^{1-p/q} \Bigl(\int_{E_1} |f|^q \, d\mu\Bigr)^{p/q} < +\infty.$$

This proves (IV). The proof of (IV) $\Rightarrow$ (I) is obvious. Q.E.D.


A SHORT PROOF OF THE VARIATIONAL PRINCIPLE FOR APPROXIMATE
SOLUTIONS OF A MINIMIZATION PROBLEM

U.E.R. de Mathématiques, Université Paul Sabatier, 118, route de Narbonne, 31062 TOULOUSE Cedex,
France


Given any problem of minimization, two tasks come up very naturally: the question of the 
existence of a minimum point and the determination of properties (usually in terms of conditions 
for optimality) satisfied by minimum points. The classical variational principle states that if a
differentiable function $f$ attains its minimum at some point $x_0$, then $f'(x_0) = 0$. When $f$ has only a
finite lower bound (which it does not necessarily attain), then, for every $\varepsilon > 0$, there exists some
approximate solution, i.e., a point $x_\varepsilon$ such that:


(1) $\inf f \le f(x_\varepsilon) \le \inf f + \varepsilon$.


What conditions do such points necessarily satisfy? To answer this question is precisely the 
purpose of Ekeland’s variational principle [1]. Since its introduction in 1972, this variational 
principle has received a great deal of attention: different aspects and applications of it as well as a 
general proof are reported in [2]. The aim of this note is to present a short proof which relies on 
the regularization method applied to the original problem [3]. Although the proof essentially 
works only in finite dimensions, it is appealing for its geometrical insight and the simplicity of 
techniques involved. 


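THEOREM. Let $f: \mathbb{R}^n \to (-\infty, +\infty]$ be a lower semicontinuous function, not identically $+\infty$,
bounded from below. Then for every point $x_\varepsilon$ satisfying (1) and every $\lambda > 0$, there exists some point $\bar{x}_\varepsilon$
such that


(2) $f(\bar{x}_\varepsilon) \le f(x_\varepsilon)$;
(3) $\|\bar{x}_\varepsilon - x_\varepsilon\| \le \lambda$;
(4) $\forall x \in \mathbb{R}^n$, $f(\bar{x}_\varepsilon) \le f(x) + (\varepsilon/\lambda)\|x - \bar{x}_\varepsilon\|$.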


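Proof. Given a point $x_\varepsilon$ satisfying (1), we consider the perturbed function

$$g: x \mapsto f(x) + (\varepsilon/\lambda)\|x - x_\varepsilon\|.$$

Since $f$ is assumed lower semicontinuous and bounded from below, $g$ is clearly lower semicontinuous
and verifies $\lim_{\|x\| \to +\infty} g(x) = +\infty$. Therefore there exists $\bar{x}_\varepsilon$ minimizing $g$ on $\mathbb{R}^n$ such that

(5) $\forall x \in \mathbb{R}^n$, $f(\bar{x}_\varepsilon) + (\varepsilon/\lambda)\|\bar{x}_\varepsilon - x_\varepsilon\| \le f(x) + (\varepsilon/\lambda)\|x - x_\varepsilon\|$.

By letting $x = x_\varepsilon$ we get

$$f(\bar{x}_\varepsilon) + (\varepsilon/\lambda)\|\bar{x}_\varepsilon - x_\varepsilon\| \le f(x_\varepsilon)$$

and (2) follows. Now, since $f(x_\varepsilon) \le \inf f + \varepsilon$, we clearly deduce from the above that $\|\bar{x}_\varepsilon - x_\varepsilon\| \le \lambda$.
We infer from (5) that

$$f(\bar{x}_\varepsilon) \le f(x) + (\varepsilon/\lambda)\{\|x - x_\varepsilon\| - \|\bar{x}_\varepsilon - x_\varepsilon\|\} \le f(x) + (\varepsilon/\lambda)\|x - \bar{x}_\varepsilon\| \quad \text{for all } x,$$

which is the desired inequality (4). $\square$

Thus, the closer to $x_\varepsilon$ we desire $\bar{x}_\varepsilon$ to be, the larger the perturbation of $f$ that must be accepted.
A good compromise is to take $\lambda = \varepsilon^{1/2}$. As an illustration, we consider what the variational
principle for approximate solutions yields for differentiable $f$.


COROLLARY. Let $f: \mathbb{R}^n \to \mathbb{R}$ be a differentiable function, bounded from below. For every $\varepsilon > 0$,
there exists some point $\bar{x}_\varepsilon$ satisfying:

$$f(\bar{x}_\varepsilon) \le \inf f + \varepsilon,$$

$$\|f'(\bar{x}_\varepsilon)\| \le \varepsilon^{1/2}.$$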
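The regularization step in the proof can be imitated numerically in one dimension. The sketch below is an illustration only: the function $f = \exp$ (bounded below with infimum 0, not attained), the values of `eps` and `lam`, and the grid search are all assumptions made here, not part of the note. The point `x_bar` minimizes the perturbed function over a finite set of candidates, so conditions (2), (3), and (4) are checked on that set, up to a small rounding tolerance.

```python
import math

def ekeland_point(f, x_eps, eps, lam, grid):
    """Minimise g(x) = f(x) + (eps/lam)*|x - x_eps| over the grid plus x_eps."""
    penalty = eps / lam
    candidates = list(grid) + [x_eps]
    return min(candidates, key=lambda x: f(x) + penalty * abs(x - x_eps))

# Assumed example: f = exp on the real line, inf f = 0.
f = math.exp
eps, lam = 0.1, 2.0
x_eps = math.log(eps)          # f(x_eps) = eps, so (1) holds
grid = [-20.0 + 0.001 * i for i in range(40001)]   # points from -20 to 20

x_bar = ekeland_point(f, x_eps, eps, lam, grid)

tol = 1e-12  # allowance for floating-point rounding
cond2 = f(x_bar) <= f(x_eps) + tol
cond3 = abs(x_bar - x_eps) <= lam + tol
cond4 = all(f(x_bar) <= f(x) + (eps / lam) * abs(x - x_bar) + tol for x in grid)

print(x_bar, cond2, cond3, cond4)
```

For this choice the exact minimizer of the perturbed function is $\ln(\varepsilon/\lambda)$, where the slope of $f$ equals $\varepsilon/\lambda$, which matches the derivative bound obtained from (4) for differentiable $f$.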
THE TRUE GROWTH RATE AND THE INFLATION BALANCING PRINCIPLE


ROBERT C. THOMPSON 
Mathematics Department, University of California, Santa Barbara, CA 93106 


Twenty-five or so years ago, mathematics of finance was a staple in the undergraduate 
curriculum. However, it fell into disfavor, owing chiefly to the tedious nature of the required 
computations, and perhaps also owing to a lack of interest in it on the part of mathematics 
faculty. There was, however, no scientific reason for this fall from grace—finance is an important 
and tidy branch of applied mathematics. With a reawakening awareness of applied mathematics, 
and the change in the computational aspect from tedious to exciting created by the pocket 
calculator, financial mathematics may recover some importance in faculty eyes. But the traditional 
presentation needs modernizing. 

Beyond any doubt, the most important change needed is to recognize that it no longer is 
adequate to think in terms of interest alone: inflation must be brought squarely into the picture. 
To this end, there are two theorems, small ones to be sure, but not so small to be immediately 
obvious, that greatly facilitate the mathematical treatment of inflation within the context of 
finance. The purpose of this note is to present these two theorems, together with a supplement 
explaining their connection with the mathematics of inflation as it usually is given in economics. 
The author has repeatedly used these theorems in the finance classes that he regularly teaches, 
with very satisfactory student comprehension. 

The basic issue to explore in a treatment of inflation is the true growth rate. If i is the interest 
rate, and r the inflation rate, most people would assert that the true growth rate simply is 


i — r = interest rate — inflation rate. 


An economist, upon being asked about this formula, would probably say “Of course, it’s correct!” 
A mathematician, upon being asked, might agree that it is correct, though possibly with an 
Supplement. Having now thoroughly convinced the reader that the formula $g = i - r$ for the
true growth rate is wrong, we now surprise him by asserting that it is correct (but only when
properly formulated).


THEOREM 3. In an inflationary economy, with $i$ the interest rate and $r$ the inflation rate, both
under continuous compounding, the true growth rate is

$$g = i - r.$$


Proof. The reasoning is the same as in Theorem 1, except that, if the time span is $T$ years
(instead of $n$ periods), the future values $Q$ and $Y$ are

$$Q = Pe^{iT} \ \text{(under interest)}, \qquad Y = Xe^{rT} \ \text{(under inflation)}.$$

Setting $Y = Q$ then produces $X = Pe^{(i-r)T}$, but we also have $X = Pe^{gT}$ under continuously
compounded true growth. So $g = i - r$.

In finance, interest is usually presented on a periodically compounded basis. It is true that 
continuously compounded rates have sometimes occurred in commercial practice, but invariably 
the equivalent annually compounded figures have also been prominently mentioned. Inflation, 
though, seems never to be presented to the public on any basis other than monthly compounding 
or annual compounding. (Percents per month inflation, or percents per year, regularly appear in 
the popular press.) In financial work, therefore, the periodically compounded true growth rate 
formula g = (i — r)/(1 + r) is the one to use. However, in economics, the presentation generally 
is more “theoretical,” and the compounding is continuous. Thus, it is perfectly in order for the 
mathematician to tell the financier that the formula g = i — r for the true growth rate is wrong, 
and yet tell the economist that it is right! 
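The two formulas are easy to compare on a pocket calculator or in a few lines of code. The sketch below uses hypothetical rates (10% interest, 6% inflation) and a hypothetical time span; the function names are chosen here for illustration.

```python
import math

def true_growth_periodic(i, r):
    """True growth rate when i and r are periodically compounded rates."""
    return (i - r) / (1 + r)

def true_growth_continuous(i, r):
    """True growth rate when i and r are continuously compounded rates."""
    return i - r

i, r = 0.10, 0.06      # assumed example rates
T = 5                  # assumed time span in years

g_per = true_growth_periodic(i, r)
g_cont = true_growth_continuous(i, r)

# Periodic case: growing by g and then by r must equal growing by i.
periodic_balance = (1 + g_per) * (1 + r)

# Continuous case: interest growth divided by inflation growth over T years.
continuous_ratio = math.exp(i * T) / math.exp(r * T)

print(g_per, g_cont)
```

With these rates the periodic figure is a little under 3.8%, noticeably less than the naive difference of 4%.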

Our final remark is that financial mathematics is a beautiful subject when it is made exciting by
incorporating inflation into the picture, provided that hand-held calculators (not compound
interest tables) are used for the computations. The author has several years' experience to
demonstrate this, teaching the subject to large classes of students, all taking the course as an 
elective. Moreover, the mathematics graduate students who have been the author’s assistants with 
the course all speak extremely favorably of it, even though almost all found that considerable 
effort was needed to master the material. For them, it is an important and not so trivial branch of 
applied mathematics. The newest feature, still not very much recognized, is that incorporating 
inflation makes it easy to do computations with startling and unsettling conclusions. 


A TECHNIQUE FOR INTEGRATION BY PARTS 


HERBERT E. KASUBE 
Department of Mathematics, Bradley University, Peoria, IL 61625 


When we first encounter integration by parts we are confronted with the usual formula:

$$\int u \, dv = uv - \int v \, du.$$

The expressions $u$ and $dv$ are chosen with two criteria in mind:

(1) $v$ should be easy to find from $dv$;
(2) the integral $\int v \, du$ should, in some sense, be “better” or easier than $\int u \, dv$.

The difficulty usually arises in choosing $u$ and $dv$ to satisfy these conditions. An acronym gives
us a method of selection which helps to satisfy criterion (2) above. The word is LIATE, standing
for Logarithmic, Inverse trig, Algebraic, Trig, Exponential. These five basic types of functions are
those encountered in our integral.
If, for example, our integral involves the product of a logarithmic function and an algebraic
function, we choose $u$ to be the log part and, keeping criterion (1) in mind, the algebraic part is
chosen for $dv$. By making this selection, in $\int v \, du$, the logarithmic character has disappeared by
virtue of the differentiation. The antidifferentiation of the algebraic part results (most of the time)
in an algebraic $v$. Consequently, the integral $\int v \, du$ is strictly algebraic in its integrand and
hopefully easier than our original integral.

Similarly, if we have any combination of two of these types of functions in our original 
integral, we choose for u that type that appears first in LIATE and dv is whatever is left, that part 
that appears second in LIATE. 


Rationale: Differentiating logs and inverse trig functions changes their character to algebraic, 
while algebraic functions run the “middle of the road;” differentiating them yields algebraic, 
while integration sometimes yields nonalgebraic, so we would probably differentiate them before 
integrating. Trig and exponential functions “bring up the rear” since their differentiation or 
antidifferentiation results in similar functions. 


As an example, consider $\int x \ln(x) \, dx$. Using LIATE, we pick $u = \ln(x)$ and $dv = x \, dx$. We
obtain $du = x^{-1} dx$ and $v = x^2/2$. Consequently,

$$\int x \ln(x) \, dx = \frac{x^2}{2} \ln(x) - \int \frac{x}{2} \, dx = \frac{x^2}{2} \ln(x) - \frac{x^2}{4} + C.$$
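The selection rule and the worked example can be sketched in code. The category labels, the helper `choose_u`, and the Simpson-rule check on the interval $[1, e]$ are assumptions introduced here for illustration; the check simply compares a numerical value of $\int_1^e x \ln x \, dx$ with the antiderivative found above.

```python
import math

# LIATE order: the type appearing earlier is taken as u.
LIATE = ["logarithmic", "inverse trig", "algebraic", "trig", "exponential"]

def choose_u(type_a, type_b):
    """Return whichever of the two function types comes first in LIATE."""
    return min((type_a, type_b), key=LIATE.index)

def simpson(f, a, b, n=1000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    total += 4 * sum(f(a + j * h) for j in range(1, n, 2))
    total += 2 * sum(f(a + j * h) for j in range(2, n, 2))
    return total * h / 3

def integrand(x):
    return x * math.log(x)

def antiderivative(x):
    # Result of integration by parts with u = ln(x), dv = x dx.
    return x * x / 2 * math.log(x) - x * x / 4

u_choice = choose_u("algebraic", "logarithmic")
numeric = simpson(integrand, 1.0, math.e)
exact = antiderivative(math.e) - antiderivative(1.0)

print(u_choice, numeric, exact)
```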


Students have liked using LIATE and find that choosing u and dv is much less frightening. The 
author has yet to see an integral done by integration by parts to which LIATE cannot be 
successfully applied. 

To be able to use integration by parts, perhaps it is better LIATE than never. 


Reference 


1. Howard Anton, Calculus with Analytic Geometry, Wiley, New York, 1980. 


MISCELLANEA 


95. For the most part [Glaisher] accepted or rejected contributions on his own responsibility. 
The best editors have of course a wonderful power of judging the quality of a manuscript without 
attempting to read it; but the task becomes steadily more difficult as mathematics grows and 
specializes... A private journal, controlled autocratically by a kindly and discriminating editor, 
can serve many useful purposes, and particularly that of giving early encouragement to beginners, 
on whom societies, with councils and referees, are sometimes very severe. 


—G. H. Hardy, “Dr. Glaisher and the ‘Messenger of Mathematics,’” Messenger of 
Mathematics, 58 (1929) 159-160. 


ANSWER TO PHOTOS ON PAGE 195. 


Top: P. S. Alexandroff and H. Hopf (topology). Bottom: A. D. Alexandroff and E. Hopf 
(analysis, of two different but overlapping kinds). 


1983] THE TEACHING OF MATHEMATICS 211 




PROBLEMS AND SOLUTIONS 
EDITED BY G. L. ALEXANDERSON, DAVID BORWEIN, H. M. W. EDGAR, AND D. H. MUGLER 


EDITOR EMERITUS: EMORY P. STARKE. COLLABORATING EDITORS: LEONARD CARLITZ, GULBANK D. 
CHAKERIAN, RICHARD A. GIBBS, RICHARD M. GRASSL, ISRAEL N. HERSTEIN, MURRAY S. KLAMKIN, DANIEL 
J. KLEITMAN, MARVIN MARCUS, W. C. WATERHOUSE, ALBERT WILANSKY, S. F. BAY AREA PROBLEMS GROUP:
LEROY BEASLEY, VINCENT BRUNO, DAN FENDEL, JAMES FOSTER, CLARK GIVENS, ROBERT H. JOHNSON, 
DANIEL JURCA, FREDERICK W. LUTTMANN, LOUISE E. MOSER, M. J. PELLING, HOWARD E. REINHARDT, 
BRUCE RICHMOND, AND EDWARD T. H. WANG. 


Send all proposed problems, in duplicate if possible, to Professor G. L. Alexanderson, Department of Mathe- 
matics, University of Santa Clara, Santa Clara, CA 95053. Please include solutions, relevant references, etc. 

An asterisk (*) indicates that neither the proposer nor the editors supplied a solution. 

Solutions should be sent to the addresses given at the head of each problem set. 

A publishable solution must, above all, be correct. Given correctness, elegance and conciseness are preferred. 
The answer to the problem should appear right at the beginning. If your method yields a more general result, so 
much the better. If you discover that a MONTHLY problem has already been solved in the literature, you should of 
course tell the editors; include a copy of the solution if you can.


ELEMENTARY PROBLEMS 


Solutions of these Elementary Problems should be mailed in duplicate to Professor G. L. Alexanderson, 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by July 31, 1983. Please place 
the solver’s name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label (for 
acknowledgment). 


E 2989. Proposed by M. Goldberg, University of Waterloo, and S. C. Locke, Florida Atlantic 
University. 


Let $A = (a, a')$ and $B = (b, b')$ be lattice points (points in the plane with integral coordinates),
and let $d(A, B)$ denote $|a - b| + |a' - b'|$. Let $S$ be the set of points at $d$-distance at most $k$ from
the origin. Calculate

$$f(k) = \sum_{A, B \in S} d(A, B).$$

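For small $k$ the sum in E 2989 can be tabulated by brute force, which is useful for testing a conjectured closed form. The sketch below assumes that the sum runs over ordered pairs $(A, B)$, so each unordered pair of distinct points is counted twice; the function names are chosen here for illustration.

```python
def taxicab_ball(k):
    """Lattice points (a, b) with |a| + |b| <= k."""
    return [(a, b)
            for a in range(-k, k + 1)
            for b in range(-k, k + 1)
            if abs(a) + abs(b) <= k]

def f(k):
    """Sum of d(A, B) over all ordered pairs of points of the ball."""
    points = taxicab_ball(k)
    return sum(abs(a - c) + abs(b - d)
               for (a, b) in points
               for (c, d) in points)

for k in range(4):
    print(k, len(taxicab_ball(k)), f(k))
```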
E 2990. Proposed by H. Eves, University of Maine, and C. Kimberling, University of Evansville. 


Let $ABC$ be a triangle and $L$ a line in the plane of $ABC$ not passing through $A$, $B$, or $C$.


i) Prove that the isogonal conjugate of L is an ellipse, parabola or hyperbola according as L 
meets the circumcircle of ABC in zero, one or two points. 

ii) Prove that the isotomic conjugate of $L$ is an ellipse, parabola, or hyperbola according as $L$
meets $E$ in zero, one, or two points, where $E$ is the ellipse through $A$, $B$, $C$ having the
centroid of triangle $ABC$ as center.

E 2991. Proposed by Z. F. Starc, Vrsac, Yugoslavia. 


Prove or disprove the following inequality:

$$a \tanh(x) > \sin(ax) \qquad (x > 0,\ a > 1).$$


SOLUTIONS OF ELEMENTARY PROBLEMS 


$f(n) = \gcd\{k^n - k \mid k = 2, 3, 4, \ldots\}$

E 2901 [1981, 538]. Proposed by S. C. Locke and A. Mandel, University of Waterloo.


Let $f(n) = \gcd\{k^n - k \mid k = 2, 3, 4, \ldots\}$ for $n \ge 2$. Evaluate $f(n)$. In particular, show that
$f(2n) = 2$.

Solution by Robert Breusch, Amherst College.

1. $f(n)$ cannot contain a factor $p^2$ for any prime $p$ ($p \ge 2$) because $p^2 \nmid k(k^{n-1} - 1)$ for
$k = p$.

2. For every $n$, $2 \mid f(n)$.

3. If $p$ is an odd prime and if $a$ is a primitive root (mod $p$), then $p \mid a(a^{n-1} - 1)$ only if
$(p - 1) \mid (n - 1)$. On the other hand, if $(p - 1) \mid (n - 1)$ then $p \mid (k^n - k)$ for every $k$. Thus if $P$ is
the product of the distinct odd primes $p$ such that $(p - 1) \mid (n - 1)$, then $f(n) = 2P$. (In particular
$6 \mid f(n)$ for every odd $n$.)

4. $(p - 1)$ is not a divisor of $(2n - 1)$ for any odd prime $p$. Thus $f(2n) = 2$.
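The formula $f(n) = 2P$ can be compared with a direct gcd computation. In the sketch below the function names and the cutoff `k_max = 50` are assumptions made here; the truncated gcd agrees with the true gcd for the small exponents tested because $k$ then runs over enough residues modulo every relevant prime.

```python
from functools import reduce
from math import gcd

def f_by_gcd(n, k_max=50):
    """gcd of k**n - k for k = 2, ..., k_max (a finite stand-in for f(n))."""
    return reduce(gcd, (k ** n - k for k in range(2, k_max + 1)))

def is_prime(m):
    """Trial-division primality test, adequate for small m."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

def f_by_formula(n):
    """2 times the product of the odd primes p with (p - 1) dividing (n - 1)."""
    result = 2
    for p in range(3, n + 1):
        if is_prime(p) and (n - 1) % (p - 1) == 0:
            result *= p
    return result

for n in range(2, 14):
    print(n, f_by_gcd(n), f_by_formula(n))
```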


Also solved by J. T. Arnold, L. Belohrad (Czechoslovakia), M. Bencze (Romania), K. L. Bernstein, J. Browkin 
(Poland), P. S. Bruckman, H. M. Edgar, L. Foster, R. Gilmer, T. Herman, M. Josephy (Costa Rica), V. S. Joshi & A. 
M. Vaidya (India), W. T. M. Kars (Netherlands), R. E. Kennedy, J. O. Kiltinen, C. -N. Lee, M. Lehtinen (Finland), 
O. P. Lossers (Netherlands), J. G. Merickel (student), R. Morelli (student), R. B. Nelson, W. A. Newcomb, D. A. 
Rawsthorne, D. M. Rosenblum, R. P. Savage, Jr., R. E. Shafer, G. Shulman, S. Singh, D. Singmaster (England), D. 
B. Shapiro, L. Somer, J. M. Stark, A. Stenger, R. Teitler (United Kingdom), E. Trost (Switzerland), S. Wagon, M. 
Woltermann, P. Yiu (Canada), B. Zinner (Germany), an anonymous solver, and the proposer. 


Kiltinen referred to his article, Linearity of Exponentiation, Mathematics Magazine, 52 (1979) 3-9. Bruckman
pointed out that $f(2m + 1)$ is equal to the denominator of $B_{2m}$. Foster noted $f(n) = 6$ for infinitely many $n$.


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor G. L. Alexanderson, 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by July 31, 1983. The solver’s 
full post-office address should be on each sheet. 


6422. Proposed by Harvey Diamond, West Virginia University, and Bruce Montgomery, 
Carnegie-Mellon University. 


Let $f: [0, \infty) \to (-\infty, \infty)$ be a measurable function. If $p > 1$, is it possible that
$f(x) - \int_0^x |f|^p \to \infty$ as $x \to \infty$?


SOLUTIONS OF ADVANCED PROBLEMS 
Generation of the Alternating Group by Three Involutions 


6350 [1981,540]. Proposed by James Wiegold, University College, Cardiff, Wales. 


Prove that the alternating group $A_n$ ($n > 4$) can be generated by three involutions. Can two of
them be conjugate? (*) Can one of them be preassigned?


Solution by Alan G. Williamson, University College, Galway, Ireland. 


PROPOSITION. For $n > 4$, the alternating group $A_n$ can be generated by three involutions, one of
which is preassigned and two of which are conjugate.


Notation. We take $A_n$ acting on the set $\Omega = \{1, 2, \ldots, n\}$. The degree of a permutation is the
number of points that it displaces. Thus an involution in $A_n$ has degree divisible by 4; and two
involutions in $A_n$ are conjugate (in $A_n$) if and only if they have the same degree. By relabelling the
points (if necessary) we may assume that the preassigned involution is one of those defined below.
Our construction of generators utilizes the following well-known result of Jordan:


THEOREM. (See [1].) Let $G$ be a doubly transitive permutation group of degree $n$. If either (a) $G$
contains a 3-cycle, or (b) $G$ contains an involution of degree 4 and $n > 8$, then $G$ contains $A_n$.
It is convenient to consider separately five cases. First of all, we deal with $A_8$:


Case A. $n = 8$. Let $u = (1,2)(3,4)$, $v = (1,5)(6,7)$, $w = (1,6)(3,7)(4,5)(2,8)$. Here $\langle u, v, w \rangle$
is transitive on $\Omega$, $\langle u, v, w^{-1}vw \rangle$ is transitive on $\Omega \setminus \{8\}$. Hence $\langle u, v, w \rangle$ is doubly transitive on
Notation. We take A, acting on the set Q = {1,2,..., 2}. The degree of a permutation is the 
number of points that it displaces. Thus an involution in A, has degree divisible by 4; and two 
involutions in 4, are conjugate (in A,,) if and only if they have the same degree. By relabelling the 
points (if necessary) we may assume that the preassigned involution is one of those defined below. 
Our construction of generators utilizes the following well-known result of Jordan: 


THEOREM. (See [1].) Let G be a doubly transitive permutation group of degree n. If either (a) G 
contains a 3-cycle, or (b) G contains an involution of degree 4 and n > 8, then G contains A,,. 
It is convenient to consider separately five cases. First of all, we deal with A,: 


Case A. n = 8. Let u = (1,2) 3,4), v = (1,5) (6,7), w = (1,6) ©@, 7) (4,5) (2, 8). Here (u, v, w) 
is transitive on Q, (u,v, w ‘ow ) is transitive on {2 \ {8}. Hence (u, v, w) is doubly transitive on 
Solution by Robert Breusch, Amherst College.


1. $f(n)$ cannot contain a factor $p^2$ for any prime $p$ ($p \ge 2$) because $p^2 \nmid k(k^{n-1} - 1)$ for
$k = p$.

2. For every $n$, $2 \mid f(n)$.

3. If $p$ is an odd prime and if $a$ is a primitive root (mod $p$), then $p \mid a(a^{n-1} - 1)$ only if
$(p - 1) \mid (n - 1)$. On the other hand, if $(p - 1) \mid (n - 1)$ then $p \mid (k^n - k)$ for every $k$. Thus if $P$ is
the product of the distinct odd primes $p$ such that $(p - 1) \mid (n - 1)$, then $f(n) = 2P$. (In particular
$6 \mid f(n)$ for every odd $n$.)

4. $(p - 1)$ is not a divisor of $(2n - 1)$ for any odd prime $p$. Thus $f(2n) = 2$.
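
The four steps above can be checked numerically for small $n$. The sketch below is illustrative only. The problem statement is not reproduced in this part of the column, so it assumes that $f(n)$ denotes the greatest common divisor of the numbers $k^n - k$ over all integers $k$. The function names and the cutoff `k_max` are hypothetical, and truncating the gcd to $2 \le k \le 200$ is a heuristic that is adequate only for the small exponents printed here.

```python
from functools import reduce
from math import gcd


def f_by_gcd(n, k_max=200):
    """gcd of k**n - k for k = 2..k_max (a finite stand-in for 'all k')."""
    return reduce(gcd, (k**n - k for k in range(2, k_max + 1)))


def is_prime(p):
    """Trial division; fine for the tiny primes needed here."""
    return p >= 2 and all(p % d for d in range(2, int(p**0.5) + 1))


def f_by_formula(n):
    """2P, where P is the product of the odd primes p with (p - 1) | (n - 1)."""
    result = 2
    for p in range(3, n + 1):  # (p - 1) | (n - 1) forces p <= n
        if is_prime(p) and (n - 1) % (p - 1) == 0:
            result *= p
    return result


# Compare the brute-force gcd with the closed form for small exponents.
for n in range(2, 14):
    print(n, f_by_gcd(n), f_by_formula(n))
```

Under these assumptions the two columns agree, the even exponents all give 2, and the odd exponents give multiples of 6, in line with steps 3 and 4.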


Also solved by J. T. Arnold, L. Belohrad (Czechoslovakia), M. Bencze (Romania), K. L. Bernstein, J. Browkin 
(Poland), P. S. Bruckman, H. M. Edgar, L. Foster, R. Gilmer, T. Herman, M. Josephy (Costa Rica), V. S. Joshi & A. 
M. Vaidya (India), W. T. M. Kars (Netherlands), R. E. Kennedy, J. O. Kiltinen, C.-N. Lee, M. Lehtinen (Finland), 
O. P. Lossers (Netherlands), J. G. Merickel (student), R. Morelli (student), R. B. Nelson, W. A. Newcomb, D. A. 
Rawsthorne, D. M. Rosenblum, R. P. Savage, Jr., R. E. Shafer, G. Shulman, S. Singh, D. Singmaster (England), D. 
B. Shapiro, L. Somer, J. M. Stark, A. Stenger, R. Teitler (United Kingdom), E. Trost (Switzerland), S. Wagon, M. 
Woltermann, P. Yiu (Canada), B. Zinner (Germany), an anonymous solver, and the proposer. 


Kiltinen referred to his article, Linearity of Exponentiation, Mathematics Magazine, 52 (1979) 3-9. Bruckman
pointed out that $f(2m + 1)$ is equal to the denominator of $B_{2m}$. Foster noted $f(n) = 6$ for infinitely many $n$.


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor G. L. Alexanderson, 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by July 31, 1983. The solver’s 
full post-office address should be on each sheet. 


6422. Proposed by Harvey Diamond, West Virginia University, and Bruce Montgomery, 
Carnegie-Mellon University. 


Let $f\colon [0, \infty) \to (-\infty, \infty)$ be a measurable function. If $p > 1$, is it possible that
$f(x) - \int_0^x |f|^p \to \infty$ as $x \to \infty$?


SOLUTIONS OF ADVANCED PROBLEMS 
Generation of the Alternating Group by Three Involutions 


6350 [1981, 540]. Proposed by James Wiegold, University College, Cardiff, Wales. 


Prove that the alternating group $A_n$ ($n > 4$) can be generated by three involutions. Can two of
them be conjugate? (*) Can one of them be preassigned?


Solution by Alan G. Williamson, University College, Galway, Ireland. 


PROPOSITION. For $n > 4$, the alternating group $A_n$ can be generated by three involutions, one of
which is preassigned and two of which are conjugate.


Notation. We take $A_n$ acting on the set $\Omega = \{1, 2, \ldots, n\}$. The degree of a permutation is the
number of points that it displaces. Thus an involution in $A_n$ has degree divisible by 4; and two
involutions in $A_n$ are conjugate (in $A_n$) if and only if they have the same degree. By relabelling the
points (if necessary) we may assume that the preassigned involution is one of those defined below.
Our construction of generators utilizes the following well-known result of Jordan:


THEOREM. (See [1].) Let $G$ be a doubly transitive permutation group of degree $n$. If either (a) $G$
contains a 3-cycle, or (b) $G$ contains an involution of degree 4 and $n > 8$, then $G$ contains $A_n$.

It is convenient to consider separately five cases. First of all, we deal with $A_8$:


Case A. $n = 8$. Let $u = (1,2)(3,4)$, $v = (1,5)(6,7)$, $w = (1,6)(3,7)(4,5)(2,8)$. Here $\langle u, v, w \rangle$
is transitive on $\Omega$, and $\langle u, v, w^{-1}vw \rangle$ is transitive on $\Omega \setminus \{8\}$. Hence $\langle u, v, w \rangle$ is doubly transitive on
$\Omega$ and contains the 3-cycle $(uv)^2$.
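
Case A is small enough to confirm by brute force. The following sketch is not part of the published solution. It closes the three permutations of Case A under multiplication and compares the size of the resulting group with $|A_8| = 8!/2$. All function names are hypothetical, and permutations are composed left to right, which is the convention the cycle computations in this solution appear to use.

```python
from math import factorial


def perm_from_cycles(cycles, n):
    """Permutation of {0, ..., n-1} as a tuple, from 1-based disjoint cycles."""
    p = list(range(n))
    for cyc in cycles:
        for a, b in zip(cyc, cyc[1:] + cyc[:1]):
            p[a - 1] = b - 1
    return tuple(p)


def compose(p, q):
    """Apply p first, then q (left-to-right product pq)."""
    return tuple(q[i] for i in p)


def degree(p):
    """Number of points displaced by p."""
    return sum(1 for i, j in enumerate(p) if i != j)


def generated_group(gens):
    """All elements of the group generated by gens (breadth-first closure)."""
    identity = tuple(range(len(gens[0])))
    seen = {identity}
    frontier = [identity]
    while frontier:
        new = []
        for g in frontier:
            for s in gens:
                h = compose(g, s)
                if h not in seen:
                    seen.add(h)
                    new.append(h)
        frontier = new
    return seen


n = 8
u = perm_from_cycles([(1, 2), (3, 4)], n)
v = perm_from_cycles([(1, 5), (6, 7)], n)
w = perm_from_cycles([(1, 6), (3, 7), (4, 5), (2, 8)], n)

group_order = len(generated_group([u, v, w]))
uv_squared = compose(compose(u, v), compose(u, v))
print(group_order, factorial(n) // 2, degree(uv_squared))
```

Since $u$ and $v$ both have degree 4 they are conjugate, and $w$ has degree 8, so between them the three generators cover both possible degrees for a preassigned involution in $A_8$.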

Case B. $n = 4m > 8$. Let

$$x = (1,2)(3,4) \cdots (2r-1,2r) \cdots (4m-1,4m)$$
of degree $4m$. Let
$$y = (1,2)(4,5) \cdots (2r,2r+1) \cdots (4m-2,4m-1)(3,4m)$$
of degree $4m$. For any positive integer $s < m$, we define
$$z = (1,3)(4,5) \cdots (2r,2r+1) \cdots (4s,4s+1)$$

of degree $4s$. Since $xy = (3,5,\ldots,4m-1)(4m,4m-2,\ldots,4)$, it is clear that $\langle xy, z \rangle$ is
transitive on $\Omega \setminus \{2\}$ and $\langle x, y, z \rangle$ is transitive on $\Omega$. Therefore $\langle x, y, z \rangle$ is doubly transitive on $\Omega$
and contains the involution $(yz)^2 = (1,3)(2,4m)$.

Case C. $n = 4m + 1 \ge 5$. Take $x$ as above. For $m > 1$, we define

$$v = (2,3)(4,5) \cdots (2r,2r+1) \cdots (4m-2,4m-1)(1,4m)(4m+1)$$
of degree $4m$. (For $m = 1$, take $v = (2,3)(4,5)$.) Let
$$w = (4m-4s+3,4m-4s+4) \cdots (2r-1,2r) \cdots (4m-1,4m)(1,4m+1)$$

of degree $4s$. Here $\langle x, v \rangle$ is transitive on $\Omega \setminus \{4m+1\}$, $\langle x, v, w \rangle$ is transitive on $\Omega$. So $\langle x, v, w \rangle$
is doubly transitive on $\Omega$, and contains the 3-cycle $(xw)^2 = (1,4m+1,2)$.
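
The general pattern of Case C can also be spot-checked for the smallest parameters. The sketch below builds $x$, $v$, $w$ from the formulas as read here and verifies by exhaustive closure that they generate a group of order $n!/2$ for $n = 5$ and $n = 9$. It is an illustration under that reading of the formulas, not the author's argument, and the helper names are hypothetical.

```python
from math import factorial


def perm_from_cycles(cycles, n):
    """Permutation of {0, ..., n-1} as a tuple, from 1-based disjoint cycles."""
    p = list(range(n))
    for cyc in cycles:
        for a, b in zip(cyc, cyc[1:] + cyc[:1]):
            p[a - 1] = b - 1
    return tuple(p)


def compose(p, q):
    """Apply p first, then q."""
    return tuple(q[i] for i in p)


def degree(p):
    """Number of points displaced by p."""
    return sum(1 for i, j in enumerate(p) if i != j)


def group_order(gens):
    """Order of the group generated by gens, by breadth-first closure."""
    identity = tuple(range(len(gens[0])))
    seen = {identity}
    frontier = [identity]
    while frontier:
        new = []
        for g in frontier:
            for s in gens:
                h = compose(g, s)
                if h not in seen:
                    seen.add(h)
                    new.append(h)
        frontier = new
    return len(seen)


def case_c_generators(m, s):
    """The involutions x, v, w of Case C for n = 4m + 1 and 1 <= s <= m."""
    n = 4 * m + 1
    x = [(2 * r - 1, 2 * r) for r in range(1, 2 * m + 1)]
    if m == 1:
        v = [(2, 3), (4, 5)]
    else:
        v = [(2 * r, 2 * r + 1) for r in range(1, 2 * m)] + [(1, 4 * m)]
    # Pairs (2r-1, 2r) from (4m-4s+3, 4m-4s+4) up to (4m-1, 4m), then (1, 4m+1).
    w = [(2 * r - 1, 2 * r) for r in range(2 * m - 2 * s + 2, 2 * m + 1)]
    w.append((1, 4 * m + 1))
    return [perm_from_cycles(c, n) for c in (x, v, w)]


results = {}
for m, s in [(1, 1), (2, 1), (2, 2)]:
    x, v, w = case_c_generators(m, s)
    results[(m, s)] = (group_order([x, v, w]), degree(x), degree(v), degree(w))
    print(m, s, results[(m, s)])
```

In each run $x$ and $v$ have degree $4m$ (so they are conjugate) while $w$ has the prescribed degree $4s$.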


Case D. $n = 4m + 2 \ge 6$. Take $x$ as above. For $m > 1$, we define

$$u = (1,3)(2,5) \cdots (2r,2r+3) \cdots (4m-4,4m-1)(4m-2,4m+1)(4m)(4m+2)$$
of degree $4m$. (For $m = 1$, take $u = (1,3)(2,5)$.) Then for $s > 1$, we define

$$t = (4m-4s+1,4m-4s+2) \cdots (2r-1,2r) \cdots (4m-3,4m-2)(4m-1,4m+2)(4m)(4m+1)$$

of degree $4s$. (For $s = 1$, we take $t = (4m-3,4m-2)(4m-1,4m+2)$ of degree 4.) Now
$\langle x, u \rangle$ is transitive on $\Omega \setminus \{4m+2\}$, $\langle x, u, t \rangle$ is transitive on $\Omega$. So $\langle x, u, t \rangle$ is doubly transitive
on $\Omega$ and contains the 3-cycle $(xt)^2 = (4m,4m-1,4m+2)$.

Case E. $n = 4m + 3 \ge 7$. Take $x$ and $u$ as above. For $m > s > 1$, we define

$$d = (4m-4s+1,4m-4s+2) \cdots (2r-1,2r) \cdots (4m-5,4m-4)(4m+1,4m+2)(4m-1,4m+3),$$
which is of degree $4s$ and fixes the points $4m$, $4m-2$, $4m-3$. (For $s = 1$, take $d = (4m+1,4m+2)(4m-1,4m+3)$.)
Here $\langle x, u \rangle$ is transitive on $\Omega \setminus \{4m+2, 4m+3\}$ and $\langle x, u, d \rangle$ is
transitive on $\Omega$. Now $\langle x, u, d^{-1}xd \rangle$ is transitive on $\Omega \setminus \{4m+2\}$, because $d^{-1}xd$ interchanges $4m$
and $4m+3$, whilst fixing $4m+2$. Hence $\langle x, u, d \rangle$ is doubly transitive and contains the 3-cycle
$(xd)^2 = (4m,4m-1,4m+3)$.

Reference 


1. H. Wielandt, Finite permutation groups, Academic Press, New York, 1964. 


Also solved by LeRoy Beasley, who refers to a theorem of Alan Williamson: If a primitive transitive permutation
group $G$ of degree $n$ contains a $t$-cycle, then $G$ contains $A_n$ unless $t > (n - t)!$.


Perron Roots 


6354 [1981,541] Proposed by Emeric Deutsch, Polytechnic Institute of New York. 


Let A be a nonnegative irreducible n X n matrix with Perron root r. Show that there exists a 
constant $K > 0$ such that

$$\|(tI - A)^{-1}\| \le \frac{K}{t - r}$$

for all $t > r$. Here $I$ denotes the identity matrix and $\|\ \|$ is the operator norm induced by the Euclidean norm
$x \mapsto (x'x)^{1/2}$. *Find the best value of the constant $K$.


Solution by H. Kestelman, University College London. We have to show that $\|M_t\| \le K$ for all
$t > r$, where $M_t = (t - r)(tI - A)^{-1}$. It is well known that $M_t \ge 0$. From the hypothesis on $A$,
there is a positive vector $v$ satisfying $Av = rv$ and therefore $M_t v = v$. If $u$ is the vector with all
components 1, then $pu \le v \le qu$ where $p$ is the least and $q$ the greatest component of $v$. Now take
any $x$ with $\|x\| = 1$; then $|x| \le u$ and

$$0 \le M_t |x| \le M_t u \le p^{-1} M_t v = p^{-1} v \le q p^{-1} u.$$
So
$$\|M_t\| \le q p^{-1} \|u\| = q p^{-1} n^{1/2}.$$


Also solved by Michele Elia (Italy), Mauri Koskela (Finland), Beresford Parlett, and the proposer.
No expression for the best value of $K$ substantially better than the trivial $K = \sup_{t > r} \|M_t\|$ was obtained. There is
no bound on $K$ as $A$ varies. To see this consider $A = \begin{pmatrix} 0 & a \\ 1 & 0 \end{pmatrix}$, $a > 0$. Then $r = a^{1/2}$, $M_t = (t + r)^{-1}(tI + A)$ and
for $x = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$, $t > r$, we have

$$\|M_t\| \ge \|M_t x\| = (t + r)^{-1}(t^2 + 1)^{1/2}, \quad \text{so that} \quad K \ge \frac{1}{2}\left(1 + \frac{1}{a}\right)^{1/2}.$$
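
A small numerical experiment may help to see both bounds at work. The sketch below takes the $2 \times 2$ example of the editorial remark to be $A = \begin{pmatrix} 0 & a \\ 1 & 0 \end{pmatrix}$, which is an assumption about the garbled original but is consistent with $r = a^{1/2}$. It compares $\|M_t\|$ with the lower bound $(t + r)^{-1}(t^2 + 1)^{1/2}$ and with Kestelman's upper bound $q p^{-1} n^{1/2}$ for the Perron vector $v = (a^{1/2}, 1)$. The function names are hypothetical.

```python
import math


def m_t(t, a):
    """M_t = (t - r)(tI - A)^{-1} for A = [[0, a], [1, 0]], r = sqrt(a), t > r."""
    r = math.sqrt(a)
    det = t * t - a  # determinant of tI - A
    inverse = [[t / det, a / det], [1 / det, t / det]]
    return [[(t - r) * entry for entry in row] for row in inverse]


def spectral_norm_2x2(m):
    """Largest singular value of a real 2 x 2 matrix."""
    (p, q), (u, v) = m
    s = p * p + q * q + u * u + v * v   # sum of squared singular values
    d = p * v - q * u                   # product of singular values, up to sign
    return math.sqrt((s + math.sqrt(max(s * s - 4 * d * d, 0.0))) / 2)


def kestelman_bound(a):
    """q / p * sqrt(n) with n = 2 and Perron vector v = (sqrt(a), 1)."""
    v = (math.sqrt(a), 1.0)
    return max(v) / min(v) * math.sqrt(2)


def lower_bound(t, a):
    """Norm of M_t applied to the first unit vector."""
    return math.sqrt(t * t + 1) / (t + math.sqrt(a))


for a in (1.0, 0.25, 0.01):
    r = math.sqrt(a)
    for t in (r + 0.001, r + 1.0, r + 10.0):
        norm = spectral_norm_2x2(m_t(t, a))
        print(a, round(t, 3), lower_bound(t, a), norm, kestelman_bound(a))
```

As $a$ decreases, the values near $t = r$ grow roughly like $\frac{1}{2}(1 + 1/a)^{1/2}$, which illustrates why no bound on $K$ independent of $A$ is possible.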


Abundant and Deficient Numbers 


6356 [1981, 623]. Proposed by C. R. Wall, Trident Technical College, Charleston, S.C. 


A number $N$ is called deficient if the sum of its proper divisors is less than $N$: $\sigma(N) < 2N$. It
is called abundant if $\sigma(N) > 2N$.

(a) Let k be fixed. Do there exist sequences of k consecutive abundant numbers? 

(b) Prove that there are infinitely many sequences of 5 consecutive deficient numbers. 


Solution by Lorraine L. Foster, California State University, Northridge, California. Let $p_n$ denote
the $n$th prime.

(a) Since $\prod (1 + 1/p_n) = \infty$, for each positive integer $m$ there is an integer $N(m)$ such that
$\prod_{n=m}^{N(m)} (1 + 1/p_n) > 2$. Hence $\prod_{n=m}^{N(m)} p_n$ is abundant. Define $m_1 = 1$; $m_{j+1} = N(m_j) + 1$ for
$j \ge 1$, and let

$$t_j = \prod_{a=m_j}^{N(m_j)} p_a \quad \text{for } j \ge 1.$$

Since the $t_j$ are pairwise relatively prime, we may choose $X > 0$ such that $X + j \equiv 0 \pmod{t_j}$ for
$j = 1, 2, \ldots, k$. Since a multiple of an abundant number is abundant, $X + 1, X + 2, \ldots, X + k$ is
a sequence of $k$ consecutive abundant numbers.
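
The construction in part (a) is effective, and for small $k$ it can be carried out directly. The sketch below is only an illustration: it builds the blocks $t_j$ greedily from consecutive primes and then solves the congruences $X + j \equiv 0 \pmod{t_j}$ with the Chinese remainder theorem. The function names are hypothetical, and the modular inverse via `pow(m, -1, t)` assumes Python 3.8 or later.

```python
from fractions import Fraction


def primes():
    """Yield 2, 3, 5, ... by trial division against the primes found so far."""
    found = []
    candidate = 2
    while True:
        if all(candidate % p for p in found):
            found.append(candidate)
            yield candidate
        candidate += 1


def abundant_blocks(k):
    """Pairwise coprime abundant numbers t_1, ..., t_k built from runs of consecutive primes."""
    gen = primes()
    blocks = []
    for _ in range(k):
        t, ratio = 1, Fraction(1)
        while ratio <= 2:              # sigma(t)/t for squarefree t is the product of (1 + 1/p)
            p = next(gen)
            t *= p
            ratio *= Fraction(p + 1, p)
        blocks.append(t)
    return blocks


def consecutive_abundant_start(k):
    """Return (X, blocks) with X > 0 and X + j divisible by t_j for j = 1..k."""
    blocks = abundant_blocks(k)
    x, modulus = 0, 1
    for j, t in enumerate(blocks, start=1):
        # Lift x to a solution that is also congruent to -j modulo t.
        step = ((-j - x) * pow(modulus, -1, t)) % t
        x += modulus * step
        modulus *= t
    return x, blocks


X, blocks = consecutive_abundant_start(2)
print(blocks[0], len(str(blocks[1])), len(str(X)))
```

The first block is $30 = 2 \cdot 3 \cdot 5$. The second block already needs a long run of primes starting at 7, so the guaranteed $X$ is very large even for $k = 2$, and smaller examples may well exist.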

(b) Since $\lim_{n \to \infty} (1 + 2/(n \log n))^n = 1$ and, by the prime number theorem, $\lim_{n \to \infty} p_n/(n \log n) = 1$,
we may suppose $n$ to be so large that

$$(1 + 2/(n \log n))^n < 1.01 \quad \text{and} \quad p_n > (n \log n)/2 > 60.$$

Define $P_n = \prod_{i=1}^{n} p_i$, $M_n = 60P_n + 1$, and let $x_i = (M_n + i - 1)/i$ for $i = 1, 2, 3, 4, 5$. Since
$n > 3$, it is easy to see that each $x_i$ is an integer relatively prime to $P_n$. Hence each $x_i$ is of the form
$\prod_{j=1}^{t} p_{n_j}$ where $t < n$, $n_j > n$ (and the $n_j$ are not necessarily distinct). Since $1 + 1/p_{n_j} < 1 + 1/p_n$
and $\sigma(p^s)/p^s < (\sigma(p)/p)^s$ for $s > 1$, $p$ prime, we conclude that

$$\sigma(x_i)/x_i < (1 + 1/p_n)^n < (1 + 2/(n \log n))^n < 1.01 \quad \text{for } i = 1, 2, 3, 4, 5.$$

Thus

$$\frac{\sigma(M_n + i - 1)}{M_n + i - 1} = \frac{\sigma(i)}{i} \cdot \frac{\sigma(x_i)}{x_i} < (1.75)(1.01) < 2 \quad \text{for } i = 1, 2, 3, 4, 5,$$

and so $M_n, M_n + 1, \ldots, M_n + 4$ is a sequence of 5 consecutive deficient numbers. The result is
best possible since 6 is perfect and every larger multiple of 6 is abundant, so no six consecutive
numbers are all deficient.
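
The numbers $M_n$ produced by the proof are far too large to compute with, but the statement itself is easy to explore for small values. The following sketch is independent of the construction above and uses hypothetical function names. It sieves $\sigma$ up to a limit and lists the starting points of runs of consecutive deficient numbers.

```python
def sigma_table(limit):
    """sigma[m] = sum of all divisors of m, for 0 <= m <= limit (sigma[0] unused)."""
    sigma = [0] * (limit + 1)
    for d in range(1, limit + 1):
        for multiple in range(d, limit + 1, d):
            sigma[multiple] += d
    return sigma


def deficient_runs(limit, length):
    """Starting points s <= limit - length + 1 such that s, ..., s + length - 1 are all deficient."""
    sigma = sigma_table(limit)
    starts = []
    run = 0
    for n in range(1, limit + 1):
        run = run + 1 if sigma[n] < 2 * n else 0
        if run >= length:
            starts.append(n - length + 1)
    return starts


print(deficient_runs(1000, 5)[:10])
print(deficient_runs(1000, 6))
```

Runs of five appear between consecutive multiples of 6 whenever the five intermediate numbers are all deficient, and the search for runs of six returns nothing, as the closing remark of the solution predicts.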


Also solved by Claudia Spiro and the proposer. Jerzy Browkin (Poland) and Jeff Loveland also solved part (a). 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER, CARLETON COLLEGE 


Mathematics: Problem Solving through Recreational Mathematics. By Bonnie Averbach and Orin 
Chein. W. H. Freeman, San Francisco, 1980. pp. vii + 400. $16.50. Accompanying instructor’s 
guide, 169 pp. 


MURRAY S. KLAMKIN 
Department of Mathematics, University of Alberta, Edmonton, Alberta, Canada T6G2G1 


For some time now, Problem Solving has been in high fashion in mathematical education 
circles. As a consequence, there has been a steady stream of books in this field. Whether or not 
you judge these books to be worthwhile depends highly on your own background and your own 
views on Problem Solving. Also, one should take into consideration the aims of the authors even if 
it turns out that one is not wholly in agreement with them. 

What is Problem Solving? As an indication, Halmos in a rather charming article [4] notes that: 


Mathematicians sometimes classify themselves and each other as either problem-solvers or 
theory-creators. The problem-solvers answer yes-or-no questions and discuss vital special cases and 
concrete examples that are the flesh and blood of all mathematics; the theory-creators fit the results 
into a framework, illuminate it all, and point it in a definite direction—they provide the skeleton and 
the soul of mathematics. One and the same human being can be both a problem-solver and a 
theory-creator, but usually, he is mainly one or the other. The problem-solvers make geometric 
constructions, the theory-creators discuss the foundations of Euclidean geometry; the problem-solvers 
find out what makes switching diagrams tick, the theory-creators prove representation theorems for 
Boolean algebras. 


Many problem-solvers (and I classify myself that way) are not only concerned with solving 
problems per se but also with connecting them with other problems, and extending their 
applicability, i.e., theory-creating. Dually, the theory-creators have to be problem-solvers in 
establishing their necessary sequences of lemmas and theorems for their theories. 

As a better indication for Problem Solving, I can do no better than to quote from George 
Polya. His five books [5]-[9] are still, in my view, the best books written on Problem Solving. 


A great discovery solves a great problem but there is a grain of discovery in the solution of any 
problem. Your problem may be modest; but if it challenges your curiosity and brings into play your 
inventive faculties, and if you solve it by your own means, you may experience the tension and enjoy 
the triumph of discovery. Such experiences at a susceptible age may create a taste for mental work and 
leave their imprint on mind and character for a lifetime. 

Thus, a teacher of mathematics has a great opportunity. If he fills his allotted time with drilling his 
students in routine operations he kills their interest, hampers their intellectual development, and 
misuses his opportunity. But if he challenges the curiosity of his students by setting them problems 
proportionate to their knowledge, and helps them to solve their problems with stimulating questions, 
he may give them a taste for, and some means of, independent thinking. [5, p.v] 

1. Solving a problem means finding a way out of a difficulty, a way around an obstacle, attaining 
an aim which was not immediately attainable. Solving problems is the specific achievement of 
intelligence, and intelligence is the specific gift of mankind: solving problems can be regarded as the 
most characteristically human activity. The aim of this work is to understand this activity, to propose 
means to teach it, and, eventually, to improve the problem-solving ability of the reader. 

2. This work consists of two parts; let me characterize briefly the role of these two parts. 

Solving problems is a practical art, like swimming, or skiing, or playing the piano: you can learn it 
only by imitation and practise. This book cannot offer you a magic key that opens all the doors and 
solves all the problems, but it offers you good examples for imitation and many opportunities for 
practise: if you wish to become a problem solver you have to solve problems. [6, p.v] 


The authors, Averbach and Chein, address the nontrivial problem of how liberal arts students 
can be given a feeling for mathematics in a one or two semester college course. They have 
concluded that with most of the usual approaches through history, culture, or applications, the 
student is merely an observer which is not good enough. And in line with Polya’s comments, their 
aim is to get the students to think for themselves on problems which are not essentially the same 
as those presented in class. They eschew applications since they feel that only highly simplified 
cases can usually be considered. Their approach is based on recreational mathematics—recrea- 
tional problems, puzzles and games, since firstly they are fun and provide motivation which 
clearly is important and, secondly, historically many important mathematical problems arose from 
problems which were recreational in origin. While I agree with these two latter points, it should 
also be noted that many recreational problems were not important mathematically and that many 
important mathematical problems arose from considering applications. Also, the authors’ empha- 
sis is not on the mathematical results themselves but rather on how these results can be used in 
thinking about problems and solving them. 

If one accepts the authors’ premises, then they have done a good job and it is likely that 
students using the book will be motivated by the discussions and problems. The chapters are 
essentially independent and each contains several types of problems. Many of them have appeared 
previously in columns, journals, and puzzle or problem books in recreational mathematics. There 
are sample problems with solutions to whet the reader’s appetite and to motivate the discussions 
in the chapters. As a useful aid to the student, the exercises are marked in three ways: those that 
are solved completely in the Hints and Solution section, those that have answers provided, and 
those that provide helpful hints. Problems are also prefixed with one or two stars to denote those 
that are difficult or lengthy, or very difficult, respectively. 

I found myself being somewhat turned off by the book, after an initial refreshing feeling, 
because of the overemphasis on the recreational aspects. No doubt, part of this was due to 
overexposure since I had already seen very many of the problems given or very similar ones. I 
would prefer a book for liberal arts students which blended in some history and culture together 
with problem solving, both pure and applied. Mathematics is much more than just fun and games, 
and to me the authors have overemphasized these aspects. Also, I do not agree with the authors 
that there are not any relatively simple realistic applications which can be treated. For example, 
one may consider the principles behind the following three instruments: a vernier, a stroboscope, 
and a spherometer. The first two lead to nice arithmetic applications while the third one is 
geometric. As a precursor for the third one, one can pose the problem of giving a Euclidean 
construction for the diameter of a given solid ball using a compass, a straight edge and a plain 
plane piece of paper. For other elementary applications, see [11], [13]. 
Other books which can be compared to the present one for the same purpose are 
[1], [2], [3], [10], [14]. Whether they are actually better or not cannot really be decided upon unless 
we also receive input from students who actually use these books for the purposes intended. 

Finally, if I had to choose a text to be used solely for problem solving, I would prefer picking 
one or two of the five mentioned books by Polya, since they have a better treatment of “Problem 
Solving Strategies,” such as symmetry, continuity, physical analogies, etc., and are beautifully 
written. Nevertheless, at the least, the Averbach-Chein book is definitely useful as a supplemen- 
tary text or reference since it does provide much interesting material for the students. 
Quantum Mechanics in Hilbert Space, Second Edition. By Eduard Prugovecki. Academic Press, 
New York, 1981. xxii + 685 pp., $39.50. 


JOHN CHALLIFOUR 

The power and beauty of mathematical models in physics has often been noted, sometimes 
with frustration but most often with awe. Quantum mechanics is no exception, and has come to 
mean those mathematical models or equations which describe the dynamics of the atomic and 
molecular world. At the mathematical level quantum mechanics describes physical phenomena by 
assigning to each “idealized state” of a physical system a unit vector in a complex (separable) 
Hilbert space. Physical measurements, or observables, are related to appropriate self-adjoint 
operators in Hilbert space, and the actual measured values are directly associated with the spectra 
of the corresponding operators. Typically, such a unit vector might represent the location of the 
valence electrons in an atom, and a positive, self-adjoint operator, called the Hamiltonian, has as 
its spectrum the energy levels which appear as spectral lines in the laboratory. 

Since its inception in 1925, nonrelativistic quantum mechanics, independently due to W. 
Heisenberg and E. Schrödinger, has occupied a central place in the study of self-adjoint operators 
in Hilbert space by providing nontrivial problems that challenge the usefulness of the general 
theory and, by their physical setting, offer hints for solution. Among these are the spectral 
theorem (E. Hellinger, 1909; J. von Neumann, 1929), perturbations of unbounded operators 
(F. Rellich, 1939; T. Kato, 1957), wave operators and perturbation of absolutely continuous 
spectra (C. Møller, 1946; K. Friedrichs, 1948; T. Kato and S. T. Kuroda, 1959), and a general 
theory of operator algebras in Hilbert space (J. von Neumann, 1927; F. J. Murray and J. von 
Neumann, 1936-1949). Presence of spin and other discrete quantum numbers in Nature spurred 
and added greatly to the study of representations of Lie groups (H. Weyl, 1926; E. Wigner, 1939). 
One might then ask whether after fifty-seven years of intensive and successful research the 
usefulness of quantum mechanics as a mathematical theory has not in fact been plumbed, leaving 
possibly difficult but rather specialized questions. 

For multiparticle systems (large atoms, electrons in a metal, quantum statistical mechanics) the 
answer must certainly be no. In fact, quantum statistical mechanics has provided examples of 
operator algebras in Hilbert space which have been absolutely pivotal in studying and classifying 
these objects beyond the work of Murray and von Neumann; see, for example, [1]. In the 
two-particle case where the simple Schrödinger equation is at work, there is a close relation 
between quantum mechanics and diffusion theory in terms of the heat equation; see [3], [5], [8] for 
precise details. Recent advances in both relativistic and nonrelativistic quantum mechanics have 
exploited this relation to such an extent that the whole framework of Hilbert space has given way 
to one of probability theory. Nevertheless there remains a unique feature of quantum mechanics 
and the Schrödinger equation which as yet has no intrinsic formulation in terms of the related 
diffusion: the so-called scattering theory describing mathematically the effects of particles 
scattering off a target. An answer to this question would offer a complete alternative to the 
Hilbert space formulation of quantum mechanics along probabilistic lines (R. P. Feynman, 1948) 
and tie yet another branch of mathematics intimately to quantum mechanics. 

The last ten years have seen a number of advanced texts on the mathematics and mathematical 
physics of quantum mechanics, see [1], [3], [6], [7], [8] among others, and it is clear that this subject 
has reached a spritely and healthy middle age. However, where could a first-year graduate student 
in mathematics or physics begin to make his bones in this subject? For one wishing to learn in 
company with one of the standard physics texts at this level [4], formulating the physics rigorously 
and carrying out many of the standard theoretical physics calculations, Prugovecki’s book is a 
good if somewhat lengthy companion. For such a reader one of this book’s strong points is that it 
is self-contained, starting with vector spaces and measure spaces, taking the reader through the 
Lebesgue integral and a complete account of linear operator theory on Hilbert space. In fact, this 
preparation accounts for more than a third of the book, with a middle third on the standard 
quantum mechanics of Schrödinger and Heisenberg. The main body of application occupies the 
last third of the text with a complete and extensive treatment of quantum mechanical scattering 
theory. It is here that most changes appear from the first edition with additions to bring the 
reader closer to the frontier in this area of research. The reader whose appetite for scattering 
theory is still not slaked by this account would be ready for volume 3 of [6]. 

The mathematics of quantum mechanics is a heady brew and not for the dilettante, but it is 
fascinating beyond measure. This book follows closely the syllabus of a first-year graduate course 
in quantum mechanics in a physics department but with mathematical rigor. For me, it followed 
that syllabus too closely, and with too much elaboration of similar ground, to set apart those parts 
of the subject that are solely technical in a mathematical sense from those parts that are 
conceptual. The one striking omission in a work of this length and focus was the lack of a 
substantive treatment of the Rayleigh-Schrödinger perturbation theory for the discrete spectrum 
of the Hamiltonian. After all, this formed the essential physical aspect of quantum mechanics and 
has prompted much mathematical research; see, for example, the remarks in [2]. Nevertheless the 
dedicated reader can learn much Hilbert space technique and quantum mechanics from this book. 
MISCELLANEA 


Russell and Whitehead 


96. Symbolism is useful because it makes things difficult. Now in the beginning everything is 
self-evident, and it is hard to see whether one self-evident proposition follows from another or not. 
Obviousness is always the enemy to correctness. Hence we must invent a new and difficult 
symbolism in which nothing is obvious. 


Bertrand Russell 


It is a safe rule to apply that, when a mathematical or philosophical author writes with a misty 
profundity, he is talking nonsense. 


A. N. Whitehead 


—Quoted by R. E. Moritz, Memorabilia 
Mathematica, Macmillan, New York, 1914, pp. 
199, 210. 


How’s that again department 


97. The Rasmussen Report estimates there will be one meltdown every 20,000 reactor-years, 
and one fatality (from cancer) every 50 reactor-years. Conjoin those data (20,000 divided by 50) 
and you get the figure of 400 deaths a year. 


—W. F. Buckley, Chicago Sun-Times, 
19 May 1979, p. 36 


98. Computation is not barren when it supplies subsistence; it ceases to be an unmitigated evil 
when it is dignified by ministering to the necessities of material life and comfort. But the 
computation of the geometer, however tedious it may be, has a loftier aspiration. It provides 
spiritual nourishment: hence it is life itself, and is the worthy occupation of an immortal soul. The 
arithmetic formula, considered as an end, is the embodiment of fact, and isolated fact is as 
worthless as the idle gossip of the parlor or the club; whereas facts combined into formulae and 
formulae organized into theory penetrate the whole domain of physical science, and ascend to the 
very throne of ideality. 


—Benjamin Peirce, Ideality in the 
Physical Sciences, Little Brown, Boston, 1881. 
NEW IN 1983 
FROM BROOKS/COLE 


Linear Algebra with Applications 

Second Edition 

by Jeanne Agnew, Oklahoma State University; and R. C. Knapp, Herkimer 
County Community College. An informal, highly applied introduction for 
mathematics, computer science, engineering, and physical science students. 


Elements of Calculus 

by Don Allen, Charles Chui, and William Perry, Texas A & M University. Fulfills 
the elementary calculus requirements for students majoring in business, biology, 
and the social sciences. 


College Algebra and Trigonometry 
by David Gustafson and Peter Frisk, Rock Valley College. Covers topics with 
precision and detail in a lively and accessible style. 


Precalculus Mathematics 

A Functional Approach, Second Edition 

by Karl J. Smith, Santa Rosa Junior College. Highly student motivated—includes 
an innovative treatment of the graphing of trigonometric functions. 


College Algebra 

Second Edition 

by David Gustafson and Peter Frisk, Rock Valley College. Provides a solid 
background in the algebra skills required in calculus. 


Essentials of Trigonometry 
by Karl J. Smith, Santa Rosa Junior College. Features interesting problems and 
step-by-step calculator procedures. 


Mathematics: its Power and Utility 
by Karl J. Smith, Santa Rosa Junior College. Emphasizes acquisition of basic 
arithmetic and algebraic problem-solving skills in everyday applications. 


Technical Calculus with Analytic Geometry 
by Peter Kuhfittig, Milwaukee School of Engineering. Introduces calculus for 
students in various engineering technology programs. 


A Transition to Advanced Mathematics 

by Douglas Smith, Central Michigan University; Maurice Eggen, Trinity 
University; and Richard St. Andre, Central Michigan University. A new book for 
the course that bridges the gap between the calculus sequence and the upper 
division mathematics courses. 


For further information write Craig Barth, Editor 
BROOKS/COLE PUBLISHING COMPANY 
555 Abrego Street, Monterey, California 93940 


MAA STUDIES IN 
MATHEMATICS 


MAA STUDIES IN MATHEMATICS present to the mathematical community 
expository papers in topics at the research frontiers written for a broad 
audience. Each volume is edited by a prominent mathematician active in 
the area of the study. There are twenty-one volumes currently available in 


this series: 
List Price Member Price 

Vol. 1. Studies in Modern Analysis, R. C. Buck, $16.50 $12.00 
editor 

Vol. 2. Studies in Modern Algebra, A. A. Albert, 16.50 12.00 
editor 

Vol. 3. Studies in Real and Complex Analysis, 16.50 12.00 
I. I. Hirschman, Jr., editor 

Vol. 4. Studies in Global Geometry and Analysis, 16.50 12.00 
S. S. Chern, editor 

Vol. 5. Studies in Modern Topology, P. J. Hilton, 16.50 12.00 
editor 

Vol. 6. Studies in Number Theory, W. J. LeVeque, 16.50 12.00 
editor 

Vol. 7. Studies in Applied Mathematics, 16.50 12.00 
A. H. Taub, editor 

Vol. 8. Studies in Model Theory, M. D. Morley, 16.50 12.00 
editor 

Vol. 9. Studies in Algebraic Logic, 16.50 12.00 
Aubert Daigneault, editor 

Vol. 10. Studies in Optimization, George B. Dantzig 16.50 12.00 
and B. Curtis Eaves, editors 

Vol. 11. Studies in Graph Theory, Part I, 16.50 12.00 
D. R. Fulkerson, editor 

Vol. 12. Studies in Graph Theory, Part II, 16.50 12.00 
D. R. Fulkerson, editor 

Vol. 13. Studies in Harmonic Analysis, J. M. Ash, 21.00 16.00 
editor 

Vol. 14. Studies in Ordinary Differential Equations, 21.00 16.00 
Jack Hale, editor 

Vol. 15. Studies in Mathematical Biology, Part I, 21.00 16.00 
“Cellular Behavior and Development of 
Pattern,” Simon Levin, editor 

Vol. 16. Studies in Mathematical Biology, Part II, 21.00 16.00 
“Populations and Communities,” 
Simon Levin, editor 


Vol. 17. Studies in Combinatorics, Gian-Carlo Rota, 21.00 16.00 
editor 

Vol. 18. Studies in Probability, Murray Rosenblatt, 21.00 16.00 
editor 

Vol. 19. Studies in Statistics, R. V. Hogg, editor 18.00 13.50 

Vol. 20. Studies in Algebraic Geometry, 16.00 12.00 
Abraham Seidenberg, editor 

Vol. 21. Studies in Functional Analysis, 19.00 14.00 
Robert G. Bartle, editor 


ORDER FROM: THE MATHEMATICAL ASSOCIATION OF AMERICA 
1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 


APPLIED MATHEMATICS: PRINCIPLES, TECHNIQUES, AND 
APPLICATIONS James Cochran, Washington State University 


Senior/graduate-level textbook for students in mathematics, physics, and engineering. 
Contents: 1. Linear Algebra and Computation; 2. Eigenvalue Problems for Differential 
Equations; 3. The Special Functions of Applied Mathematics; 4. Optimization and the Calculus 
of Variations; 5. Analytic Function Theory; 6. Conformal Mapping; 7. Integral Transforms; 
8. Green’s Functions; 9. Generalized Functions; 10. Linear Integral Functions; 11. Asymptotics. 
1982 390 pp. cloth ISBN 0-534-98018-X $34.95 


CONFERENCE ON HARMONIC ANALYSIS IN HONOR OF 
ANTONI ZYGMUND, in two volumes. Edited by William Beckner, Alberto Calderón, 
Robert Fefferman, and Peter Jones, University of Chicago 

In March 1981, some two hundred mathematicians gathered at the University of Chicago to 
honor Professor Antoni Zygmund on the occasion of his eightieth birthday. The most promi- 
nent analysts in the world, including L. Carleson, J.-P. Kahane, E. M. Stein, C. Fefferman, 
P. Malliavin, and A. Calderón, have contributed to this collection of fifty-eight papers. 
1982 880 pp. cloth ISBN 0-534-98043-0 $79.95 


INTRODUCTION TO CLASSICAL REAL ANALYSIS 
Karl Stromberg, Kansas State University 


Advanced undergraduate introduction to analysis. “Outstanding Academic Book 1981-2” 
(Choice). “Could very well become a classic” (American Mathematical Monthly). “A summary 
cannot do justice to the author’s style, the wealth of his material, or the lucidity of his metic- 
ulous exposition” (Mathematical Gazette). 1981 576 pp. cloth ISBN 0-534-98012-0 $31.95 


ELEMENTARY MATHEMATICAL ANALYSIS, Second Edition 
Colin Clark, University of British Columbia 


Undergraduate textbook for beginning analysis; analysis for secondary teachers. 
Contents: 1. Sequences, Limits, and Real Numbers; 2. Infinite Series; 3. Limits, Continuity, 
and Differentiability; 4. Properties of Continuous Functions; 5. Some Theorems of Calculus; 
6. Limits and Continuity in n Dimensions. 1982 259 pp. cloth ISBN 0-534-98018-X 
LC 81-4759 $21.95 


TOPICS IN ALMOST EVERYWHERE CONVERGENCE 
Adriano Garsia, University of California, San Diego 


The 1970 edition of this classic is now available from Wadsworth. 
Contents: 1. Some Basic Principles; 2. Some Ergodic Theory; 3. Combinatorial Inequalities 
and Convergence Almost Everywhere of Some Orthogonal Expansions; 4. Carleson’s “log 
log n” Result. 1970 154 pp. paper ISBN 0-534-98045-7 $14.95 


ALGEBRAIC NUMBERS AND FOURIER ANALYSIS, 
Raphael Salem, and 
SELECTED PROBLEMS ON EXCEPTIONAL SETS, 
Lennart Carleson 


A reprint of these two classic works is now available in one volume. 1983 192 pp. cloth 
ISBN 0-534-98049-X $29.95 


WADSWORTH INTERNATIONAL GROUP 
10 Davis Drive 
Belmont, CA 94002 


The Mathematical Association of America 


CARUS MATHEMATICAL 
MONOGRAPHS 


Students and teachers of mathematics as well as nonspecialists and 
workers in other fields, will all appreciate the expository excellence of 
these monographs. A wide range of topics in pure and applied mathe- 
matics are covered, each one written by a renowned expositor in the 
field. A glance at the list of titles of the books and the list of authors will 
convince you of the quality of this respected series of monographs. Twenty 
titles are currently available: 


List price MAA Member 


No. 1. Calculus of Variations, by G. A. Bliss $16.50 $12.00 

No. 2. Analytic Functions of a Complex Variable, 16.50 12.00 
by D. R. Curtiss 

No. 3. Mathematical Statistics, by H. L. Rietz 16.50 12.00 

No. 4.  Projective Geometry, by J. W. Young 16.50 12.00 

No. 6. Fourier Series and Orthogonal Polynomials, 16.50 12.00 
by Dunham Jackson 

No. 8. Rings and Ideals, by N. H. McCoy 16.50 12.00 

No. 9. The Theory of Algebraic Numbers 16.50 12.00 
(Second edition) by Harry Pollard, and 
H. G. Diamond 

No. 10. The Arithmetic Theory of Quadratic Forms, 16.50 12.00 
by B. W. Jones 

No. 11. Irrational Numbers, by Ivan Niven 16.50 12.00 

No. 12. Statistical Independence in Probability, 16.50 12.00 
Analysis and Number Theory, 
by Mark Kac 

No. 13. A Primer of Real Functions (Third edition), 16.50 12.00 
by R. P. Boas, Jr. 

No. 14. Combinatorial Mathematics, by H. J. Ryser 16.50 12.00 

No. 15. Noncommutative Rings, by I. N. Herstein 16.50 12.00 

No. 16. Dedekind Sums, Hans Rademacher and 16.50 12.00 
Emil Grosswald 

No. 17. The Schwarz Function and its Applications, 16.50 12.00 
by P. J. Davis 

No. 18. Celestial Mechanics, by Harry Pollard 16.50 12.00 

No. 19. Field Theory and its Classical Problems, 21.00 16.00 
by Charles Hadlock 

No. 20. The Generalized Riemann Integral, 18.00 13.50 
by R. M. McLeod 


ORDER FROM: 

THE MATHEMATICAL ASSOCIATION OF AMERICA 
1529 Eighteenth Street, N.W. 

Washington, D.C. 20036 


RICE UNIVERSITY Position in Operations Research 


The Mathematical Sciences Department of Rice University expects 
to have a position in Operations Research available beginning Fall, 
1983. Although the position is expected to be at a senior level, applica- 
tions at all levels are invited. Primary duties include providing leader- 
ship in teaching, research, and direction of graduate students in the 
department's Operations Research program. The department currently 
has responsibilities in the areas of computer science, numerical analy- 
sis, operations research, probability, statistics, and physical mathe- 
matics and biomathematics. The department has an active undergrad- 
uate and graduate program in these areas. If interested, please send 
resume and names of four references to 


Professor Richard Tapia 
Chairman 
Mathematical Sciences Dept. 
Rice University 
P.O. Box 1892 
Houston, TX 77251 


“open up fields of seemingly inexhaustible wealth” 
Prof. Alexander Grothendieck 


“represent tremendous amounts of new information” 
Prof. Morris Newman 


SSS’ companion volumes on new properties, methods 
of analysis, & invariants of general algebraic curves 

SYMMETRY, An Analytical Treatment 


by J. Lee Kavanau 
August, 1980, 656 pp., illus., $29.95 


“One of the most original treatments of plane 
curves to appear in modern times. The author’s 
new and deeper studies...reveal a great number 
of beautiful & heretofore hidden properties of al- 
gebraic plane curves.” Prof Basil Gordon 


“Provides sharp new tools for studying the prop- 
erties of general algebraic curves.” 

Prof Richard Fowler 
“Striking new results on symmetry & classifi- 
cation of curves...Read this book for more in sym- 
metry than meets the eye.” Amer. Math. Monthly, 1981 


“Extremely rich in content.” Nordisk Matem Tids., 1981 
Send SASE for $2500 Geometry Competition details 


Science Software Systems, Inc., 


CURVES & SYMMETRY, vol. 1 


by J. Lee Kavanau 


Jan.,1982, 448pp., over 1,000 
indiv. curves, $21.95, $47 the set 


“Casts much new light on inversion & 
its generalization, the linear fraction- 
al (Moebius) transformation, with 
promise of increasing their utility by 
an order of magnitude.” Prof. Richard Fowler 


“Replete with fascinating, provocative 
new findings...accompanied by a wealth 
of beautiful & instructive illustrations.” 

Prof. Basil Gordon 
“Extends the idea of inversion into quite 
a new field.” E. H. Lockwood 


“Examines many classical curves from 
new standpoints.” Nordisk Matem Tids., 1982 
BankAmericard, Master Card: 213-477-8541 


11899 W. Pico Blvd., Los Angeles, Calif. 90064 


SOME INFORMATIVE AND USEFUL 
BOOKS FROM THE MAA... 


Browse through this list and see if your library is missing some of these important books published 


by the Association. 


APPLICATIONS OF UNDERGRADUATE 
MATHEMATICS IN ENGINEERING — written and 
edited by Ben Noble. A collection of articles prepared 
by engineers for the Committee on the Undergraduate 
Program in Mathematics. 364 pp. Hardbound. 

List: $19.00. MAA Member: $13.00. 


ANNOTATED BIBLIOGRAPHY OF EXPOSITORY 
WRITING IN THE MATHEMATICAL SCIENCES, 
prepared by M. P. Gaffney and L.A. Steen. An invaluable 
reference source of expository articles in mathematics. 
282 pp. Paperbound. 

List: $12.00. MAA Member: $8.00. 


AN ANNOTATED BIBLIOGRAPHY OF FILMS AND 
VIDEOTAPES FOR COLLEGE MATHEMATICS, by 
David I. Schneider. An up-to-date listing of films and 
videotapes available for classroom use. 107 pp. 
Paperbound. 

List: $9.00. MAA Member: $6.00. 


A BASIC LIBRARY LIST FOR TWO-YEAR COLLEGES, 
prepared by the Committee on Basic Library Lists. A 
recommended library nucleus for two-year colleges. 

66 pp. Paperbound. 

List: $8.00. MAA Member: $6.00. 


A BASIC LIBRARY LIST FOR FOUR-YEAR 
COLLEGES, prepared by CUPM. Presents listings of 
books and journals that should be in every college 
library. 106 pp. Paperbound. 

List: $9.00. MAA Member: $6.50. 


THE CHAUVENET PAPERS. A Collection of Prize 
Winning Expository Papers in Mathematics, edited by 
James C. Abbott. Two volumes of the collected prize 
winning Chauvenet Papers. Vol. 1: 312 pp. 
Hardbound. Vol. 2: 282 pp. Hardbound. 
List: $21.00 each. MAA Member: $16.00 each. 

Two volume sets 

List: $36.00. MAA Member: $27.00. 


CRITICAL VARIABLES IN MATHEMATICS 
EDUCATION: Findings from a Survey of the 
Empirical Literature, by E.G. Begle. A joint publication 
of the MAA and the National Council of Teachers of 
Mathematics. List: $8.00. MAA Member: $6.40. 


A COMPENDIUM OF CUPM RECOMMENDATIONS. 
Volumes I and II. A collection of the major 
recommendations of the Committee on the 
Undergraduate Program in Mathematics. Two volumes. 
756 pp. Hardbound. 

List: $16.50. MAA Member: $12.00. 


THE WILLIAM LOWELL PUTNAM MATHEMATICAL 
COMPETITION: PROBLEMS AND SOLUTIONS — 
1938-1964. Compiled by R. E. Greenwood, A. M. 
Gleason, and L. M. Kelly. Contains problems and 
solutions to the first 25 Putnam Exams. 652 pp. 
Hardbound. 

List: $35.00. MAA Member: $26.00. 


THE MATHEMATICAL ASSOCIATION OF AMERICA: 
Its First Fifty Years. An historical perspective of the 
Association. 170 pp. Hardbound. 

List: $10.00. MAA Member: $5.00. 


FIFTY YEAR INDEX OF THE MATHEMATICS 
MAGAZINE, edited by Lynn A. Steen, and J. Arthur 
Seebach. Cumulative index of volumes 1-50. 163 pp. 
paperbound. 

List: $10.00. MAA Member: $6.50. 


INDEX OF THE AMERICAN MATHEMATICAL 
MONTHLY. Contains the tables of contents for each 
issue of volumes 1-80 as well as subject and author 
indices. 269 pp. Hardbound. 

List: $19.00. MAA Member: $13.00. 


PRIME-80. Proceedings of a Conference on 
Prospects in Mathematics Education in the 1980's. 
Included are the background of the conference and the 
recommendations that resulted. 

84 pp. Paperbound. $3.50. 


PROFESSIONAL OPPORTUNITIES IN THE 
MATHEMATICAL SCIENCES, Tenth Edition. 1978. 
Designed for the student interested in a career in 
mathematics. 35 pp. Paperbound. $1.50 each. 95¢ for 
orders of five or more. 


RECOMMENDATIONS ON A GENERAL 
MATHEMATICAL SCIENCES PROGRAM. Prepared by 
the Committee on the Undergraduate Program in 
Mathematics. (CUPM) 102 pp. Paperbound. $3.50 each. 


Order From: 


THE MATHEMATICAL ASSOCIATION 
OF AMERICA 

1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 


New from The Mathematical 
Association of America 


VOLUME 6 — DOLCIANI MATHEMATICAL EXPOSITIONS 


MAXIMA AND MINIMA WITHOUT 
CALCULUS, 


by Ivan Niven 
xv + 323 pp. Hardbound 
List: $24.50 MAA Member: $18.50 


Ivan Niven, distinguished author of several books on number theory and 
probability, has compiled the basic elementary techniques for solving 
maxima and minima problems. Since many books and courses already cover 
calculus and linear programming techniques, the author deliberately omits 
these areas from his discussions and concentrates instead on methods in 
algebra and geometry not so widely known. These methods are organized 
according to the mathematical ideas used, and many chapters can be read 
independently without reference to what precedes or follows. Some of the 
problems presented are left for the reader to solve with sketches of solutions 
given in the later pages. 

The book is written for an audience at or near the maturity level of 
second- or third-year college students with a good working knowledge of 
precalculus mathematics. Although calculus is not a prerequisite, a prior 
knowledge of that subject will enhance the reader’s comprehension. An 
excellent sourcebook in the area of maxima and minima, the book will serve 
as a textbook or as enrichment material for the talented undergraduate. 

The main topics covered are: 

Simple Algebraic Results 

Elementary Geometric Questions 
Isoperimetric Results 

Basic Trigonometric Inequalities 
Polygons Inscribed and Circumscribed 
Ellipses 

The Bees and Their Hexagons 

Further Geometric Results 

Applied and Miscellaneous Problems 
Euclidean Three-Space 

Isoperimetric Results not Assuming Existence 
Postscript on Calculus 


Order from: 

The Mathematical Association of America 
1529 Eighteenth St., NW 

Washington, DC 20036 


Benjamin/Cummings 


Compare our Math Texts and Find 
Incomparable Quality 


NEW IN ’83 
Mario F. Triola 


Elementary Statistics 

NEW Second Edition 

Like the successful first edition, this text provides a 
solid introduction to probability and statistics for 
non-mathematics students. Over 65 illustrated mar- 
gin essays demonstrate modern applications to statis- 
tics. The Instructor’s Resource Guide (39322) con- 
tains transparency data sets and computer projects. 
The Student Solutions Manual (39321) with worked 
out solutions to selected odd exercises is available for 
student purchase. 512 pp/HB/39320/1983 


William Derrick 

College Algebra and 
Trigonometry  624pp/HB/32308/1982 
College Algebra  528pp/HB/32307/1982 
College Trigonometry  368pp/HB/32309/1982 

Through numerous exercises, word problems, ap- 
plications, and historical anecdotes, these books offer 
a straightforward presentation in an elegant, two- 
color design. Solutions manuals available for all three 


texts. 


The Washington Series — The Ac- 
knowledged Classics in Mathematics 
by Allyn J. Washington 


Essentials of Basic Mathematics, 
Third Edition  476pp/HB/39529 
Introduction to Technical Mathema- 
tics, Second Edition  580pp/HB/39525 
Basic Technical Mathematics, Third 
Edition  682pp/HB/39520 

Basic Technical Mathematics with Cal- 
culus, Third Edition — 9%6spp/HB/39521 

Basic Technical Mathematics with Cal- 
culus, Metric Version  965pp/HB/39523 
Technical Calculus with Analytic 
Geometry, Second Edition  544pp/HB/39519 


Coming Soon... Problems Supplement for 
Technical Mathematics (containing over 
1,000 additional problems) 


Billstein, Libeskind and Lott 

A Problem-Solving Approach to 
Mathematics for Elementary School 
Teachers 

Written to meet the recommendations for school 
mathematics of the 1980’s set forth by the National 
Council of Teachers of Mathematics, this widely used 
text presents problem-solving techniques in such a 
way that they form an integral part of mathematics 
for prospective elementary school teachers. Also of 
interest to instructors of liberal arts courses or com- 
bined liberal arts and teacher education courses. 
656pp/HB/30851/1981. Instructor’s Manual Available (30852) 


A. Wayne Roberts 

Elementary Linear Algebra 

This text was written to meet the challenge of 
covering the basics while indicating linear algebra’s 
applications and relationships to other areas of 
mathematics. In addition to the use of unique vig- 
nettes, Roberts explains the ‘‘why” behind the 
material in a clear, concise writing style. 480pp/HB/ 
38302/1982. 


Other Titles of Interest 


Peter Linz 
Programming Concepts and Problem 
Solving: An Introduction to Computer 


Science Using Pascal 
416pp/HB/35710/1983 


D.M. Etter 

Structured Fortran 77 for Engineers and 
Scientists 

376pp/SB/32520/1983 

Grady Booch, U.S. Air Force Academy 


Foreword by Larry Druffel 

Software Engineering with Ada* 
480pp (tent.)/SB/30600/1983 

*Ada is a registered trademark of The Department of 
Defence. 


For more information contact your local sales representative 


or write: 
Benjamin/Cummings 
Publishing Company, Inc. 
2727 Sand Hill Road 
Menlo Park, California 94025 


You can figure on success 
with 
Scott, Foresman Mathematics texts! 


Mathematics with 
Applications in the 
Management, Natural, 
and Social Sciences 


Third Edition 
Margaret L. Lial / Charles D. Miller 


The Third Edition offers more algebra review, more 
word problems, and improved coverage of probabil- 
ity; calculus topics are treated in greater depth. 
Available Now, © 1983, 690 pages, hardbound, 
with Instructor’s Guide including complete testing 
program, selected exercise solutions, and two 
computer programs in BASIC. 

Precalculus 
Mathematics 

A Functions Approach 

Floyd F. Helton / Margaret L. Lial 

Abundant use of interesting and relevant applica- 
tions; numerous, detailed examples; 3000 exer- 
cises, including miscellaneous and review exer- 
cises; 290 illustrative text figures. Fundamental 
algebraic topics are reviewed. Functions and their 
graphs are emphasized. Available Now, © 1983, 
approx. 514 pages, hardbound, with Instructor's 
Guide including answers to even-numbered text 
exercises, selected solutions, suggestions for 
using the text, and a testbank. 

Algebra and 
Trigonometry 

Margaret L. Lial / Charles D. Miller 
MATHEMATICAL METHODS OF ECONOMICS* 


JOEL FRANKLIN 
California Institute of Technology, Pasadena, California 91125 


When Dr. Golomb and Dr. Bergquist asked me to give a talk on economics, my first impulse 
was to try to get out of it. 

“Sol,” I said, “I’m not an economist. You know that.” 

“I know,” said Golomb. 

“If you want an economist, I can get you one,” I said. “I know some excellent economists.” 

“No,” he said, “we want a mathematician to talk about the subject to other mathematicians 
from their own point of view.” 

That made sense, and I hit on this idea: I won’t try to tell you what mathematics has done for 
economics. Instead, I'll do the reverse: I'll tell you some things economics has done for 
mathematics. I'll describe some mathematical discoveries that were motivated by problems in 
economics, and I'll suggest to you that some of the new mathematical methods of economics 
might come into your own teaching and research. 

One of these methods is called linear programming. I learned about it in 1958. I had just come 
to Caltech as a junior faculty member associated with the computing center. The director and I 
made a cross-country trip to survey the most important industrial uses of computers. In New 
York, we visited the Mobil Oil Company, which had just put in a multi-million-dollar computer 
system. We found out that Mobil had paid off this huge investment in two weeks by doing linear 
programming. 

Back at Caltech, Professor Alan Sweezy in economics and Professors Bill Corcoran and Neil 
Pings in chemical engineering urged me to teach a course in linear programming. When I told 
them I didn’t know linear programming, they said: Fine, Joel, learn it. Seeing they meant business, 
I did study the subject and give the course. The students loved it, and so did I. Perhaps you will 
have a similar experience. 

One surprising thing I found was this: the mathematics was delightful. I knew it was useful, but 
I hadn’t expected it to be beautiful. I was surprised to find that linear programming wasn’t just 
business mathematics or engineering mathematics; it was the general mathematics of linear 
inequalities. Later I found this mathematics coming into some of my own special fields of research 
(statistics, numerical analysis, ill-posed problems). Here again, you may have a similar experience. 
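
To make "the general mathematics of linear inequalities" concrete, here is a minimal sketch in Python of a two-variable linear program. The objective, the constraints, and the function name solve_lp_2d are illustrative assumptions, not taken from the talk, and the brute-force vertex search is a stand-in for the simplex method used in practice. It relies on the standard fact that a linear program with a bounded, nonempty feasible region attains its maximum at a vertex of that region.

```python
from fractions import Fraction
from itertools import combinations


def solve_lp_2d(c, constraints):
    """Maximize c[0]*x + c[1]*y subject to a*x + b*y <= r for every (a, b, r).

    Illustrative brute force (not the simplex method): intersect every pair
    of constraint boundaries, keep the feasible intersection points, and
    return (best_value, x, y).  Returns None if no vertex is feasible.
    Assumes integer data and a bounded feasible region.
    """
    best = None
    for (a1, b1, r1), (a2, b2, r2) in combinations(constraints, 2):
        det = a1 * b2 - a2 * b1
        if det == 0:
            continue  # parallel boundary lines: no single intersection point
        # Cramer's rule, done in exact rational arithmetic.
        x = Fraction(r1 * b2 - r2 * b1, det)
        y = Fraction(a1 * r2 - a2 * r1, det)
        if all(a * x + b * y <= r for a, b, r in constraints):
            value = c[0] * x + c[1] * y
            if best is None or value > best[0]:
                best = (value, x, y)
    return best


# Hypothetical problem: maximize 3x + 2y subject to
# x + y <= 4, x + 3y <= 6, x <= 3, x >= 0, y >= 0.
constraints = [(1, 1, 4), (1, 3, 6), (1, 0, 3), (-1, 0, 0), (0, -1, 0)]
value, x, y = solve_lp_2d((3, 2), constraints)
print(value, x, y)
```

The sign constraints x >= 0 and y >= 0 are written as -x <= 0 and -y <= 0, so that every condition has the same "less than or equal" form.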

Linear programming is one of the many mathematical methods of economics. Here are a few 
others: quadratic programming, geometric programming, general nonlinear programming; fixed- 
point theorems, especially the Kakutani theorem; calculus of variations, control theory, dynamic 
programming; theory of convex sets, especially convex cones; probability, statistics, 
stochastic processes; finite structures (graph theory, lattice theory); matrix theory; calculus, 
ordinary differential equations; and special topics like game theory and Arrow’s theory of rational 
preference orderings. 

Plato said mathematics is the essence of reality; Willard Gibbs said mathematics is the 
language of science. If they are right, we shouldn’t be surprised to find uses for any branch of 


The author is Professor of Applied Mathematics at the California Institute of Technology. At Stanford 
University, in 1953, he received his Ph.D. in pure mathematics. His published research pertains to numerical 
analysis, to ill-posed problems, to stochastic processes, and to mathematical problems in engineering, in crystallogra- 
phy, in geophysics, and in cell biology. At Caltech he has twice received the Associated Students Award for 
Excellence in Teaching. He has written two textbooks for undergraduates in mathematics: Matrix Theory and 
Methods of Mathematical Economics. 

*Invited address to the Mathematical Association of America and to the Society for Industrial and Applied 
Mathematics, November 14, 1981, Santa Barbara, California. 
mathematics in any science. Every branch of mathematics may have some use in the science of 
economics. Here are two bizarre examples: 

Have you heard of nonstandard analysis? I’ve heard of it, but know next to nothing about it. 
Nevertheless, on November 10, 1981, I heard Yale Economics Professor Donald J. Brown give a 
colloquium on the nonstandard analysis of hyper-finite economies (see [4] and [20]). 

You have heard of Bourbaki; so have I. I always thought that stuff would never be good for 
anything. Nevertheless, Bourbaki ultrafilters appear in a paper in the Journal of Economic Theory 
[17]. The authors, A. Kirman and D. Sondermann, use ultrafilters to generalize Kenneth Arrow’s 
fundamental theorem of welfare economics [1]. 

Mathematics appears in all parts of economics, especially in mathematical economics and in 
econometrics. Mathematical economics is like mathematical physics: it is theoretical, nonempirical, 
sometimes speculative. For instance, Alfred Marshall hypothesized the existence of certain curves 
(supply and demand schedules) whose intersections determine commodity prices. Very pretty, but 
he didn’t show how to measure or predict numerical values for specific supply-demand schedules. 

In general, measurement and prediction belong to econometrics. As you would expect, economet- 
rics uses a lot of mathematical statistics, probability theory, and numerical analysis. A Nobel prize 
was given in 1980 to Lawrence Klein for his work in building econometric models. 

In 1969 the first Nobel prize in economics was given to Ragnar Frisch and Jan Tinbergen “for 
having developed and applied dynamic models for the analysis of economic processes”; in other 
words, the prize was given for mathematics applied to economics. Later, I'll show you a list of all 
the Nobel prizes in economics, and you'll see that at least 7 of the 12 prizes given from 1969 
through 1981 were given for work that could be called applied mathematics. In fact, in 1975 a 
Nobel prize in economics was given to Leonid Kantorovich, who is a mathematician. 

In 1969 a spokesman for the Nobel foundation welcomed the new prize subject, economics, as 
“the oldest of the arts, the youngest of the sciences.” It might be fair to say that economics 
became a science when it started making significant use of mathematics. When was that? I’d say 
the nineteenth century. 

In 1817 the stockbroker David Ricardo proved a theorem that establishes an astounding 
principle of international economics. Ricardo proved mathematically that free trade is (under 
certain assumptions) advantageous to consumers in all nations. 
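
A small numerical illustration may help. The labour figures below are the ones usually quoted from Ricardo's own England-Portugal example; they do not appear in this talk, and the Python names are hypothetical. The sketch assumes that each country's labour force is exactly what it needs to make one unit of cloth and one unit of wine at home, and then compares world output when each country specializes in the good it makes at the lower opportunity cost.

```python
from fractions import Fraction

# Labour needed to make one unit of each good (figures usually quoted
# from Ricardo's example; an assumption here, not data from the talk).
LABOUR = {
    "England": {"cloth": 100, "wine": 120},
    "Portugal": {"cloth": 90, "wine": 80},
}


def opportunity_cost(country, good, other):
    """Units of `other` that must be given up to make one more unit of `good`."""
    return Fraction(LABOUR[country][good], LABOUR[country][other])


def world_output(plan):
    """Total output when each country puts its whole labour force into plan[country]."""
    totals = {"cloth": Fraction(0), "wine": Fraction(0)}
    for country, good in plan.items():
        workforce = sum(LABOUR[country].values())  # enough for one unit of each good
        totals[good] += Fraction(workforce, LABOUR[country][good])
    return totals


# Without trade, each country makes one cloth and one wine: 2 of each in total.
no_trade = {"cloth": Fraction(2), "wine": Fraction(2)}

# Cloth costs England 5/6 of a wine but costs Portugal 9/8 of a wine,
# so England specializes in cloth and Portugal in wine.
with_trade = world_output({"England": "cloth", "Portugal": "wine"})
print(with_trade["cloth"], with_trade["wine"])
```

Under these assumptions world output rises from 2 to 11/5 units of cloth and from 2 to 17/8 units of wine, even though Portugal needs less labour than England for both goods.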

Alfred Marshall was another great nineteenth-century economist. Marshall started out to be a 
mathematician; he was First Wrangler in mathematics at Cambridge. Although his work is seldom 
explicitly mathematical, any mathematician reading it can sense its mathematical core. Marshall 
was a teacher of John Maynard Keynes, whose work contains plenty of explicit mathematics. But, 
at least to my taste, Marshall’s work shows more mathematical insight. 

As Gerard Debreu wrote in his Theory of Value [7], mathematical economics has become 
increasingly geometric and qualitative. If we want precise numerical information, we have to turn 
to econometrics. Whereas Marshall drew his supply-and-demand curves in a nonnumerical, 
qualitative way, the econometrician would have the hard problem of giving numerical values for 
these curves for specific commodities at specific times. 

An example of econometrics appears in an article [29] by the mathematician Jacob Schwartz. 
He used a Wharton econometric model for residential housing. You can see it in Fig. 1. There you 
see a typical awful equation of econometrics; please don’t try to understand it. I just want you to 
see what it looks like. It predicts the rate of investment in residential housing as a function of 
various factors (the numerical subscripts refer to time lags). The coefficients (58.26, 0.0249, etc.) 
come from a numerical curve fit to data for 1948-1964; the model was published in 1967. 

There is an old Chinese proverb: It is always difficult to predict—especially the future. For that 
reason econometrics is difficult. The Wharton model of 1967 “predicts” housing starts for 
1948-1964, not for the future. In general, econometric models are not laws of nature like f = ma 
or E = mc²; they are empirical studies whose predictive value depends on the constancy of the 
underlying relationships. 
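
As a rough sketch of how such a fitted equation is used, the Python function below evaluates the Wharton housing equation of Fig. 1 with its published coefficients. The function and argument names are hypothetical, the input values are made up purely for illustration, and the caller is assumed to supply each explanatory variable already lagged as the model requires.

```python
def housing_investment(income, price_ratio_lagged, rate_spread_lagged, starts_lagged):
    """Evaluate the fitted Wharton equation for residential housing investment.

    income             -- total disposable income Y
    price_ratio_lagged -- (housing price / rental price), lagged
    rate_spread_lagged -- (long-term rate - short-term rate), lagged
    starts_lagged      -- rate of housing starts, lagged
    Coefficients are the curve-fit values quoted in the figure.
    """
    return (58.26
            + 0.0249 * income
            - 45.52 * price_ratio_lagged
            + 1.433 * rate_spread_lagged
            + 0.0851 * starts_lagged)


# Made-up inputs, only to show the mechanics of a "prediction".
base = housing_investment(400.0, 1.0, 1.0, 100.0)
wider_spread = housing_investment(400.0, 1.0, 2.0, 100.0)
print(base, wider_spread - base)
```

Because the equation is linear, each coefficient is the predicted change in investment per unit change in its variable; the difference printed above is just the spread coefficient. Nothing in the arithmetic guarantees that the same coefficients remain valid outside the 1948-1964 fitting period.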


1983] MATHEMATICAL METHODS OF ECONOMICS 231 


1967 Wharton econometric model (for 1948-1964) 


I_h = 58.26 + 0.0249 Y - 45.52 (p_h / p_r)_{-3} + 1.433 (i_L - i_s)_{-3} + 0.0851 (I_h^s)_{-1} 


I_h = rate of investment ($10^7) in residential housing 
per quarter (3 months) 
Y = total disposable income 
p_h = average housing price 
p_r = average rental price 
i_L = long-term interest rate 
i_s = short-term interest rate 
I_h^s = rate of housing starts 


Negative subscripts denote time lags. 


FIG. 1. 


What Do Economists Think of Mathematics? That question has had different answers at 
different times. Now the answer would be overwhelmingly favorable, if not unanimous. But not so 
in the old days. Adam Smith published his great book Wealth of Nations in 1776. It is readable, 
fascinating, and important; but it contains almost no mathematics. 

I told you the great nineteenth-century economist Alfred Marshall had been First Wrangler in 
mathematics at Cambridge. Later, he talked about the role mathematics played in his work: 


I had a growing feeling in the later years of my work at the subject that a good mathematical 
theorem dealing with economic hypotheses was very unlikely to be good economics: and I went 
more and more on the rules—(1) Use mathematics as a shorthand language, rather than as an 
engine of inquiry. (2) Keep to them till you have done. (3) Translate into English. (4) Then 
illustrate by examples that are important in real life. (5) Burn the mathematics. (6) If you can’t 
succeed in 4, burn 3. This last I did often. —quoted in [31], p. 307. 


So Marshall practiced mathematics as a secret vice; he was a closet mathematician. His most 
famous student was John Maynard Keynes. At Cambridge, Keynes took his degree in mathematics*. 
In 1921 Keynes published his Treatise on Probability. Keynes’s great books on economics 
contain many equations. By the time of Lord Keynes mathematics was not a secret vice but a 
public virtue. 

A living disciple of Keynes, Harvard Professor John Kenneth Galbraith, regards mathematics 
with scepticism. One of Galbraith’s more entertaining books is called Economics, Peace, and 
Laughter. Commenting on the models of mathematical economics, he says this: 


Moreover, the models so constructed, though of no practical value, serve a useful academic 
function. The oldest problem in economic education is how to exclude the incompetent... . 
The requirement that there be an ability to master difficult models, including ones for which 
mathematical competence is required, is a highly useful screening device. 


Not satisfied with this comment, Galbraith adds a dour footnote: 


There can be no question, however, that prolonged commitment to mathematical exercises in 
economics can be damaging. It leads to the atrophy of judgment and intuition... . 


John Galbraith does not stand alone. He tells this story about Paul Samuelson, a superb 
applied mathematician and winner of the Nobel Prize for work in mathematical economics: 


*While studying for the Tripos, Keynes wrote to his friend B. W. Swithinbank on 18 April 1905: “I am 
soddening my brain, destroying my intellect, souring my disposition in a panic-stricken attempt to acquire the 
rudiments of the Mathematics.” See R. F. Harrod [13], p. 130. 
Professor Samuelson, in his presidential address to the American Economic Association several 
years ago, noted that the three previous presidential addresses had been devoted to a 
denunciation of mathematical economics and that the most trenchant had encouraged the 
audience to standing applause. 


Well! And skepticism about mathematics is not confined to this continent. Galbraith says: 


Once when I was in Russia on a visit to Soviet economists, I spent a long afternoon attending a 
discussion on the use of mathematical models in plan formation. At the conclusion an elderly 
scholar, who had also found it very heavy going, asked me rather wistfully if I didn’t think 
there was still a “certain place” for the old-fashioned Marxian formulation of the labor theory 
of value. 


The old Russian scholar must have sighed when a Nobel prize in economics was given to 
Leonid Kantorovich, a mathematician. Kantorovich got the prize for developing the mathematical 
theory of linear programming and for applying it to the economic problem of optimum allocation 
of resources. He would have gone a lot farther with linear programming if he hadn’t run into 
trouble from the orthodox Marxians, who objected to the use of the idea of prices. Dantzig tells 
the story in his book [6], p. 23. 

Among the Nobel Laureates in economics, some, like Kantorovich, solved problems in 
economics by inventing new mathematics; others made much use of known mathematics. Look at 
the list of Nobel prizes in economics, Fig. 2. I’ve put asterisks by seven of the twelve prize years to 
indicate work that is heavily mathematical. 


Nobel Prizes in Economics 


1969* Frisch, Ragnar and Tinbergen, Jan—“for having developed and applied dynamic models for the 
analysis of economic processes.” 


1970* Samuelson, Paul—“for the scientific work through which he has developed static and dynamic 
economic theory and actively contributed to raising the level of analysis in economic science.” 


1971 Kuznets, Simon—“for his empirically founded interpretation of economic growth which has led to 
new and deepened insight into the economic and social structure and process of development.” 


1972* Hicks, Sir John R. and Arrow, Kenneth J.—“for their pioneering contributions to general economic 
equilibrium theory and welfare theory.” 


1973 Leontief, Wassily—“for the development of the input-output method and for its application to 
important economic problems.” 


1974 Myrdal, Gunnar and Von Hayek, Friedrich August—“for their pioneering work in the theory of 
money and economic fluctuations and for their penetrating analysis of the interdependence of 
economic, social and institutional phenomena.” 


1975* Kantorovich, Leonid and Koopmans, Tjalling—“for their contributions to the theory of optimum 
allocation of resources.” 


1976* Friedman, Milton—“for his achievements in the fields of consumption analysis, monetary history and 
theory and for his demonstration of the complexity of stabilization policy.” 


1977 Ohlin, Bertil and Meade, James—“for their pathbreaking contributions to the theory of international 
trade and international capital movements.” 


1978 Simon, Herbert A.—“for his pioneering research into the decision-making process within economic 
organizations.” 


1979 Lewis, Arthur and Schultz, Theodore—for studies of human capital. 
1980* Klein, Lawrence—for computer models designed to forecast economic changes. 


1981* Tobin, James—for mathematical models of investment decisions. 


*Asterisks indicate very mathematical work. 


FIG. 2. 
Seven out of twelve Nobel prizes: not a bad score for mathematics. And some of this 
mathematics has freshness and charm. For example, let me show you a theorem that won a Nobel 
prize: the Possibility Theorem of Kenneth Arrow. 

In 1951 Kenneth Arrow published a little book called Social Choice and Individual Values. He 
was thinking about a problem of welfare economics: Confronted by numerous conflicting special 
interests, how should the government make decisions? 

Use old-fashioned majority rule, you say. That’s the democratic way, isn’t it? That’s the rational 
way. 


Let’s see. Suppose we have 3 alternatives: vanilla (V), chocolate (C), and strawberry (S). And 
suppose we have 9 voters, each with his own individual values. For example, one individual may 
like vanilla better than chocolate (V > C), and he may like chocolate better than strawberry 
(C > S); then, by the way, he must like vanilla better than strawberry (V > S) if his individual 
values are rational. Another individual may prefer strawberry to vanilla (S > V), vanilla to 
chocolate (V > C), and therefore strawberry to chocolate (S > C). And so on. 

If all of our nine voters have definite flavor preferences, the voters constitute 6 special-interest 
groups, corresponding to the six ways of ranking 3 flavors. For example, we might have the 
following tabulation: 


Individual values     Number of individuals 

V > C > S             2 
S > V > C             2 
C > S > V             2 
V > S > C             1 
C > V > S             1 
S > C > V             1 


Now comes the general election. Here are the results: 


V > C by a majority of 5 to 4 
C > S by a majority of 5 to 4 


and, what’s this? 
S > V by a majority of 5 to 4. 


But that’s crazy: V > C and C > S should imply V > S, not S > V. (This is an example of 
Condorcet’s paradox.) 
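
The election can be replayed mechanically. The following Python sketch tallies each pairwise contest. The group sizes (2, 2, 2, 1, 1, 1) are an assumption, chosen to total nine voters and to reproduce the 5-to-4 majorities quoted above; the function names are illustrative only.

```python
from itertools import combinations

# Assumed group sizes: nine voters in all, consistent with the 5-to-4 results.
ballots = {
    ("V", "C", "S"): 2,
    ("S", "V", "C"): 2,
    ("C", "S", "V"): 2,
    ("V", "S", "C"): 1,
    ("C", "V", "S"): 1,
    ("S", "C", "V"): 1,
}

def votes_for(a, b, ballots):
    """Number of voters who rank alternative a above alternative b."""
    return sum(n for ranking, n in ballots.items()
               if ranking.index(a) < ranking.index(b))

def pairwise_results(ballots):
    """Map each (winner, loser) pair to its (votes for, votes against) split."""
    flavors = sorted({f for ranking in ballots for f in ranking})
    results = {}
    for a, b in combinations(flavors, 2):
        ab, ba = votes_for(a, b, ballots), votes_for(b, a, ballots)
        if ab > ba:
            results[(a, b)] = (ab, ba)
        elif ba > ab:
            results[(b, a)] = (ba, ab)
    return results

results = pairwise_results(ballots)
for (winner, loser), (won, lost) in sorted(results.items()):
    print(f"{winner} > {loser} by a majority of {won} to {lost}")
```

Each of the three contests is decided 5 to 4, and the three winners chase one another in a cycle, so no flavor beats both of the others.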

No wonder Congress is confused. You see the problem. So did Arrow, and he wondered if 
there was any way out. 

There is one way out: Hitler’s way. Pick one individual, call him der Führer, and do what he 
says. Then all the government’s preferences can be nice and transitive, and too bad for you if you 
don’t like it. 

Is there any rational way to make social choices besides dictatorship? To this basic question of 
welfare economics, Kenneth Arrow gave an astonishing answer: No. 


ARROW’S THEOREM. Suppose we have a function that makes rational (transitive) social choices as 
a function of rational individual values that rank (by preference or indifference) three or more 
alternatives. Assume that the social-choice function has two properties: 

(i) If all individuals prefer alternative a to alternative b, then society shall prefer a to b. 

(ii) The social choice between any two alternatives a and b shall depend only on the individual 
values between a and b (and should not depend on any third alternative c). 


Then Arrow’s theorem says there exists a dictator—a single individual whose preferences 
become social choices. 


234 JOEL FRANKLIN [April 


In a minute I'll write this theorem symbolically, in terms of matrices. But first I want to explain 
the two assumptions. The first is a principle of unanimity: If everyone prefers vanilla to chocolate, 
so should society. The second is a principle of relevance: Society’s choice between vanilla and 
chocolate should depend on how people feel about vanilla and chocolate, not on how they feel 
about strawberry. 

If you wish, you can write Arrow’s theorem in terms of matrices. Let $a_{ij} = 1$ if i is preferred to 
j; let $a_{ij} = -1$ if j is preferred to i; let $a_{ij} = 0$ if neither is preferred to the other. If there are m 
alternatives (flavors), then the numbers $a_{ij}$ constitute an m × m skew-symmetric matrix, A. In a 
rational preference ordering, if i is preferred to j, and if j is preferred to k, then i must be 
preferred to k. For the matrix A this says: If $a_{ij} = 1$ and $a_{jk} = 1$, then $a_{ik} = 1$. We shall also 
require $a_{ik} = 1$ if $a_{ij} = 0$ and $a_{jk} = 1$ or if $a_{ij} = 1$ and $a_{jk} = 0$. If this is so, then we'll call A a 
rational preference matrix. 


EXAMPLE. Suppose we prefer flavor 3 to flavor 1 and flavor 2, which we like equally. Then this 
is our rational preference matrix: 

$$A = \begin{pmatrix} 0 & 0 & -1 \\ 0 & 0 & -1 \\ 1 & 1 & 0 \end{pmatrix}.$$

EXAMPLE. Suppose we prefer flavor 1 to flavor 2, flavor 2 to flavor 3, and flavor 3 to flavor 1. 
That is irrational, and so the preference matrix is irrational: 

$$A = \begin{pmatrix} 0 & 1 & -1 \\ -1 & 0 & 1 \\ 1 & -1 & 0 \end{pmatrix}.$$

Look: $a_{12} = 1$ and $a_{23} = 1$, but $a_{13} \neq 1$. 

Individual values and social choice: Suppose there are n individuals and m alternatives. The 
individual values are expressed by n rational preference matrices A(1),..., A(n). A social choice is 
a rational preference matrix A. We’re looking for a function F mapping $P_m^n$ into $P_m$, where $P_m$ is 
the set of m × m rational preference matrices and $P_m^n$ is the n-fold Cartesian product: 

$$A = F(A(1), \ldots, A(n)).$$


EXAMPLE. For majority rule, the function F is defined as follows: 

$$a_{ij} = \operatorname{sign}\,[\,a_{ij}(1) + \cdots + a_{ij}(n)\,] \qquad (i, j = 1, \ldots, m).$$

If m > 2, majority rule may give irrational social choices, as we saw in the example of vanilla, 
chocolate, and strawberry. So this F takes values outside $P_m$; but this F does satisfy the 
assumptions of unanimity and relevance: 

(1) $a_{ij} = 1$ if $a_{ij}(k) = 1$ for all $k = 1, \ldots, n$ 
(2) $a_{ij}$ = a function of $a_{ij}(1), \ldots, a_{ij}(n)$. 
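
As a rough illustration, the sketch below builds preference matrices in Python, applies the majority-rule function F, and tests the result for rationality. The encoding V = 0, C = 1, S = 2, the helper names, and the group sizes (2, 2, 2, 1, 1, 1) are assumptions made for the example.

```python
from itertools import product

def preference_matrix(ranking):
    """Skew-symmetric matrix of a strict ranking (alternatives 0..m-1, best first)."""
    m = len(ranking)
    pos = {alt: place for place, alt in enumerate(ranking)}
    return [[0 if i == j else (1 if pos[i] < pos[j] else -1) for j in range(m)]
            for i in range(m)]

def is_rational(a):
    """Check a_ik = 1 whenever (a_ij, a_jk) is (1, 1), (0, 1) or (1, 0)."""
    m = len(a)
    for i, j, k in product(range(m), repeat=3):
        weakly_ordered = a[i][j] >= 0 and a[j][k] >= 0
        one_strict = a[i][j] + a[j][k] >= 1
        if weakly_ordered and one_strict and a[i][k] != 1:
            return False
    return True

def sign(v):
    return (v > 0) - (v < 0)

def majority_rule(matrices):
    """Entrywise sign of the sum of the individual preference matrices."""
    m = len(matrices[0])
    return [[sign(sum(a[i][j] for a in matrices)) for j in range(m)]
            for i in range(m)]

V, C, S = 0, 1, 2
# Assumed group sizes, consistent with the 5-to-4 majorities in the text.
groups = [([V, C, S], 2), ([S, V, C], 2), ([C, S, V], 2),
          ([V, S, C], 1), ([C, V, S], 1), ([S, C, V], 1)]
voters = [preference_matrix(r) for r, count in groups for _ in range(count)]
social = majority_rule(voters)
print(social)
print(all(is_rational(a) for a in voters), is_rational(social))
```

Every individual matrix passes the rationality test, while the social matrix is exactly the cyclic matrix of the second example, so this F maps rational inputs to an irrational output.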


Arrow’s theorem now takes this form: Let F be a function mapping $P_m^n$ into $P_m$. Suppose m > 2, 
and suppose the function F satisfies equations (1) and (2). Then there exists an integer d such that 
$a_{ij} = 1$ if $a_{ij}(d) = 1$. (The integer d depends on F but not on the matrices A(1),..., A(n).) 

By the way, there are no restrictions on the number of individuals, n. In marriage, n = 2. Then 
Arrow’s theorem says: Either the husband or the wife must be a dictator, or there must be 
irrational choices. Experience seems to bear this out. 

Arrow’s theorem talks about rational (transitive) preference orderings. This raises a question in 
combinatoric analysis: How many rational preference orderings of m alternatives are there? The 
answer has appeared in [12]. For large m the number of rational preference orderings behaves like 

$$\tfrac{1}{2}\, m!\, (\log 2)^{-m-1}.$$
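
The estimate can be compared with exact counts. The sketch below assumes that the orderings being counted are rankings with ties allowed (preference or indifference) and that log denotes the natural logarithm. The recurrence used for the exact count is a standard one, not taken from [12], and the function names are illustrative.

```python
from math import comb, factorial, log

def weak_orderings(m):
    """Count rankings of m alternatives in which ties are allowed."""
    counts = [1]  # exactly one way to order zero alternatives
    for size in range(1, m + 1):
        # Pick the k alternatives tied for first place, then order the rest.
        counts.append(sum(comb(size, k) * counts[size - k]
                          for k in range(1, size + 1)))
    return counts[m]

def asymptotic_estimate(m):
    """The large-m estimate (1/2) m! (log 2)^(-m-1)."""
    return 0.5 * factorial(m) * log(2) ** (-m - 1)

for m in (3, 5, 10):
    print(m, weak_orderings(m), round(asymptotic_estimate(m), 2))
```

For three flavors the exact count is 13: six strict rankings, six rankings with one tied pair, and total indifference.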
The mathematics of Arrow’s theorem is very different from mathematics like linear programming. 
Here we have a rather ordinary looking problem: 

For i = 1,..., m and j = 1,..., n we are given the real numbers $a_{ij}$, $b_i$, $c_j$. We wish to find 
numbers $x_j \ge 0$ such that 

$$\sum_{j=1}^{n} c_j x_j = \text{minimum}$$

over all solutions of the linear equations 

$$\sum_{j=1}^{n} a_{ij} x_j = b_i \qquad (i = 1, \ldots, m).$$

That is the canonical form of linear programming. In terms of matrices and vectors, it looks 
like this: 

$$Ax = b, \qquad x \ge 0, \qquad c^T x = \min.$$

The problem is interesting only if the linear system Ax = b has more than one solution, so we 
usually suppose rank A = m < n. Then the crucial assumption is the sign constraint $x \ge 0$ (all 
components of x must be nonnegative). 

Kantorovich in Russia and Dantzig in the United States independently developed linear 
programming to solve economic logistical problems. The history of their work appears in 
Dantzig’s book [6]. 

The most famous early problem of linear programming, the diet problem, first appeared in the 
Journal of Farm Economics [33]. The problem is to design a nutritionally adequate diet at 
minimum cost. The author, George Stigler, won the 1982 Nobel Prize in Economics. 

Suppose $a_{ij}$ is the amount of nutrient i in one unit of food j. (For instance, one entry might be the 
amount of a B vitamin in one gram of wheat bread.) Let $b_i$ be the minimum daily requirement of 
nutrient i, and let $c_j$ be the cost of one unit of food j. Let $x_j$ be the amount of food j in a daily 
diet. Then we require 

$$\sum_{j=1}^{n} a_{ij} x_j \ge b_i \quad (i = 1, \ldots, m), \qquad x_j \ge 0, \qquad \sum_{j=1}^{n} c_j x_j = \text{minimum}.$$

This is a linear program in standard form. To put it in canonical form, we must replace the m 
linear inequalities by equations. We do that by introducing m new unknowns $z_i \ge 0$: 

$$\sum_{j=1}^{n} a_{ij} x_j - z_i = b_i \qquad (i = 1, \ldots, m).$$

The problem is now easy to solve by Dantzig’s simplex method. 

Linear programming has many uses in industry and banking. In 1981, a good popular article 
[2] appeared in Scientific American; I recommend its example on beer. An introduction to the use 
of linear programming for the optimization of bank investment portfolios appeared in the Monthly 
Review of the Federal Reserve Bank of Richmond (see [3] and [10], p. 3). Banks and oil companies 
make a lot of money with linear programming. 

But you and I are mathematicians; money means nothing to us. So let us speak of something 
more important—let’s talk about Chebyshev approximation. 

Suppose we are given a system of real linear equations, Ax = b, and suppose the system has no 
solution x. Typically, this occurs when we have more equations than unknowns. If we have m 
equations in n unknowns, the error in equation i is a function of the vector x: 


$$e_i = \sum_{j=1}^{n} a_{ij} x_j - b_i \qquad (i = 1, \ldots, m).$$

The problem of Chebyshev approximation is to find a vector x that minimizes the maximum 
absolute error: 

$$\min_{x}\ \max_{i}\ |e_i|.$$


That is a beautiful and important problem of approximation theory. Many things were known 
about Chebyshev approximation before 1959, but no one knew a good way to do it. Then Edward 
Stiefel discovered how to do it by linear programming (see [32] and [10], p. 8). Here’s how: 

Define a new unknown: $x_0 = \max |e_i|$ for i = 1,..., m. Then we shall have the uniform error 
bracket 

$$-x_0 \le \sum_{j=1}^{n} a_{ij} x_j - b_i \le x_0 \qquad (i = 1, \ldots, m).$$

The problem of Chebyshev is to choose $x_0, \ldots, x_n$ so as to minimize the maximum absolute error: 

Minimize $x_0$. 

That’s all there is to it—a finite number of linear inequalities in a finite number of unknowns, 
with a linear form to be minimized. That is a linear program in general form. It’s trivial to restate 
it in canonical form, and it’s routine to solve it numerically by the simplex method. 

The simplex method is perhaps the most important numerical method invented in the twentieth 
century. Experience with enormous industrial problems shows that the simplex method works fast. 
In problems with m equations in n unknowns, the computation time seems to be proportional 
to n. 

Why does the simplex method usually work so fast? No one knows, and this is one of the great 
unsolved problems of numerical analysis. At first glance, the computation time would seem to be 
proportional to the binomial coefficient $\binom{n}{m}$, which is the possible number of basic solutions 
of Ax = b. For m ≈ n/2, the binomial coefficient is almost as big as $2^n$, and this suggests the 
computing time could grow exponentially with n. Indeed, Victor Klee and George Minty [18] have 
constructed pathological cases for which that happens. But it never seems to happen in practice. 
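
To make the count of basic solutions concrete, here is a minimal brute-force sketch in Python. It is not the simplex method: it simply tries every choice of m columns of A, which is only feasible for tiny problems. The two-food, two-nutrient diet data are hypothetical, and the approach assumes the program has an optimal solution (true here because all costs are nonnegative), so that the minimum is attained at a basic feasible solution.

```python
from fractions import Fraction
from itertools import combinations

def solve_square(rows, rhs):
    """Solve a square system exactly by Gauss-Jordan elimination (None if singular)."""
    n = len(rows)
    aug = [[Fraction(v) for v in row] + [Fraction(r)] for row, r in zip(rows, rhs)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            return None
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [v / aug[col][col] for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [aug[r][n] for r in range(n)]

def best_basic_solution(a, b, c):
    """Minimize c.x over the basic feasible solutions of a x = b, x >= 0."""
    m, n = len(a), len(a[0])
    best, examined = None, 0
    for basis in combinations(range(n), m):
        examined += 1
        sub = [[a[i][j] for j in basis] for i in range(m)]
        xb = solve_square(sub, b)
        if xb is None or any(v < 0 for v in xb):
            continue  # singular basis or a negative component: not feasible
        x = [Fraction(0)] * n
        for j, v in zip(basis, xb):
            x[j] = v
        cost = sum(cj * xj for cj, xj in zip(c, x))
        if best is None or cost < best[0]:
            best = (cost, x)
    return best, examined

# Hypothetical diet: 2 x1 + x2 >= 8 and x1 + 3 x2 >= 9, cost 3 x1 + 4 x2,
# written in canonical form with slack unknowns z1, z2 in columns 3 and 4.
a = [[2, 1, -1, 0],
     [1, 3, 0, -1]]
b = [8, 9]
c = [3, 4, 0, 0]
(cost, x), examined = best_basic_solution(a, b, c)
print(cost, x, examined)
```

Here n = 4 and m = 2, so six candidate bases are examined. The count grows like the binomial coefficient, which is why enumeration is hopeless for large problems.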

A Russian mathematician named Khachian got around this problem by analyzing a quite 
different algorithm [16]. Khachian proved that his algorithm has computing time bounded by a 
constant, K, times n®°—which becomes smaller than 2”. Khachian’s proof is a triumph of 
theoretical computer science. But Khachian’s algorithm, in its present form, has little practical 
value: the constant K is enormous and so is the computing time. 

You can become famous by doing one of these two things: (1) show why the simplex method 
usually works as well as it does; (2) show how Khachian’s method can be made to work better 
than the simplex method in practice. [A persistent rumor says Stephen Smale has done (1).] 

Linear programming is important because it is the general mathematics of finite systems of 
linear inequalities. Linear programming is more general than real linear algebra, for this reason: 
Any real linear equation $\sum_j a_j x_j = b$ can be restated as a pair of linear inequalities: 

$$\sum_j a_j x_j \le b \quad \text{and} \quad \sum_j a_j x_j \ge b.$$


But the converse is false: You can’t restate a linear inequality as a finite number of linear 
equations. 

No mathematician doubts the importance of linear algebra. So linear programming must also 
be important, and perhaps you will agree that linear programming should be part of the basic 
undergraduate mathematics curriculum. Why should mathematics students have to pick up their 
linear programming from economists and chemical engineers and people like that? They should 
learn it from us, and they should learn it right. 

Marshall Hall has a section on linear programming in his book Combinatoric Analysis. There’s 
nothing odd about that; linear programming has many applications to combinatorics. For 
instance, look at this problem: 

We are given an n × n matrix of real numbers $a_{ij}$. We seek a permutation $j_1, \ldots, j_n$ that 
maximizes the sum 

$$s = a_{1 j_1} + a_{2 j_2} + \cdots + a_{n j_n}.$$


This problem is called the optimal-assignment problem. 


EXAMPLE. Suppose we’re given the matrix 

$$\begin{pmatrix} 7 & 2 & 6 \\ 3 & 9 & 1 \\ 8 & 4 & 5 \end{pmatrix}.$$

The sum s has six possible values. The largest is 

$$\max s = a_{13} + a_{22} + a_{31} = 6 + 9 + 8 = 23,$$

achieved for the permutation $(j_1, j_2, j_3) = (3, 2, 1)$. 
In general, we could solve the problem by calculating all the n! possible values for s, but that 
takes too long if n is large. A much faster algorithm is given by linear programming. 
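
For the 3 × 3 example the exhaustive search is easy to write down. The Python sketch below is the slow n! method, not the linear-programming algorithm; the function names are illustrative.

```python
from itertools import permutations

def best_assignment(a):
    """Try all n! permutations; return (largest sum, 0-based column choice per row)."""
    n = len(a)
    return max((sum(a[i][p[i]] for i in range(n)), p)
               for p in permutations(range(n)))

def permutation_matrix(p):
    """0-1 matrix with a 1 in column p[i] of row i."""
    n = len(p)
    return [[1 if j == p[i] else 0 for j in range(n)] for i in range(n)]

a = [[7, 2, 6],
     [3, 9, 1],
     [8, 4, 5]]
s, p = best_assignment(a)
print(s, tuple(j + 1 for j in p))   # 1-based column indices, as in the text
print(permutation_matrix(p))
```

The search visits 6 permutations here, but 10! is already more than three million, which is the point of looking for a faster method.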
We define the unknowns $x_{ij}$ as 1 if $j = j_i$, or 0 if $j \ne j_i$. Thus, $x_{ij}$ will tell us which component 
to pick from each row. For the preceding numerical example, we would have 

$$(x_{ij}) = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{pmatrix}.$$

In general, the integer unknowns $x_{ij}$ must satisfy the constraints 

$$x_{i1} + \cdots + x_{in} = 1 \quad (i = 1, \ldots, n)$$
$$x_{1j} + \cdots + x_{nj} = 1 \quad (j = 1, \ldots, n)$$
$$x_{ij} \ge 0 \quad (i, j = 1, \ldots, n).$$

Then we wish to maximize a linear form: 

$$s = \sum_{i,j} a_{ij} x_{ij} = \text{maximum}.$$


This is a problem in linear programming. H. W. Kuhn [19] has shown that it can be solved in 
O(n?) steps. 

You are right if you object that linear programming provides the optimal real solution $x_{ij}$, and 
these numbers might not be integers (we need all $x_{ij}$ = 0 or 1). But for the optimal-assignment 
problem the optimal solution over the integers $x_{ij}$ is also optimal over the real numbers $x_{ij}$. That’s 
not obvious, but it’s easy to prove. In general, however, linear programming over the integers is 
difficult. The optimal solution over integers is usually not optimal over real numbers. 

So much for combinatorics. Now let’s look at geometry. I’d like to show you how quadratic 
programming solves a problem stated in 1857 by J. J. Sylvester [34]: “It is required to find the 
least circle which shall contain a given set of points in the plane.” 

Suppose the given points are $a_1, \ldots, a_m$. We’re looking for a circle with the unknown center x 
and radius $\rho$. The given points are required to lie inside the circle: 

$$\|a_i - x\|^2 \le \rho^2 \qquad (i = 1, \ldots, m).$$

Then we want to choose x and $\rho$ so as to minimize $\rho$. 
We can replace the m quadratic inequalities by linear inequalities as follows. Introduce the 
unknown 

$$x_0 = \tfrac{1}{2}\left(\rho^2 - \|x\|^2\right).$$

Then the m inequalities become 

$$x_0 + a_i \cdot x \ge b_i \qquad (i = 1, \ldots, m),$$

where $b_i = \tfrac{1}{2}\|a_i\|^2$. Then we want to minimize $\rho^2$: 

$$2x_0 + \|x\|^2 = \text{minimum}.$$

Sylvester’s problem now has this form: First we require m linear inequalities: 

$$x_0 + a_{i1} x_1 + a_{i2} x_2 \ge b_i \qquad (i = 1, \ldots, m).$$

Then we want 

$$2x_0 + x_1^2 + x_2^2 = \text{minimum},$$
in which the quadratic terms constitute a positive definite form. This is a routine problem of 
quadratic programming. It can be solved numerically by an ingenious variant of the simplex 
method. This algorithm was discovered by a mathematician, Philip Wolfe, but it was published in 
an economics journal, Econometrica [36]. 
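
The substitution is easy to check numerically. The Python sketch below does not solve Sylvester’s problem and is not Wolfe’s algorithm; it only confirms, on hypothetical sample points and a small grid of candidate circles, that the quadratic constraint and its linearized form accept exactly the same circles. All names are illustrative.

```python
from fractions import Fraction
from itertools import product

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def inside_quadratic(a, x, rho):
    """Original constraint: ||a - x||^2 <= rho^2."""
    d = [ai - xi for ai, xi in zip(a, x)]
    return dot(d, d) <= rho * rho

def inside_linear(a, x, rho):
    """Linearized constraint: x0 + a.x >= b, x0 = (rho^2 - ||x||^2)/2, b = ||a||^2/2."""
    x0 = Fraction(rho * rho - dot(x, x), 2)
    return x0 + dot(a, x) >= Fraction(dot(a, a), 2)

points = [(0, 0), (2, 0), (1, 1), (1, -1)]          # hypothetical sample points
candidates = [((cx, cy), rho)
              for cx, cy, rho in product(range(-2, 4), range(-2, 4), range(0, 4))]
agree = all(inside_quadratic(a, x, rho) == inside_linear(a, x, rho)
            for a in points for x, rho in candidates)
print(agree)
```

Exact rational arithmetic is used so that points lying exactly on a circle are classified the same way by both tests.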

Why in an economics journal? Because Wolfe’s paper extended the work of some economists 
who were interested in the use of quadratic programming to make optimal investment decisions. 
Wolfe’s mathematical discovery solved a problem in economics. 

The theoretical basis of linear and nonlinear programming was published in 1902 by a 
mathematician named Julius Farkas. He gave a long, cumbersome proof of the following 


proposition, which you might call the alternative of linear inequalities (generalizing the Fredholm 
alternative of linear equations): 


THE FARKAS THEOREM. Let A be a given m X n real matrix, and let b be a given vector with m 
real components. Then one, and only one, of the following alternatives is true: 


(i) the system Ax = b has a solution $x \ge 0$ (all components $\ge 0$); 
(ii) the system of inequalities $y^T A \ge 0$ has a solution y satisfying $y^T b < 0$. 


Indeed, both alternatives can’t be true, for then we could deduce 

$$0 \le (y^T A)x = y^T(Ax) = y^T b < 0.$$


That’s easy; the hard part is to show that one of the alternatives must be true. A modern 
straightforward proof of the Farkas theorem relies on the separating-plane theorem for convex 
sets (see, e.g., [10], p. 56). 
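
A small numerical instance may help fix the two alternatives. The Python sketch below does not prove anything; it only verifies a proposed solution x for alternative (i) or a proposed vector y for alternative (ii). The matrix, right-hand sides, and function names are hypothetical.

```python
def mat_vec(a, x):
    """Compute the product A x."""
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in a]

def vec_mat(y, a):
    """Compute the row vector y^T A."""
    return [sum(y[i] * a[i][j] for i in range(len(a))) for j in range(len(a[0]))]

def solves_system(a, b, x):
    """Alternative (i): A x = b with every component of x nonnegative."""
    return all(xj >= 0 for xj in x) and mat_vec(a, x) == list(b)

def is_farkas_certificate(a, b, y):
    """Alternative (ii): y^T A >= 0 componentwise and y^T b < 0."""
    return (all(v >= 0 for v in vec_mat(y, a))
            and sum(yi * bi for yi, bi in zip(y, b)) < 0)

a = [[1, 1],
     [1, -1]]
print(solves_system(a, [2, 0], [1, 1]))           # alternative (i) holds for b = (2, 0)
print(is_farkas_certificate(a, [-1, 0], [1, 0]))  # alternative (ii) holds for b = (-1, 0)
```

With b = (-1, 0) the first equation asks two nonnegative numbers to sum to -1, and the vector y = (1, 0) certifies that this is impossible.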

The Farkas alternative has many uses outside mathematical economics. I hope to convince you 
that every mathematician should know the Farkas theorem and should know how to use it. For 
example, let me show how to use the Farkas theorem to prove the fundamental theorem of finite 
Markov processes. 


THEOREM (Markov). Suppose $p_{ij} \ge 0$, and suppose 

$$\sum_{i=1}^{n} p_{ij} = 1 \qquad (j = 1, \ldots, n).$$

Then there exist numbers $x_i \ge 0$ satisfying 

$$\sum_{j=1}^{n} p_{ij} x_j = x_i \quad (i = 1, \ldots, n), \qquad \sum_{j=1}^{n} x_j = 1.$$

The proof of a special case of this theorem occupies several pages in Feller’s book on 
probability ([8], pp. 428-432). The general case is usually proved by using the Perron-Frobenius 
maximum principle for positive matrices or by using the Brouwer fixed-point theorem. Instead, we 
can give an elementary proof using the Farkas theorem ([10], p. 58): 

First, we state Markov’s assertion as one Farkas alternative: 

(i) There exists a vector $x \ge 0$ satisfying the n + 1 linear equations 

$$\sum_{j=1}^{n} (p_{ij} - \delta_{ij}) x_j = 0 \quad (i = 1, \ldots, n), \qquad \sum_{j=1}^{n} x_j = 1,$$

where $\delta_{ij}$ is the Kronecker delta. 
Second, we state the other Farkas alternative: 
(ii) There exist numbers $y_1, \ldots, y_n, y_{n+1}$ satisfying the inequalities 

$$\sum_{i=1}^{n} y_i (p_{ij} - \delta_{ij}) + y_{n+1} \ge 0 \quad (j = 1, \ldots, n), \qquad y_{n+1} < 0.$$

Alternative (ii) implies the strict inequalities 

$$\sum_{i=1}^{n} y_i p_{ij} > y_j \quad \text{for all } j.$$

But 

$$\max_i y_i \ge \sum_{i=1}^{n} y_i p_{ij}$$

because we assumed $p_{ij} \ge 0$ and $\sum_i p_{ij} = 1$, so we find 

$$\max_i y_i > y_j \quad \text{for all } j.$$

That is impossible, so alternative (ii) is false. 

Now Farkas tells us that alternative (i) is true: Markov’s theorem is proved. That was easy, 
wasn’t it? 
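
The theorem only asserts existence, but for a small example the vector x is easy to compute. The Python sketch below uses repeated multiplication by the matrix (power iteration), which is a different technique from the Farkas argument and is assumed to converge only because the hypothetical example matrix has strictly positive entries. The function name is illustrative.

```python
def stationary_vector(p, steps=200):
    """Iterate x <- P x from the uniform vector; columns of P must each sum to 1."""
    n = len(p)
    x = [1.0 / n] * n
    for _ in range(steps):
        x = [sum(p[i][j] * x[j] for j in range(n)) for i in range(n)]
    return x

# Hypothetical column-stochastic matrix: each column sums to 1.
p = [[0.9, 0.5],
     [0.1, 0.5]]
x = stationary_vector(p)
residual = max(abs(sum(p[i][j] * x[j] for j in range(2)) - x[i]) for i in range(2))
print(x, residual)
```

For this matrix the equations can also be solved by hand: 0.5 x_2 = 0.1 x_1 and x_1 + x_2 = 1 give x = (5/6, 1/6).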

Now let me tell you about the theory of games and economic behavior. A book with that title 
was published in 1944 by the mathematician John von Neumann and the economist Oskar 
Morgenstern [25]. Economists consider this book an epoch-making contribution to economics. 

Fine, you say, but what has it done for mathematics? 

This book, along with von Neumann’s earlier work [24] on game theory, has given us some 
stimulating problems and some important results. For example, look at this theorem on matrices: 


THEOREM (VON NEUMANN). Let A be a real m × n matrix. Let vectors x and y range over the 
sets 

$$\sum_{i=1}^{m} x_i = 1, \quad x_i \ge 0; \qquad \sum_{j=1}^{n} y_j = 1, \quad y_j \ge 0.$$

Then 

$$\min_{y} \max_{x}\ x^T A y = \max_{x} \min_{y}\ x^T A y.$$


This theorem is no platitude. As a rule, mixed extrema are not equal, as the following example 
shows. Suppose x and y range over the sets $0 \le x \le 1$, $0 \le y \le 1$. Then 

$$\min_{y} \max_{x}\ (x - y)^2 = \tfrac{1}{4}$$

but 

$$\max_{x} \min_{y}\ (x - y)^2 = 0.$$
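
The two values can be checked on a grid. The Python sketch below samples x and y at 101 equally spaced points of [0, 1]; the grid size is an arbitrary choice, and the computation is a numerical check, not a proof.

```python
def phi(x, y):
    return (x - y) ** 2

grid = [i / 100 for i in range(101)]

# For each y take the worst x, then pick the best y.
min_max = min(max(phi(x, y) for x in grid) for y in grid)

# For each x take the best y, then pick the best x.
max_min = max(min(phi(x, y) for y in grid) for x in grid)

print(min_max, max_min)
```

The inner maximum over x is the larger of y^2 and (1 - y)^2, which is smallest at y = 1/2. The inner minimum over y is always 0, attained at y = x.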


Von Neumann’s minimax theorem is the fundamental result in the theory of zero-sum 
two-person games. But that’s not the point; the point is, it’s good mathematics. Von Neumann 
proved the minimax theorem by using the Brouwer fixed-point theorem. His proof is nonelementary 
and nonconstructive. Later, the mathematician George Dantzig gave an elementary, constructive 
proof by using the dual simplex method of linear programming. 

Following von Neumann, mathematical economists make much use of the fixed-point theorems. 
Their favorite seems to be the fixed-point theorem of Kakutani [15]. 

As a young mathematician at the Institute for Advanced Study, Shizuo Kakutani discovered a 
generalization of the Brouwer fixed-point theorem. Kakutani’s work was motivated by problems 
in economic game theory. His theorem has great mathematical novelty. It speaks of point-to-set 
mappings: 


THEOREM (Kakutani). Let X be a closed, bounded, convex set in $R^n$. For every point x in X, let 
F(x) equal a nonempty convex subset of X. Assume that the graph 

$$\{(x, y) : y \in F(x)\}$$

is closed. Then some point $x^*$ in X satisfies $x^* \in F(x^*)$. 


The image of each point x is a convex set $F(x) \subset X$. The theorem says some point $x^*$ lies in its 
image $F(x^*)$. Figure 3 illustrates this. Kakutani’s theorem is novel because it talks about 
set-valued functions. 


FIG. 3. 


If every set F(x) contains just one point, the closed-graph assumption is equivalent to the 
continuity of the function F(x), and then Kakutani’s theorem reduces to the Brouwer fixed-point 
theorem. Kakutani proved his theorem by using the Brouwer theorem. 

A private survey indicates that 96% of all mathematicians can state the Brouwer fixed-point 
theorem, but only 5% can prove it. Among mathematical economists, 95% can state it, but only 2% 
can prove it (and these are all ex-topologists). This dangerous situation will soon be remedied. 
Within the last two years, John Milnor [22] and C. A. Rogers [27] have produced elementary 
proofs, using nothing more advanced than calculus. These proofs are so easy that I can 
understand them [10], and certainly you can. 

While 96% of mathematicians can state the Brouwer fixed-point theorem, only 7% can state the 
Kakutani theorem. This situation is also dangerous or, at least, wasteful. The Kakutani theorem 
has many potential applications outside economics; these applications should be made. Now that 
we can all understand the Brouwer theorem, we can also understand the Kakutani theorem, so 
nothing can stop us. 

In the application of Kakutani’s theorem to many-person game theory, the point x denotes a 
collection of mixed strategies and the set-valued function denotes the sets of optimal mixed 
strategies. The inclusion $x \in F(x)$ characterizes an equilibrium solution of the game. The 
Kakutani theorem is thus the perfect tool for proving J. F. Nash’s fundamental theorem [23] on 
n-person games. 

Professor H. F. Bohnenblust once told me something about research. He had supervised many 
successful Ph.D. thesis projects, and a few unsuccessful ones. He said this: The unsuccessful 
projects start with some famous old problem (prove the Riemann hypothesis) and then look for a 
method to solve it. The successful projects start with some new method and then look for a problem. 

Let’s take Bohnenblust’s advice. Let’s start with linear programming and look for a problem. 
Here’s a good one: the problem of moments in probability theory. 

Suppose we are given a collection of real-valued continuous functions $a_i(t)$ of a point t in a 
Euclidean space. We are given a closed set Ω in that space, and we’re given a collection of real 
numbers $b_i$. The problem is to find a probability distribution function x(t) satisfying the moment 
equations 

$$\int_{\Omega} a_i(t)\, dx(t) = b_i \quad \text{for all } i,$$

where we require $dx(t) \ge 0$ and 

$$\int_{\Omega} dx(t) = 1.$$


This problem has many applications in geophysics and in other sciences. It has an extensive 
mathematical theory (see, for instance, Shohat and Tamarkin [30]). So what is left for you and me 
to do here? Well, for one thing, we could devise a good numerical method. At least, that will 
please our colleagues in geophysics. 

Suppose we’re given a finite number of moments, which is the usual case in applications. And 
suppose we use some numerical scheme to approximate the integrals by finite sums. Then we get a 
finite set of linear equations in a finite set of unknowns: 

$$\sum_{j=1}^{n} a_{ij} x_j = b_i \qquad (i = 1, \ldots, m).$$

Now we’re looking for the numbers $x_1, \ldots, x_n$; they will constitute a finite set of probabilities, 
satisfying 

$$x_j \ge 0 \quad (j = 1, \ldots, n), \qquad \sum_{j=1}^{n} x_j = 1.$$

So we want to solve m + 1 linear equations in n unknowns $x_j \ge 0$. Ah! We recognize a 
problem in linear programming. For this we have an existence theorem, the Farkas theorem, and a 
numerical method, the simplex method. 

The simplex method will tell us if no solution exists, or it will compute a solution x if solutions 
do exist. For n > m + 1 we can’t expect the solution x to be unique. We are free to impose any 
minimum condition of the form 

$$\sum_{j=1}^{n} c_j x_j = \text{minimum}.$$

We note that the original problem with a finite number of moments usually doesn’t have a unique 
solution x(t), so the freedom to impose an extra condition is physically natural and mathematically 
necessary. 

Fine, you say. All right for some people but not for you. You are a pure mathematician, and 
numerical methods bore you. What you'd like is a little solid theory—something you can get your 
teeth into. 

OK, I’m with you. Let’s prove a great theorem together. Let’s give a new, elementary proof of a 
famous theorem of F. Hausdorff [14]. The proof will use a method of mathematical economics, the 
Farkas theorem. 

Hausdorff studied the moment problem 


(3)   $$\int_0^1 t^k\, dx(t) = b_k \qquad (k = 0, 1, \ldots).$$

He asked this question: Which infinite sequences $\{b_k\}$ are the moments of a probability distribution 
x(t) on the interval $0 \le t \le 1$? He called those sequences moment sequences. 
Certainly $b_0 = 1$, since we require $\int_0^1 dx(t) = 1$. Also, we must have 

$$\int_0^1 f(t)\, dx(t) \ge 0$$

for all continuous functions $f(t) \ge 0$. Setting $f(t) = t^j (1 - t)^k$, we get the necessary condition 

$$\int_0^1 \sum_{\nu=0}^{k} (-1)^{\nu} \binom{k}{\nu} t^{j+\nu}\, dx(t) \ge 0,$$

which says this about the moments: 

$$\sum_{\nu=0}^{k} (-1)^{\nu} \binom{k}{\nu} b_{j+\nu} \ge 0 \qquad (j, k \ge 0).$$

A sequence $\{b_j\}$ with this property is called completely monotone. If we define the difference 
operator Δ by $\Delta b_j = b_{j+1} - b_j$, the last formula says 

$$(-1)^k \Delta^k b_j \ge 0 \qquad (j, k \ge 0).$$

Hausdorff’s theorem says: If $b_0 = 1$, the sequence $b_0, b_1, b_2, \ldots$ is a moment sequence if and only 
if it is completely monotone. 
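
The condition is easy to test on examples. The Python sketch below checks the inequalities on a finite prefix of a sequence, which can only confirm a necessary condition, since complete monotonicity concerns the whole infinite sequence. The test sequence 1/(k + 1), the moments of the uniform distribution on [0, 1], and the function names are choices made for this illustration.

```python
from fractions import Fraction
from math import comb, factorial

def signed_difference(b, j, k):
    """(-1)^k Delta^k b_j, written as sum over nu of (-1)^nu C(k, nu) b_{j+nu}."""
    return sum((-1) ** nu * comb(k, nu) * b[j + nu] for nu in range(k + 1))

def is_completely_monotone_prefix(b):
    """Check every inequality that involves only the listed terms of b."""
    n = len(b)
    return all(signed_difference(b, j, k) >= 0
               for k in range(n) for j in range(n - k))

uniform = [Fraction(1, k + 1) for k in range(8)]          # integral of t^k over [0, 1]
increasing = [Fraction(1), Fraction(1, 2), Fraction(3, 4)]  # b_2 > b_1: not moments

print(is_completely_monotone_prefix(uniform))
print(is_completely_monotone_prefix(increasing))
```

For the uniform distribution the signed difference equals the integral of t^j (1 - t)^k over [0, 1], that is, j! k! / (j + k + 1)!, which is visibly positive.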

We've already proved the only if part. To prove the if part, let’s assume the sequence $\{b_k\}$ is 
completely monotone, with $b_0 = 1$. Now we must find a p.d.f. (probability distribution function) 
x(t) satisfying the moment equations (3). 

Suppose we can solve the system of moment equations 

(i)   $$\int_0^1 t^k\, dx_n(t) = b_k \qquad (k = 0, \ldots, n)$$

for each finite n. Then the p.d.f.’s $x_n(t)$ have a subsequence that converges to a p.d.f. x(t) at all 
points of continuity of the limit x(t). Then x(t) satisfies all the moment equations (3), and we’re 
done. 

So the required p.d.f. x(t) exists unless some finite system (i) is unsolvable. But the system (i) is 
a finite linear system for an unknown $dx_n(t) \ge 0$. A simple extension of the Farkas theorem says 
this: The system (i) is unsolvable for a p.d.f. $x_n(t)$ if and only if there exist numbers $y_0, \ldots, y_n$ 
satisfying 

(ii)   $$\sum_{k=0}^{n} y_k t^k \ge 0 \quad (0 \le t \le 1), \qquad \sum_{k=0}^{n} y_k b_k < 0.$$
We must show that this is impossible.
Suppose (ii) is true. Define the polynomial $f(t) = \sum y_k t^k$. Then Taylor’s theorem says

$$y_k = f^{(k)}(0)/k!.$$

As a limit of difference quotients, this equals

$$y_k = \lim_{\varepsilon \to 0} \Delta_\varepsilon^k f(0)/(\varepsilon^k k!),$$

where $\Delta_\varepsilon f(t) = f(t + \varepsilon) - f(t)$. Setting $\varepsilon = 1/N$, we deduce

$$y_k = \lim_{N \to \infty} \binom{N}{k} \Delta_{1/N}^k f(0).$$

The second part of (ii) says $\sum y_k b_k < 0$, and so for large $N$ we must have

$$\sum_{k=0}^{n} \binom{N}{k} \bigl(\Delta_{1/N}^k f(0)\bigr)\, b_k < 0.$$

The upper limit, $n$, may be replaced by a larger integer, $N$, since an $n$th degree polynomial $f(t)$
satisfies $\Delta_{1/N}^k f(t) = 0$ for $k > n$. Now we rearrange the last sum to obtain the inequality

$$\sum_{j=0}^{N} f\!\left(\frac{j}{N}\right) \binom{N}{j} \cdot (-)^{N-j} \Delta^{N-j} b_j < 0.$$

But (ii) says $f \ge 0$, and the completely monotone sequence $\{b_k\}$ satisfies $(-)^k \Delta^k b_j \ge 0$, so all terms
in the last sum are nonnegative, and we have a contradiction. The Farkas alternative (ii) is
impossible.
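The rearrangement step can be checked numerically. The sketch below is an illustration under stated assumptions rather than part of the proof: the function names are hypothetical, the test polynomial $f(t) = (2t - 1)^2$ and the uniform moments $b_k = 1/(k+1)$ are arbitrary choices, and exact fractions are used so that the two sums can be compared for equality.

```python
from fractions import Fraction
from math import comb

def forward_difference_f(f, k, N):
    """k-th forward difference of f at 0 with step 1/N."""
    return sum((-1) ** (k - j) * comb(k, j) * f(Fraction(j, N))
               for j in range(k + 1))

def signed_difference_b(b, j, k):
    """(-1)^k Delta^k b_j for a sequence b."""
    return sum((-1) ** i * comb(k, i) * b[j + i] for i in range(k + 1))

def left_sum(f, b, N):
    """sum_k C(N, k) * Delta^k f(0) * b_k, with upper limit N."""
    return sum(comb(N, k) * forward_difference_f(f, k, N) * b[k]
               for k in range(N + 1))

def right_sum(f, b, N):
    """sum_j f(j/N) * C(N, j) * (-1)^(N-j) Delta^(N-j) b_j."""
    return sum(f(Fraction(j, N)) * comb(N, j) * signed_difference_b(b, j, N - j)
               for j in range(N + 1))

N = 6
b = [Fraction(1, k + 1) for k in range(N + 1)]   # uniform moments
f = lambda t: (2 * t - 1) ** 2                   # a nonnegative polynomial

print(left_sum(f, b, N), right_sum(f, b, N))
```

Both sums agree, and the value is nonnegative, as the argument requires when $f \ge 0$ and $\{b_k\}$ is completely monotone.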

Therefore, the alternative (i) is true: every finite system of moment equations (i) is solvable. It
follows that the infinite system (3) is solvable, and so we have proved Hausdorff’s theorem. 

This theorem is important in probability theory. As William Feller said, “Its discovery has 
been justly celebrated as a deep and powerful result.” (See [9], p. 226.) 

As you've just seen, the mathematical methods of economics have striking applications to the 
rest of mathematics. As you might have feared, I could go on talking to you forever. I could tell 
you about applications to ill-posed boundary-value problems of partial differential equations. But 
I manfully refrain; you have already heard enough. By now, I hope you will agree with me: these 
problems and methods of economics are valuable, and they are fascinating. 
MISCELLANEA


99. Bolzano-Weierstrass for ornithologists. 


We came... to a heavy thicket of bramble and stones, rising like a dinosaur out of which was 
an extraordinary contrivance of wire fences, wood, and ropes, of doors and pulleys moaning in the 
breeze. This, Miss Whittaker said cheerfully, was a Heligoland bird trap, the most serviceable kind 
there is. I stared astonishedly at the Heligoland bird trap, estimating it one hundred feet long, 
thirty feet wide, and eight feet high—roughly one million times as large as the average bird. 
Presently, Miss Whittaker saw my dismay, and observed that a Heligoland bird trap... is 
superior to any [other device] because of the sheer intricacy of its mechanism, which only the most 
erudite of birds could hope to grasp. Its proper utilization begins, she continued, when she 
thrashes about in the brambles and frightens a bird into the trap, shutting a screen door behind it. 
The bird flies about in consternation, and more and more screen doors are closed by Miss 
Whittaker, who—did I say?—is also inside the Heligoland bird trap; the woebegone bird finds 
itself in smaller and smaller quarters, and finally, when the last door closes, in a small, accessible,
wooden box.

—John Sack, Report from Practically Nowhere, 
Harper, New York, 1959, p. 23. Suggested 
by H. P. Boas. 

To decide this problem, see p. 266. 
The numbers in parentheses are the rankings by the Associated Press and the United Press
International opinion polls. The first three columns present the win-loss-tie record. The ranking
itself is presented in the next column. The numbers represent the ranking on a logarithmic scale
base 2 (the scale is calibrated so that the team ranked number 60 has ranking value 0). The $p$’s and
$x$’s can be computed from this scale. For instance


$$x_{1,2} = 2^{8.833 - 8.784} = 2^{0.049} = 1.034$$


and 


For each team in the table we also give four selected “odds” $x_{ij}$. For the $n$th team we list the
numbers


$x_{n,1}$, $x_{n,20}$, $x_{n,50}$ and $x_{n,100}$


comparing the team with the teams ranked number 1, 20, 50 and 100. 

Two technical remarks: 

1. All the 177 teams are comparable. One team had a perfect losing record, all other teams 
form one equivalence class. 

2. Some teams played against opponents from Divisions II or III. For the purpose of this 
ranking, these lower division opponents are identified and treated as one single team. This 
approximation introduces a slight error into the ranking of the bottom teams; it has, however, no 
discernible effect on the high ranked teams. [This hypothetical “average Division II/III opponent” 
is ranked number 155 (with ranking value — 3.685). The highest ranked team that played a lower 
division opponent is number 46, and we have p(46, 155) = .985, x(46, 155) = 67.] 
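The conversion from the base-2 ranking scale to odds and probabilities can be sketched as follows. This assumes the relation $x_{ij} = 2^{v_i - v_j}$ between ranking values $v_i$, $v_j$ and odds, which is how the logarithmic scale described above is read here; the function name is hypothetical.

```python
def odds_and_probability(v_i, v_j):
    """Return (x_ij, p_ij) for two ranking values on a base-2 log scale.

    Assumes x_ij = 2 ** (v_i - v_j) and p_ij = x_ij / (1 + x_ij).
    """
    x = 2.0 ** (v_i - v_j)
    return x, x / (1.0 + x)

# The two ranking values quoted in the worked example above.
x, p = odds_and_probability(8.833, 8.784)
print(round(x, 3), round(p, 3))
```

The quoted pair $x(46, 155) = 67$, $p(46, 155) = .985$ is consistent with the second relation, since $67/68 \approx 0.985$.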


Appendix 2 The Split 1981 Baseball Season 


Due to a strike in the middle of the season it was decided that the 1981 season would be
divided into two halves: the games played before the strike and the games played after the strike. 
The teams with the best winning percentage in each division were declared first-division winners. 
At the time of this writing, the second half is still being played. 

In the table on p. 265 we give the ranking of both leagues, based on their results in the first half 
of the 1981 season. In each league, we list the Eastern division teams first, in decreasing rank, 
followed by the Western division teams. The numbers displayed next to each team represent 

(1) the schedule, i.e., the number of games played against other teams in the league; 

(2) the won-lost record; 

(3) the ranking; the numbers $p_{ij}$ rounded off to three decimal places;

(4) the expected score: this is calculated from the $p_{ij}$, and is based on the complete schedules
of both leagues as displayed in (2.3).
In three cases the highest-ranked team in its division is the declared winner. Note, however, that in 
American League West, the declared winner (the team with the highest win-loss ratio) is ranked 
second by our method. 
Kurt Gödel.


THE RANKING OF INCOMPLETE TOURNAMENTS: A 
MATHEMATICIAN’S GUIDE TO POPULAR SPORTS 


THOMAS JECH 
Department of Mathematics, Pennsylvania State University, University Park, PA 16802 


1. Introduction. Each winter millions of Americans engage in passionate discussions about 
which collegiate football team should be the national champion. A handful is even privileged 
enough to have their opinions tabulated by two opinion polls. These polls then decide who the 
champion is. 

It has been a long tradition in sports that champions are determined, whenever possible, by 
objective methods, preferably measuring performance numerically, in terms of seconds, meters, 
number of wins and losses, etc. In team sports, for example, the standard system is a round-robin 
tournament, where each team plays against every other team (possibly more than once but the 
same number of times), and counting a win as one point and a tie as a half point, the team with 
the largest number of points is the winner. 

So it is surprising that in a popular sport like collegiate football, the champion is determined by 
such arbitrary methods. And the obvious question is: Is there an objective method that ranks the 
teams according to their records even when their playing schedules are not identical? 

It is the purpose of this article to show that this problem can be formulated mathematically 
and to give a solution of this problem. We show that (subject to a very natural condition) there is 
one and only one correct way of comparing the records of teams in an incomplete tournament. 
The condition under which our method is applicable and which will be made precise in Section 2 
can be expressed roughly as: “there are enough results available to have a basis for comparison.” 

The method itself is not limited to applications in sports. As another example, consider a 
situation when a researcher considers a certain large number of objects and wants to rank them 
according to some of their properties but can only make a limited number of comparisons. The 
test results may even be inconsistent; for instance suggesting that A is better than B, B is better 
than C and C is better than A. 

Moreover, a result of comparing object A with object B may not be conclusive but only give 
probability $p$, $0 \le p \le 1$, that A is better (bigger, brighter, more desirable, more marketable) than
B. So our researcher has obtained these results $r_{ij}$ ($0 \le r_{ij} \le 1$, $r_{ji} = 1 - r_{ij}$) for some $i$’s and $j$’s
and the question is: Do these results provide a ranking of the given items, and in particular, can 
we tell which one is best (biggest, brightest, most desirable, most marketable) or at least which is 
most likely to be such? It should be clear that this is, mathematically, the same problem as the 
problem of ranking college football teams. So our method can also be applied in such situations. 

In the appendix I give some applications of my method. One example I use is the 1981 major 
league baseball season, in which (due to a strike) the teams played an unequal number of games 
against differing opponents. 


2. Formulation of the Problem and Statement of the Result. 


I was born in Prague in 1944. Mathematics has fascinated me since a very early age, and although I have had
many other interests over the years, from chess to rock climbing to marathon running, mathematics has always
remained my main interest. While in high school I competed in Mathematical Olympiads. At Charles University I
became interested in set theory, mostly under the influence of Petr Vopěnka, and in 1966 I earned a Doctorate for
work using the then new method of forcing invented by Paul Cohen. I fled Czechoslovakia after the Soviet
occupation in 1968 and spent the next six years wandering about British and U.S. universities. Since 1974 I have
been a Professor of Mathematics at The Pennsylvania State University. T. J.

I wish to express my gratitude to my wife and son for their invaluable advice on the use of the computer and on
baseball and football, of which I am rather ignorant.

Tournaments. Although the ranking method can in general be applied in other situations, such
as I touched upon in Section 1, I prefer to state it in terms of comparing teams in a competition.
Thus let us consider $n$ teams

$$T_1, T_2, \ldots, T_n$$

which are matched in a certain way so that some teams play each other (possibly more than once)
and some do not. The information which pairs of teams have played is given by the schedule
matrix

(2.1) $M = (m_{ij})_{i,j=1}^{n}.$

$M$ is a symmetric $n$ by $n$ matrix and each $m_{ij}$ is a nonnegative integer. The number $m_{ij}$ represents
the number of times the team $T_i$ is pitted against the team $T_j$. For each $i$ and $j$ we have

$$m_{ij} = m_{ji}$$

and

$$m_{ii} = 0.$$

As examples, consider a simple round-robin tournament whose schedule matrix is

(2.2) $\begin{pmatrix} 0 & 1 & 1 & 1 \\ 1 & 0 & 1 & 1 \\ 1 & 1 & 0 & 1 \\ 1 & 1 & 1 & 0 \end{pmatrix}$


and the schedule matrices of a complete season of American and National League baseball:

American League

(2.3) $M_A$ = [14 × 14 schedule matrix; only one row is legible: 13 13 13 0 13 13 13 12 12 12 12 12 12 12]

National League

$M_N$ = [12 × 12 schedule matrix; entries not legible]
The top seven rows of $M_A$ (the top six rows of $M_N$) belong to the eastern teams and the bottom
seven (six) rows to the western teams.

Let $T_i$ and $T_j$ be two teams playing in the tournament. The result of the $m_{ij}$ matches between $T_i$
and $T_j$ is a real number $r_{ij}$ such that

(2.4) $0 \le r_{ij} \le m_{ij}$

and

(2.5) $r_{ij} + r_{ji} = m_{ij}.$

If a team winning a game is awarded one and the loser zero (and each gets 1/2 in case of a tie)
then $r_{ij}$ represents the number of points accumulated by the team $T_i$ in the $m_{ij}$ encounters with the
team $T_j$. In general, each of the $m_{ij}$ comparisons made between $T_i$ and $T_j$ can result in a number
$0 \le r \le 1$ for $T_i$ (and $1 - r$ for $T_j$), and then $r_{ij}$ is the sum of these $m_{ij}$ numbers.

A tournament is represented by its result matrix

(2.6) $R = (r_{ij})_{i,j=1}^{n}.$

The score of a team $T_i$ in the tournament (2.6) is the sum of its results in all its games:

(2.7) $s_i = \sum_{j=1}^{n} r_{ij}.$

The score vector is simply a listing of scores of all teams:

(2.8) $s = (s_1, \ldots, s_n).$

Ranking of a Tournament. Our goal is to find an ordering of the teams in the tournament, that
is, a relation

(2.9) $T_i > T_j$

meaning that the team $T_i$ is better (in the tournament) than the team $T_j$. The relation $>$ should be
transitive, i.e., if $T_i > T_j$ and $T_j > T_k$, then $T_i > T_k$, and we would like the ordering to be total, i.e.,
for any $T_i$ and $T_j$ we wish either $T_i > T_j$ or $T_j > T_i$ or $T_i = T_j$. We do, however, expect the
possibility of $T_i = T_j$ for distinct $i$ and $j$, i.e., the case when the teams $T_i$ and $T_j$ have equal
ranking.

The key to successful ranking of the tournament (2.6) is to ask not just how the teams are
related by the relation $>$, but to what degree. In other words, we ask not just whether $T_i$ is better
than $T_j$ or vice versa but how much better: we try to assign a numerical value to the superiority of
$T_i$ over $T_j$ (or of $T_j$ over $T_i$).

To understand this idea, consider the case when the “tournament” has only two teams $T_1$ and
$T_2$ and they play a large number of games, say $m$. It is reasonable to expect that in the long run,
the better team wins more games than it loses, and the greater its superiority, the larger is the
percentage of won games. If $T_1$ wins $r$ games and $T_2$ wins the other $m - r$, then we might view the
number

$$p = \frac{r}{m}$$

as the probability that $T_1$ wins any single game against $T_2$, or the probability that $T_1$ is better than
$T_2$. Similarly,

$$1 - p = \frac{m - r}{m}$$

represents the probability that $T_2$ is better than $T_1$. To express this differently, the chance that $T_1$
beats $T_2$ is

$$x = \frac{p}{1 - p}$$

times greater than that $T_2$ beats $T_1$. In other words, $x$ represents the odds of $T_1$ winning against
$T_2$. (Note that $T_1$ has an $x$ times better record than $T_2$.) Note also that if $T_1$ wins all $m$ games, then
$p = 1$ and $x = \infty$, so $x$ can be any number between 0 and $\infty$ and $0 \le p \le 1$.
Thus we introduce, as yet unknown, numbers

$$x_{ij},\ p_{ij} \qquad i, j = 1, \ldots, n$$

with the stipulation that $0 \le p_{ij} \le 1$, $0 \le x_{ij} \le \infty$ and such that

(2.10) $x_{ji} = \frac{1}{x_{ij}}$ and $p_{ji} = 1 - p_{ij};$

moreover, the $p$’s and $x$’s are related by the equations

(2.11) $x_{ij} = \frac{p_{ij}}{1 - p_{ij}}, \qquad p_{ij} = \frac{x_{ij}}{1 + x_{ij}}.$

(The second equation is obtained by solving the first equation for $p_{ij}$.)
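The two equations in (2.11) are inverse to each other, including at the endpoints $p = 1$, $x = \infty$. A small sketch, with hypothetical function names and `math.inf` standing in for $\infty$:

```python
import math

def odds_from_prob(p):
    """x = p / (1 - p), with x = infinity when p = 1."""
    return math.inf if p == 1 else p / (1 - p)

def prob_from_odds(x):
    """p = x / (1 + x), with p = 1 when x is infinite."""
    return 1.0 if math.isinf(x) else x / (1 + x)

# A team that wins 3 games out of 4 has p = 0.75 and odds x = 3.
print(odds_from_prob(0.75), prob_from_odds(3))
```

Swapping the two teams replaces $p$ by $1 - p$ and $x$ by $1/x$, in agreement with (2.10).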

We try to assign these $x$’s (or $p$’s) to all pairs of teams $T_i$, $T_j$. The intended meaning is that if
$x_{ij} > 1$ (or $p_{ij} > \frac12$) then $T_i$ is better than $T_j$ and if $x_{ij} = 1$ (or $p_{ij} = \frac12$), then $T_i$ and $T_j$ are equal.
As the intended interpretation of each $x_{ij}$ is the relative strength of $T_i$ and $T_j$, the $x$’s should
follow the laws of probabilities: for any $i$, $j$ and $k$,

(2.12) $x_{ik} = x_{ij} \cdot x_{jk}.$ (*)

(Since $0 \cdot \infty$ is undefined, we shall consider the equation (2.12) valid for all $i$, $j$, $k$ whenever the
right hand side is $0 \cdot \infty$.) The equation (2.12) takes the following form for the associated $p$’s (and
any $i$, $j$, $k$):

(2.13) $p_{ij} \cdot p_{jk} \cdot p_{ki} = p_{kj} \cdot p_{ji} \cdot p_{ik}.$

Thus we call a matrix

(2.14) $P = (p_{ij})_{i,j=1}^{n}$

a ranking matrix (or simply a ranking) if the $p_{ij}$ satisfy the equations (2.13).
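Condition (2.13) can be tested directly on a candidate matrix. The sketch below uses a hypothetical function name and two made-up matrices: one built from positive "strengths" $w_i$ via $p_{ij} = w_i/(w_i + w_j)$, which satisfies (2.13), and one describing a cyclic "A beats B beats C beats A" situation, which does not.

```python
from itertools import product

def is_ranking_matrix(P, tol=1e-12):
    """True if p_ij p_jk p_ki == p_kj p_ji p_ik for all i, j, k (up to tol)."""
    n = len(P)
    return all(
        abs(P[i][j] * P[j][k] * P[k][i] - P[k][j] * P[j][i] * P[i][k]) <= tol
        for i, j, k in product(range(n), repeat=3)
    )

# Hypothetical strengths; p_ij = w_i / (w_i + w_j) always satisfies (2.13).
w = [4.0, 2.0, 1.0]
P_good = [[w[i] / (w[i] + w[j]) for j in range(3)] for i in range(3)]

# A cyclic matrix with p_ji = 1 - p_ij that violates (2.13).
P_bad = [[0.5, 0.9, 0.1],
         [0.1, 0.5, 0.9],
         [0.9, 0.1, 0.5]]

print(is_ranking_matrix(P_good), is_ranking_matrix(P_bad))
```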

Note that the equations (2.12) guarantee transitivity of the relation $T_i > T_j$: If $x_{ij} > 1$ and
$x_{jk} > 1$, then $x_{ik} > 1$, and so if $T_i$ is better than $T_j$ and $T_j$ is better than $T_k$, then $T_i$ is better than
$T_k$.

(*) The following argument justifies the validity of the equation (2.12): Suppose we can compare the objects $T_i$
and $T_k$ only indirectly by comparing $T_i$ with $T_j$ and $T_j$ with $T_k$, and we do it a large number of times, say $M$. In
$M \cdot p_{ij}$ cases, $T_i$ looks better than $T_j$, and of these $M \cdot p_{ij}$ cases, $T_j$ looks better than $T_k$ exactly $M \cdot p_{ij} \cdot p_{jk}$ times
and worse $M \cdot p_{ij} \cdot (1 - p_{jk})$ times. Whenever we find $T_i$ better than $T_j$ and $T_j$ better than $T_k$ we conclude that $T_i$ is
better than $T_k$, but when $T_i$ is found better than $T_j$ and $T_j$ worse than $T_k$, we reserve our judgment about $T_i$ and $T_k$.

A similar situation arises in the $M \cdot (1 - p_{ij})$ cases when $T_j$ is deemed better than $T_i$. Thus we have $M \cdot p_{ij} \cdot p_{jk}$
cases when $T_i$ is declared better than $T_k$ and $M \cdot (1 - p_{ij}) \cdot (1 - p_{jk})$ cases when $T_k$ is considered better. It follows
that

$$x_{ik} = \frac{M \cdot p_{ij} \cdot p_{jk}}{M \cdot (1 - p_{ij}) \cdot (1 - p_{jk})} = x_{ij} \cdot x_{jk}.$$

Since each $p_{ij}$ is interpreted as the probability that the team $T_i$ beats the team $T_j$, the expected
(most probable) result of $T_i$ in $m_{ij}$ games against $T_j$ is

$$m_{ij} \cdot p_{ij}$$

and the expected score of the team $T_i$ in a tournament with schedule $M = (m_{ij})$ is

$$\sum_{j=1}^{n} m_{ij}\, p_{ij} = m_{i1} p_{i1} + m_{i2} p_{i2} + \cdots + m_{in} p_{in}.$$

This gives us another condition on the $p_{ij}$: if these numbers are the correct probabilities obtained
from the tournament (2.6), then the expected score of each team has to be equal to the actual
score of the team in that tournament: For each $i = 1, \ldots, n$,

(2.15) $\sum_{j=1}^{n} m_{ij} \cdot p_{ij} = s_i.$

This reasoning leads us to the definition of the central concept of this study:
A Ranking of the Tournament

(2.6) $R = (r_{ij})$

is a matrix

(2.14) $P = (p_{ij})$

that satisfies, for all $i, j, k = 1, \ldots, n$,

(2.13) $p_{ij} \cdot p_{jk} \cdot p_{ki} = p_{kj} \cdot p_{ji} \cdot p_{ik}$

and

(2.15) $\sum_{j=1}^{n} m_{ij} \cdot p_{ij} = s_i,$

where $M = (m_{ij})$ is the schedule and $s = (s_i)$ is the score (and satisfy (2.5) and (2.7)).

It is my intention to prove that the equations (2.13) and (2.15) have a solution $P$; i.e., a ranking
exists. And, if the tournament satisfies a certain natural condition, then $P$ is unique. The ranking
matrix $P$ then yields a linear ordering of the teams in the tournament, defined by

$$T_i > T_j \quad \text{if } p_{ij} > \tfrac12 \ (\text{or } x_{ij} > 1)$$

$$T_i = T_j \quad \text{if } p_{ij} = \tfrac12 \ (\text{or } x_{ij} = 1).$$

If $T_i > T_j$ ($T_i = T_j$), we say that the team $T_i$ has higher (equal) ranking than the team $T_j$.

Note that the ranking P is determined by M (the schedule) and s (the score), and does not 
depend directly on R (the results). This is not surprising: as in a round-robin tournament, it is the 
total score that matters, not how the points are accumulated. 


Comparability. It is clear that the tournament has to satisfy some conditions if we want to get a
meaningful ranking. For instance if all $m_{ij} = 0$ (an “empty” tournament) then we have no basis
for comparison of the teams. Similarly, if the tournament consists of two disjoint groups of teams
with no games played between teams in different groups (like the American and National Leagues
in a regular season), then we have no basis for comparison of teams in different groups. Another
example is when there are two teams with a perfect score (beating all their opponents). Again, the
results above do not provide a basis for comparison of the two teams. (As this situation is of
considerable practical significance, one might want to use additional criteria for ranking such
teams; see the remark on tie breaking in Section 4.)


Definition. Teams $T_i$ and $T_j$ are comparable (in the tournament $R = (r_{ij})$) if there exists a
sequence

(2.16) $i_1, i_2, \ldots, i_k$

such that all the numbers

(2.17) $r_{i_1 i_2}, r_{i_2 i_3}, \ldots, r_{i_{k-1} i_k}$

are positive, and either

(2.18) $i_1 = i$ and $i_k = j$

or

(2.19) $i_1 = j$ and $i_k = i$.


(Example: T, beats T;, T; ties with T,, T; beats 7,. T, and J, are comparable because r, , = 1, 
rs. = 3517,4 = 1) 
If (2.18) is the case, then we use the notation

$$T_i \succeq T_j;$$

if both $T_i \succeq T_j$ and $T_j \succeq T_i$, then we say that $T_i$ and $T_j$ are similar, and denote this relation

$$T_i \equiv T_j.$$

Note that $\succeq$ is transitive and $\equiv$ is transitive and symmetric. We also let, by definition, $T_i \equiv T_i$, so
that $\equiv$ is an equivalence relation. The equivalence classes of $\equiv$ are partially ordered by $\succeq$.
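Comparability and similarity amount to reachability in the directed graph that has an edge from $i$ to $j$ whenever $r_{ij} > 0$. The sketch below computes this with Warshall's transitive-closure algorithm; the function names and the four-team result matrix are hypothetical, and the relation is taken to be reflexive, as in the convention just stated.

```python
def reachable(R):
    """reach[i][j] is True when a chain of positive results leads from i to j."""
    n = len(R)
    reach = [[i == j or R[i][j] > 0 for j in range(n)] for i in range(n)]
    for k in range(n):                      # Warshall's transitive closure
        for i in range(n):
            for j in range(n):
                if reach[i][k] and reach[k][j]:
                    reach[i][j] = True
    return reach

def comparable(reach, i, j):
    """A chain exists in at least one direction, (2.18) or (2.19)."""
    return reach[i][j] or reach[j][i]

def similar(reach, i, j):
    """Chains exist in both directions."""
    return reach[i][j] and reach[j][i]

# Hypothetical results: A beats B and C, B and C each beat D,
# and B and C never meet.
A, B, C, D = range(4)
R = [[0, 1, 1, 0],
     [0, 0, 0, 1],
     [0, 0, 0, 1],
     [0, 0, 0, 0]]

reach = reachable(R)
print(comparable(reach, A, D), comparable(reach, B, C), similar(reach, A, B))
```

In this made-up tournament every pair is comparable except B and C, and no two distinct teams are similar.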
As an example, consider a tournament with a result matrix 


In this example, all teams are comparable except the pair $B$ and $C$: $A \succeq B \succeq D$ and $A \succeq C \succeq D$.
The theorems.
THEOREM 1. Every tournament in which all teams are comparable has a unique ranking.
Precisely: If

(2.6) $R = (r_{ij}) \qquad i, j = 1, \ldots, n$

is a tournament with schedule $M = (m_{ij})$ and score $s = (s_i)$ and if each pair $T_i$, $T_j$ is comparable,
then there exists a unique ranking

(2.14) $P = (p_{ij})$

which satisfies the equations

(2.13) $p_{ij} \cdot p_{jk} \cdot p_{ki} = p_{kj} \cdot p_{ji} \cdot p_{ik}$

and

(2.15) $\sum_{j=1}^{n} m_{ij} \cdot p_{ij} = s_i.$
There is, in fact, a more general theorem which I will state presently. If the tournament
contains incomparable teams, we can still attempt to rank the teams that are comparable.
So let us call a partial ranking of a tournament $R = (r_{ij})$ a partial matrix

$$P = (p_{ij})$$

in which the numbers $p_{ij}$ are defined for those and only those pairs $T_i$, $T_j$ which are comparable,
and which likewise satisfies the equations (2.13) and (2.15); that is, the equation (2.13) for those $p$’s that
are defined. (Note that we don’t have to make qualifications about (2.15) as $p_{ij}$ is defined when
$m_{ij} \ne 0$.)

THEOREM 2. Every tournament has a unique partial ranking.


The third theorem is just a verification of the expected, namely that our method generalizes the
simple method of comparing the win-loss record of teams with the same schedule such as in a
round-robin tournament. Let us say that the teams $T_i$ and $T_j$ have identical schedules if for every
$k \ne i, j$

(2.20) $m_{ik} = m_{jk}.$

For instance, in the American League (2.3), any two eastern teams have identical schedules (and
so do any two western teams).


THEOREM 3. Let $T_i$ and $T_j$ be two teams with identical schedules in the tournament.

(1) If $T_i$ has a higher score (i.e., $s_i > s_j$), then $T_i$ has higher ranking than $T_j$.
(2) $T_i$ and $T_j$ have equal ranking if and only if they are comparable* and have the same score (i.e.,
$s_i = s_j$).


In particular, the ranking of a round-robin tournament produces the same ordering of teams as 
the ordering of teams by their total scores. 

We conclude this section with several remarks. One is that the uniqueness of partial ranking is
best possible, in the following sense: There is no unique partial ranking that would be defined also
for incomparable teams. More precisely: It can be shown (see Lemma 9 in Section 3) that for any
tournament, if $T_i$ and $T_j$ are incomparable, then for any number $p$ such that $0 \le p \le 1$ there exists
a ranking $P = (p_{ij})$ of the tournament (satisfying (2.13) and (2.15)) such that $p_{ij} = p$.

Our next remark concerns the concept of comparability. Although the definition ((2.16), (2.17))
uses the result matrix $(r_{ij})$, the property itself depends only on the schedule matrix $M$ and the
score vector $s$. This claim will be proved in Section 3 (Lemma 5).

Finally, let me address the practical problem of how to compute the ranking of a given
tournament. By Theorem 1, if all teams are comparable, then the tournament has a unique
ranking, i.e., a unique solution $P = (p_{ij})$ of the equations (2.13) and (2.15). As shown below, this
amounts to solving a system (3.2) of nonlinear equations. Our proof in Section 3 shows that the
system (3.2) has a solution, by invoking the fixed point theorem, but does not indicate how to
compute a solution. In practice, an approximate (but sufficiently accurate) solution can be
obtained by clever successive iterations. For the examples in Section 4, a pocket calculator was a
sufficient tool. For the baseball example in Appendix 2, I used a microcomputer, but for the
football example in Appendix 1, I had to turn for help to an IBM 360/270 computer.
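The text does not say which successive iteration was used. As one possibility, the sketch below applies the classical fixed-point update $w_i \leftarrow s_i \big/ \sum_j m_{ij}/(w_i + w_j)$, familiar from the Zermelo and Bradley-Terry paired-comparison models, with $p_{ij} = w_i/(w_i + w_j)$. This is a swapped-in technique and not necessarily the author's. The function name and the round-robin data are hypothetical, and the code assumes that any two teams are similar, so that every score is positive and a solution exists.

```python
import math

def rank_tournament(M, s, iterations=10000, tol=1e-13):
    """Approximate P with sum_j m_ij p_ij = s_i, where p_ij = w_i / (w_i + w_j).

    Assumes any two teams are similar (in particular every s_i > 0).
    """
    n = len(M)
    w = [1.0] * n
    for _ in range(iterations):
        new_w = [s[i] / sum(M[i][j] / (w[i] + w[j]) for j in range(n) if j != i)
                 for i in range(n)]
        # Normalise to geometric mean 1, i.e. the logarithms of the w_i sum to 0.
        g = math.exp(sum(math.log(x) for x in new_w) / n)
        new_w = [x / g for x in new_w]
        done = max(abs(a - b) for a, b in zip(w, new_w)) < tol
        w = new_w
        if done:
            break
    return [[0.5 if i == j else w[i] / (w[i] + w[j]) for j in range(n)]
            for i in range(n)]

# Hypothetical round-robin: every pair meets 3 times; the scores are 4, 3, 2.
M = [[0, 3, 3], [3, 0, 3], [3, 3, 0]]
s = [4.0, 3.0, 2.0]
P = rank_tournament(M, s)
expected = [sum(M[i][j] * P[i][j] for j in range(3)) for i in range(3)]
print(expected)
```

At a fixed point of the update the expected scores coincide with the actual scores, which is condition (2.15); condition (2.13) holds automatically for probabilities of this form. In this round-robin example the team with the higher score also receives the higher ranking, as Theorem 3 predicts.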


3. Proofs. The proofs in this Section are more technical than the rest of the paper. Those who 
are willing to believe the Theorem without reading the proof may skip Section 3 and resume 
reading in Section 4. 


*Note that having identical schedules and the same score does not make two teams comparable. Consider the 
example when neither team has played any game. 
Existence of a Ranking. Since Theorem 1 is a special case of Theorem 2, we shall give a proof of
Theorem 2. Let $n$ be fixed and let

$$R = (r_{ij})_{i,j}, \quad M = (m_{ij})_{i,j}, \quad s = (s_i)_i$$

be the result matrix, the schedule matrix and the score vector of a given tournament. We shall first
prove that a partial ranking exists.

First we reduce the problem to the case when all teams in the tournament are in the same
equivalence class given by the similarity relation $\equiv$. If $E$ is such an equivalence class, then the
restriction

$$R_E = (r_{ij})_{i,j \in E}$$

of the result matrix $R$ describes a subtournament with result matrix $R_E$. And we have


LEMMA 1. If for each equivalence class $E$ of $\equiv$, the subtournament $R_E$ has a ranking, then $R$ has
a partial ranking.


Proof. For each equivalence class $E$, let $P_E = (p_{ij})_{i,j \in E}$ be a ranking of $R_E$. We define a
partial matrix $P = (p_{ij})$ as follows: For each pair $i, j$ such that $T_i \succeq T_j$, if $T_i \equiv T_j$, then $p_{ij}$ is
already defined; if $T_i \not\equiv T_j$, then we let $p_{ij} = 1$ (and $p_{ji} = 0$). Since $\succeq$ is transitive, it is easy to see
that the equations (2.13) remain valid for the newly defined $p_{ij}$. As for the equations (2.15) note
that when $T_i \succ T_j$ (i.e., $T_i \succeq T_j$ and $T_i \not\equiv T_j$) then $r_{ij} = m_{ij}$ (by definition of $\succeq$) and so
$m_{ij} \cdot p_{ij} = r_{ij}$. It follows that the partial ranking $P$ satisfies the equations (2.15) as well. □


So from now on we assume that any two teams in the tournament are similar:

(3.1) $T_i \equiv T_j$ holds for any $i, j$.

And we want to find numbers $p_{ij}$ satisfying (2.13) and (2.15).


LEMMA 2*. Let $R = (r_{ij})$ be a tournament that satisfies the condition (3.1) and let $M = (m_{ij})$
and $s = (s_i)$ be the corresponding schedule matrix and the score vector. There exist numbers
$v_1, \ldots, v_n$ which satisfy the following system of equations, $i = 1, \ldots, n$:

(3.2) $\sum_{j=1}^{n} \frac{m_{ij}}{1 + e^{v_j - v_i}} = s_i.$

*As the numbers $x_{ij}$ have to satisfy (2.12), it is natural to consider their logarithms instead, thus looking for
numbers $v_i$ satisfying $v_i - v_j = \log x_{ij}$, and equivalently, $p_{ij} = 1/(1 + e^{v_j - v_i})$.

Before proving Lemma 2, let us show that a solution of (3.2) gives a ranking of $R$. So assume
that $v_1, \ldots, v_n$ satisfy (3.2) and let, for each $i$ and $j$,

$$p_{ij} = \frac{1}{1 + e^{v_j - v_i}}.$$

It is clear that the $p_{ij}$ satisfy the equations (2.15), and noting that

$$p_{ij} = \frac{e^{v_i}}{e^{v_i} + e^{v_j}},$$

one easily verifies the equations (2.13) as well.
So our goal now is to prove Lemma 2.
Proof of Lemma 2. Let $\mathcal{X}$ be the set of all $n$-tuples $v = (v_1, \ldots, v_n)$ such that

$$v_1 + \cdots + v_n = 0.$$

$\mathcal{X}$ is a closed subspace of the $n$-dimensional Euclidean space $\mathbf{R}^n$. We define an operator $T$ on the
space $\mathcal{X}$. For each $v \in \mathbf{R}^n$, let $F(v)$ be the left-hand side of (3.2); more precisely, let
$F(v) = (F_1(v), \ldots, F_n(v))$ where

$$F_i(v) = \sum_{j=1}^{n} \frac{m_{ij}}{1 + e^{v_j - v_i}} \qquad (i = 1, \ldots, n).$$

And for each $v \in \mathcal{X}$, we let

(3.3) $Tv = v + s - F(v).$
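Let $m$ denote the number of all games played in the tournament:

$$m = \frac12 \sum_{i,j} m_{ij}.$$

Note that

$$\sum_{i=1}^{n} s_i = m$$

and that for each $v \in \mathbf{R}^n$,

$$\sum_{i=1}^{n} F_i(v) = \sum_{i,j} \frac{m_{ij}}{1 + e^{v_j - v_i}} = \frac12 \sum_{i,j} m_{ij} \left( \frac{e^{v_i}}{e^{v_i} + e^{v_j}} + \frac{e^{v_j}}{e^{v_i} + e^{v_j}} \right) = \frac12 \sum_{i,j} m_{ij} = m.$$

It follows that if $v \in \mathcal{X}$ and $z = Tv$, then $\sum_{i=1}^{n} z_i = 0$ and so $Tv \in \mathcal{X}$. So $T$ maps the space $\mathcal{X}$
into itself and clearly is continuous. To solve the equations (3.2) it suffices to find a fixed point of
$T$, i.e., some $v \in \mathcal{X}$ such that

$$Tv = v.$$

For then $s - F(v) = 0$ by (3.3) and $v$ is a solution of (3.2).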
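The two sum identities behind this step, that the components of $F(v)$ add up to $m$ and that $Tv$ again has coordinates summing to zero, can be checked numerically. The sketch below uses a hypothetical three-team schedule and score vector; it illustrates only these identities and makes no claim that iterating $T$ converges.

```python
import math

def F(v, M):
    """Left-hand side of (3.2): F_i(v) = sum_j m_ij / (1 + exp(v_j - v_i))."""
    n = len(v)
    return [sum(M[i][j] / (1.0 + math.exp(v[j] - v[i])) for j in range(n))
            for i in range(n)]

def T(v, M, s):
    """The operator (3.3): Tv = v + s - F(v)."""
    return [vi + si - fi for vi, si, fi in zip(v, s, F(v, M))]

# Hypothetical data: three teams, every pair meets 3 times, so m = 9 games.
M = [[0, 3, 3], [3, 0, 3], [3, 3, 0]]
s = [4.0, 3.0, 2.0]
v = [0.7, -0.2, -0.5]      # coordinates sum to zero

print(sum(F(v, M)), sum(T(v, M, s)))
```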
To find a fixed point of $T$ in $\mathcal{X}$ we shall use the following form of Brouwer’s Fixed Point
Theorem:


If $C$ is a nonempty compact convex subset of $\mathbf{R}^n$ and if $f$ is a continuous function on $C$ such that
$f(v) \in C$ for all $v \in C$, then $f$ has a fixed point in $C$.


Thus Lemma 2 will follow from
LEMMA 3. There is a nonempty compact convex subset $C$ of $\mathcal{X}$ such that $Tv \in C$ for all $v \in C$.


Before we prove Lemma 3, we prove a technical lemma about $F$. Let $d$ be a positive number,
and let $v = (v_1, \ldots, v_n) \in \mathcal{X}$. A gap in $v$ of length $d$ is an (open) interval of length $d$ such that
each $v_i$ lies outside the interval. The top of the gap is the set of all indices $i$ such that $v_i$ lies above
the interval, and the bottom of the gap is defined similarly. If $i$ is in the top and $j$ is in the bottom
then $v_i - v_j \ge d$. We always assume that both the top and the bottom of the gap are nonempty
sets.

We recall that

$$m = \frac12 \sum_{i,j} m_{ij}$$

is the total number of games played. Let $\varepsilon$ denote the least $r_{ij}$ that is not zero:

$$\varepsilon = \min\{r_{ij} : i, j = 1, \ldots, n \text{ and } r_{ij} > 0\}.$$


LEMMA 4. Let $v \in \mathcal{X}$ and let $G$ be a gap in $v$ of length $d > \log(m/\varepsilon)$. Then

$$\sum_{i \in \text{top of } G} (s_i - F_i(v)) < 0.$$
Proof. Let $T$ and $B$ denote, respectively, the top and the bottom of the gap $G$. Let

$$k = \sum_{i \in T,\ j \in B} m_{ij} = \text{the number of games between the top teams and the bottom teams}$$

$$l = \frac12 \sum_{i, j \in T} m_{ij} = \text{the number of games between pairs of the top teams.}$$

We made the assumption that any two distinct teams $T_i$ and $T_j$ are similar; it follows that there are
at least one top $i$ and one bottom $j$ such that $r_{ji} \ne 0$, and so $r_{ij} = m_{ij} - r_{ji} \le m_{ij} - \varepsilon$. Hence we
have

(3.4) $\sum_{i \in T} s_i = \sum_{i, j \in T} r_{ij} + \sum_{i \in T,\ j \in B} r_{ij} \le l + (k - \varepsilon).$

On the other hand, if $i \in T$ and $j \in B$, then $v_j - v_i \le -d$, and we have

(3.5) $\sum_{i \in T} F_i(v) = \sum_{i, j \in T} \frac{m_{ij}}{1 + e^{v_j - v_i}} + \sum_{i \in T,\ j \in B} \frac{m_{ij}}{1 + e^{v_j - v_i}} \ge l + \frac{k}{1 + e^{-d}}.$

Using the inequality $d > \log(m/\varepsilon)$, we get

$$e^{-d} < \frac{\varepsilon}{m} \le \frac{\varepsilon}{k}, \qquad \frac{k}{1 + e^{-d}} > \frac{k}{1 + \varepsilon/k} = \frac{k^2}{k + \varepsilon} \ge k - \varepsilon.$$

Now, comparing (3.4) and (3.5), we conclude that

$$\sum_{i \in T} (s_i - F_i(v)) < 0.$$
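We are now ready to complete the existence proof:
Proof of Lemma 3. Let $d$ be a positive number such that

$$d > \log\frac{m}{\varepsilon} \quad\text{and}\quad d > m.$$

Consider the following $n - 1$ numbers

$$a_1 > a_2 > \cdots > a_{n-1} > 0$$

(where $n$ is the total number of teams in the tournament):

(3.6)
$$\begin{aligned}
a_{n-1} &= 2 \cdot (n-1)! \cdot d \\
a_{n-2} &= 2 \cdot (n-2)! \cdot d + a_{n-1} \\
&\ \ \vdots \\
a_2 &= 4d + a_3 \\
a_1 &= 2d + a_2
\end{aligned}$$

Let $C$ be the set of all $v \in \mathbf{R}^n$ such that

(3.7) $v_1 + \cdots + v_n = 0$

and

(3.8)
$$\begin{aligned}
\max\{v_i : i \le n\} &\le a_1 \\
\max\{v_i + v_j : i < j \le n\} &\le 2a_2 \\
&\ \ \vdots \\
\max\{v_{i_1} + \cdots + v_{i_{n-1}} : i_1 < \cdots < i_{n-1} \le n\} &\le (n-1)\,a_{n-1}
\end{aligned}$$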
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The set C is nonempty, closed, bounded (because of (3.7) and the first inequality in (3.8)), and, 
being the intersection of a hyperplane with a finite number of half-spaces, it is also convex. Hence 
it suffices to show that Tv © C whenever v € C. Let v € C, and let 


Z,; 22.2 eee 2 Z, 


be the enumeration of the coordinates of v in decreasing order, and likewise, let 


zt >it > bat 


be the enumeration of the coordinates of Tv in decreasing order. We already know that Tv 
satisfies (3.7) so we now verify that Tv satisfies (3.8). In the arguments that follow we tacitly use 
the fact that for each i 


$$(Tv)_i \le v_i + s_i$$
and so
$$z_1^* + \cdots + z_k^* \le z_1 + \cdots + z_k + m.$$
Also note that by the assumption that $v \in C$, we have $z_i \le a_i$ for all $i = 1, \dots, n-1$ and $z_n \le 0$.
We start with $z_1^*$.

We know that $z_1 \le a_1$. If $z_1 > a_2 + d$, then since $z_2 \le a_2$, $z_1$ is the top of a gap in v of length
> d, and by Lemma 4 we have $z_1^* \le z_1 \le a_1$. If $z_1 \le a_2 + d$, then

$$z_1^* \le z_1 + m \le z_1 + d \le (a_2 + d) + d = a_1.$$

Next we deal with $z_2^*$. If $z_2 > a_3 + d$, then since $z_3 \le a_3$, $\{z_1, z_2\}$ is the top of a gap in v of
length > d, and by Lemma 4 we have $z_1^* + z_2^* \le z_1 + z_2 \le 2a_2$. If $z_2 \le a_3 + d$, then

$$z_1^* + z_2^* \le z_1 + z_2 + m \le a_1 + (a_3 + d) + d = (a_2 + 2d) + (a_2 - 2d) = 2a_2.$$

Then $z_3^*$ is handled similarly: If $z_3 > a_4 + d$, then $\{z_1, z_2, z_3\}$ is the top of a gap of length > d
and we have $z_1^* + z_2^* + z_3^* \le z_1 + z_2 + z_3 \le 3a_3$. If $z_3 \le a_4 + d$, then

$$\begin{aligned}
z_1^* + z_2^* + z_3^* &\le z_1 + z_2 + z_3 + m \le a_1 + a_2 + (a_4 + d) + d \\
&= (a_3 + 6d) + (a_3 + 4d) + (a_3 - 10d) = 3a_3.
\end{aligned}$$


We continue in this fashion, dealing successively with $z_4^*, \dots, z_{n-1}^*$, and verifying that (3.8) is
satisfied for F(v) when $v \in C$.

This shows that the continuous function T maps the set C into itself and the proof of Lemma 3
and consequently the existence part of the proof of Theorem 1 is complete. □
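
The three displayed chains above all end in an exact equality of the form a_1 + ... + a_{k-1} + (a_{k+1} + d) + d = k·a_k. The following minimal sketch checks that identity numerically for the constants of (3.6). The function names `gap_constants` and `check_identity` are hypothetical and introduced only for this illustration; d is taken to be an integer so that the arithmetic is exact.

```python
from math import factorial


def gap_constants(n, d):
    """Return [a_1, ..., a_{n-1}] with a_{n-1} = 2*(n-1)!*d and a_k = 2*k!*d + a_{k+1}."""
    a = [0] * (n + 1)  # 1-based storage; a[n] = 0 is only a sentinel for the recursion
    for k in range(n - 1, 0, -1):
        a[k] = 2 * factorial(k) * d + a[k + 1]
    return a[1:n]


def check_identity(n, d):
    """Check a_1 + ... + a_{k-1} + (a_{k+1} + d) + d == k * a_k for k = 1, ..., n-2."""
    a = [None] + gap_constants(n, d)  # restore 1-based indexing
    return all(
        sum(a[1:k]) + a[k + 1] + 2 * d == k * a[k]
        for k in range(1, n - 1)
    )


print(gap_constants(4, 1))   # constants a_1, a_2, a_3 for n = 4, d = 1
print(check_identity(8, 5))  # identity for every k the proof uses when n = 8
```

The identity holds because the sum of i·i! for i = 1, ..., k−1 equals k! − 1, which is why the same cancellation appears at every step of the proof.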


Uniqueness of Ranking. We shall prove that a given tournament has at most one partial 
ranking. As in the proof of existence we shall first reduce the general case to the case when any 
two teams are similar. This requires several lemmas, starting with a result promised in Section 2. 


LEMMA 5. Let $R = (r_{ij})$ and $R^* = (r^*_{ij})$ be two result matrices that have the same schedule
matrix $M = (m_{ij})$ and the same score vector $s = (s_i)$. Then the comparability relations $\preceq$ and $\preceq^*$
defined by R and R* respectively are the same. Thus comparability depends only on M and s.


Proof. By reasons of symmetry it suffices to show that $T_i \succeq^* T_j$ implies $T_i \succeq T_j$, for any i, j.
This will follow if we show, for all i and j,

(3.9) if $T_i \prec T_j$, then $r^*_{ij} = 0$.

Because if $T_i \succeq^* T_j$, then by the definition of $\succeq^*$ and by (3.9) we find $i_1, \dots, i_k$ such that $i_1 = i$,
$i_k = j$, and

$$T_{i_1} \not\prec T_{i_2} \not\prec \cdots \not\prec T_{i_k}$$

and (since any two teams that have played each other are comparable) it follows that

$$T_{i_1} \succeq T_{i_2} \succeq \cdots \succeq T_{i_k}.$$

To prove (3.9), we assume by contradiction that $T_{j_0}$ is a maximal $T_j$ (in the partial ordering $\prec$)
such that (3.9) fails. Let E be the set of all j such that $T_j \equiv T_{j_0}$.

Using the fact (2.7) that $s_j = \sum_i r_{ji} = \sum_i r^*_{ji}$ for each j, and that $r_{ij} = m_{ij} = 0$ when i and j are
not comparable, we have

$$\sum_{j \in E}\ \sum_{T_i \preceq T_j \text{ or } T_i \succeq T_j} r_{ji} = \sum_{j \in E}\ \sum_{T_i \preceq T_j \text{ or } T_i \succeq T_j} r^*_{ji}$$

and consequently

(3.10) $$\sum_{j \in E} \sum_{T_i \succ T_j} r_{ji} + \sum_{j \in E} \sum_{i \in E} r_{ji} + \sum_{j \in E} \sum_{T_i \prec T_j} r_{ji} = \sum_{j \in E} \sum_{T_i \succ T_j} r^*_{ji} + \sum_{j \in E} \sum_{i \in E} r^*_{ji} + \sum_{j \in E} \sum_{T_i \prec T_j} r^*_{ji}.$$

Now the first term on the left in (3.10) is 0, and by maximality of $j_0$, the first term on the right is
also 0. The second term on both sides of (3.10) is equal to $\frac{1}{2} \sum_{i, j \in E} m_{ij}$, the number of games
between the teams in E, and so

(3.11) $$\sum_{j \in E} \sum_{T_i \prec T_j} r_{ji} = \sum_{j \in E} \sum_{T_i \prec T_j} r^*_{ji}.$$

However, each $r_{ji}$ on the left of (3.11) is equal to $m_{ji}$ (because $T_i \prec T_j$) and it follows that each
$r^*_{ji} = m_{ji}$, and so $r^*_{ij} = 0$ for all $j \in E$ and all $T_i \prec T_j$, contrary to the assumption that $r^*_{ij_0} \ne 0$ for
some $T_i \prec T_{j_0}$. □


LEMMA 6. If $P = (p_{ij})$ is a partial ranking of a tournament, then
$p_{ij} = 0$ if and only if $T_i \prec T_j$.
Proof. For each i and j such that $m_{ij} \ne 0$, let

$$r^*_{ij} = m_{ij} \cdot p_{ij}.$$

Since P satisfies (2.15), it follows that $R^* = (r^*_{ij})$ is a result matrix with the same M and s. So by
Lemma 5, the relation $\preceq^*$ defined by R* coincides with $\preceq$.
First assume that $T_i \prec T_j$. There are $i_1, \dots, i_k$ such that $i_1 = j$, $i_k = i$ and all $r_{i_1 i_2}, \dots, r_{i_{k-1} i_k}$ are
positive; moreover there is at least one t such that $T_{i_{t+1}} \prec T_{i_t}$. Therefore $T_{i_{t+1}} \prec^* T_{i_t}$, and in
particular, $T_{i_{t+1}} \not\succeq^* T_{i_t}$ and so $r^*_{i_{t+1} i_t} = 0$. Since $m_{i_{t+1} i_t} \ne 0$, we have $p_{i_{t+1} i_t} = 0$. Now using the basic
property of the p's ((2.12) or (2.13)), applied successively to $i_1, \dots, i_k$ we conclude that $p_{ij} = 0$.
For the converse, assume that $T_i \not\prec T_j$ and that $p_{ij}$ is defined. Then $T_i \succeq T_j$ and hence $T_i \succeq^* T_j$.
There are $i_1, \dots, i_k$ such that $i_1 = i$, $i_k = j$ and all $r^*_{i_1 i_2}, \dots, r^*_{i_{k-1} i_k}$ are positive. Thus all
$p_{i_1 i_2}, \dots, p_{i_{k-1} i_k}$ are nonzero, and it follows (by (2.12) or (2.13) again) that $p_{ij} \ne 0$. □


LEMMA 7. If $P = (p_{ij})$ is a partial ranking of a tournament $R = (r_{ij})$ and if E is an equivalence
class mod $\equiv$, then $P_E = (p_{ij})_{i, j \in E}$ is a ranking of the subtournament $R_E$.


Proof. We know that for each i,

$$\sum_{j=1}^{n} m_{ij}\, p_{ij} = \sum_{j=1}^{n} r_{ij}$$

and we have to show that for each $i \in E$,

(3.12) $$\sum_{j \in E} m_{ij}\, p_{ij} = \sum_{j \in E} r_{ij}.$$

However, if $j \notin E$ and $m_{ij} \ne 0$, then either $r_{ij} = 0$ and $p_{ij} = 0$ (if $T_i \prec T_j$), or $r_{ij} = m_{ij}$ and
$p_{ij} = 1$ (if $T_i \succ T_j$), and (3.12) follows. □
Lemma 7 reduces the problem of uniqueness of ranking to a single equivalence class E. Thus 
the following lemma will complete the proof: 
LEMMA 8. Assume that all teams are similar in a given tournament. If $P = (p_{ij})$ and $Q = (q_{ij})$
are rankings of the tournament, then P = Q.


Proof. For each i and j let

$$x_{ij} = \frac{p_{ij}}{p_{ji}}, \qquad y_{ij} = \frac{q_{ij}}{q_{ji}}.$$

Since all $T_i$ and $T_j$ are similar, all the p's and q's are nonzero and so $x_{ij}$ and $y_{ij}$ are positive
numbers; moreover, they satisfy the equations (2.12). Let

$$\theta_{ij} = \frac{y_{ij}}{x_{ij}}, \qquad \rho_{ij} = \frac{\theta_{ij}}{1 + \theta_{ij}}.$$

For all i, j, and k we have

(3.13) $$\theta_{ik} = \theta_{ij} \cdot \theta_{jk}$$

and so $\rho = (\rho_{ij})_{i,j}$ is a ranking matrix (i.e. it satisfies (2.13)).

Let us assume that $P \ne Q$. Then there exist i and j such that $\theta_{ij} > 1$. Since $T_i$ and $T_j$ are
comparable, there are $i_1, \dots, i_k$ such that $i_1 = i$, $i_k = j$ and the numbers $m_{i_1 i_2}, \dots, m_{i_{k-1} i_k}$ are all
nonzero. It follows from (3.13) that for some t, $\theta_{i_t i_{t+1}} > 1$. In other words, there exist i and j such
that $\theta_{ij} > 1$ and $m_{ij} > 0$.

Let $i_0$ be such that there exists some j with $\theta_{i_0 j} > 1$ and $m_{i_0 j} > 0$, and that $T_{i_0}$ has the highest
ranking by $\rho$ among $T_i$ for all such i. So if $\theta_{i_0 j} < 1$, then $m_{i_0 j} = 0$. It follows that whenever
$m_{i_0 j} \ne 0$ then $q_{i_0 j} \ge p_{i_0 j}$. And because for at least one j, $m_{i_0 j} \ne 0$ and $q_{i_0 j} > p_{i_0 j}$, we get

(3.14) $$\sum_j m_{i_0 j}\, q_{i_0 j} > \sum_j m_{i_0 j}\, p_{i_0 j},$$

which is a contradiction since both sides of (3.14) are equal to $s_{i_0}$. □


Proof of Theorem 3. Let $T_i$ and $T_j$ be two teams with identical schedules. First we assume that $T_i$
and $T_j$ are comparable.

If $T_i$ and $T_j$ have the same ranking (i.e., $p_{ij} = \tfrac{1}{2}$), then by (2.13), $p_{ik} = p_{jk}$ for all k (for which
$m_{ik} = m_{jk} \ne 0$) and so

$$s_i = \sum_k m_{ik}\, p_{ik} = \sum_k m_{jk}\, p_{jk} = s_j.$$


If 7, has higher ranking (p;; > 4) then by (2.13) again, p;, > p;, for all k (and since 7; = T; at 
least one m,;, 1S nonzero), so 


$$s_i = \sum_k m_{ik}\, p_{ik} > \sum_k m_{jk}\, p_{jk} = s_j.$$


We complete the proof by showing that if $s_i > s_j$, then $T_i$ and $T_j$ are comparable. If they were
not comparable, then each k with $m_{ik} \ne 0$ would satisfy simultaneously $T_k \succ T_i$ and $T_k \succ T_j$, or
simultaneously $T_k \prec T_i$ and $T_k \prec T_j$, and so either $p_{ik} = p_{jk} = 0$ or $p_{ik} = p_{jk} = 1$, which would
contradict $s_i \ne s_j$.

Now both parts of Theorem 3 follow. 0 


A final remark. We shall now prove that the uniqueness theorem is best possible: 


LEMMA 9. Let $R = (r_{ij})$ be a tournament and $P = (p_{ij})$ its unique partial ranking. Let $T_{i_0}$ and
$T_{j_0}$ be two incomparable teams and let $0 \le p \le 1$. Then there exists a ranking $P' = (p'_{ij})$ extending
P such that $p'_{i_0 j_0} = p$.

Proof. Let $E_1$ and $E_2$ be, respectively, the equivalence classes of $T_{i_0}$ and $T_{j_0}$ in the relation $\equiv$.
First we extend the partial ordering $\prec$ of all the equivalence classes to a linear ordering $\prec^*$ with
the stipulation that if p = 0, then we let $E_1 \prec^* E_2$; if p = 1, then we let $E_1 \succ^* E_2$; and if 0 < p < 1,
then we put $E_1$ and $E_2$ together and make it one class.

As long as $p_{ij}$ is defined we let $p'_{ij} = p_{ij}$. When $T_i \prec^* T_j$ we let $p'_{ij} = 0$ and $p'_{ji} = 1$. We also let
$p'_{i_0 j_0} = p$ and $p'_{j_0 i_0} = 1 - p$. Finally, if $T_i \equiv T_{i_0}$ and $T_j \equiv T_{j_0}$, we define $p'_{ij}$ using $p'_{i i_0} = p_{i i_0}$,
$p'_{i_0 j_0} = p$ and $p'_{j_0 j} = p_{j_0 j}$, so that the condition (2.13) is satisfied.

This way we obtain a ranking P' of R extending P. □


4. Some Practical Remarks and Examples. In this section I shall give several simple examples 
indicating how the method works, what it takes into account, and how it compares with methods 
used in practice. 


Example 1. A round-robin tournament of three teams. 


        A     B     C     S
   A    -     1    0.5   1.5
   B    0     -     1     1
   C   0.5    0     -    0.5


Each pair has played one game; A beat B and tied with C, and B beat C. The standard method
(comparing the total scores) gives the order A > B > C. In view of Theorem 3 this is the correct
order as it coincides with the one obtained by ranking.

To find $P = (p_{ij})$ in this case we have to find a solution of the equations

(4.1) $$p_{12} + p_{13} = 1.5, \qquad p_{21} + p_{23} = 1, \qquad p_{31} + p_{32} = 0.5,$$

satisfying (2.13).

The second equation implies that $p_{12} = 1 - p_{21} = p_{23}$ and so $x_{12} = x_{23}$ (where $x_{ij} = p_{ij}/p_{ji}$,
see (2.11)). So if we let $x = x_{12}$, we have $x_{23} = x$ and $x_{13} = x_{12} \cdot x_{23} = x^2$, and so
$p_{31} = 1/(1 + x^2)$ and $p_{32} = 1/(1 + x)$, and the third equation in (4.1) becomes

$$\frac{1}{1 + x^2} + \frac{1}{1 + x} = \frac{1}{2}$$

or

(4.2) $$x^3 - x^2 - x - 3 = 0.$$


The equation (4.2) has the (approximate) solution x = 2.13. This number indicates that the team 
A is 2.13 times better than the team B and B is 2.13 times better than C. The ranking matrix is 


(4.3)
        A      B      C
   A    -    .681   .819
   B  .319     -    .681
   C  .181   .319     -
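
The computation in Example 1 can be reproduced numerically. The sketch below is only an illustration, not the author's procedure: it finds the real root of (4.2) by bisection and then forms the expected results p_ij = a_i/(a_i + a_j) from the strengths (x², x, 1) implied by x_12 = x_23 = x. The helper names `solve_example_1` and `ranking_from_strengths` are hypothetical.

```python
def solve_example_1(tol=1e-12):
    """Bisection for the real root of x^3 - x^2 - x - 3 = 0 on [2, 3]."""
    f = lambda x: x**3 - x**2 - x - 3
    lo, hi = 2.0, 3.0  # f(2) = -1 < 0 and f(3) = 12 > 0, so a root lies between
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if f(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def ranking_from_strengths(a):
    """Matrix of p_ij = a_i / (a_i + a_j); diagonal entries are left as None."""
    n = len(a)
    return [
        [None if i == j else a[i] / (a[i] + a[j]) for j in range(n)]
        for i in range(n)
    ]


x = solve_example_1()
P = ranking_from_strengths([x * x, x, 1.0])  # strengths of A, B, C relative to C

print(round(x, 2))  # the ratio x of (4.2)
for name, row in zip("ABC", P):
    print(name, [None if p is None else round(p, 3) for p in row])
```

Each row of P sums to the corresponding score in (4.1), which is a quick way to confirm that the root is the right one.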
Example 2. An incomplete tournament of four teams


        A     B     C     D     S
   A                           1.5
   B                           2
   C                           1
   D                           0.5

The team A has only played two games and although its total score is less than B’s score, we 
would expect it to get a higher ranking than B (mainly because we would expect it at least to get a 
tie in its missing game with C). Our method confirms this expectation. 

This case would be difficult to solve algebraically; I have done it with a pocket calculator. The 
ranking matrix is 


(4.4) 


Note that (4.4) does not show the value of $p_{13}$ as it does not enter into the calculations.

In practice, when dealing with a large number of teams, exhibiting the ranking matrix is not the 
most convenient presentation of the ranking. There are several other possible ways that I shall now 
discuss (and mention another one in the next example). 

In the case when the tournament is an almost completed round-robin tournament as in this
example, we may use the $p_{ij}$'s as “expected results” in the missing game, and obtain the expected
scores of all teams. In this case, $p_{13} = .793$ and we get the following (approximate) expected
scores:


   A   2.3
   B   2
   C   1.2
   D   0.5


Another way is to assign the teams certain numbers on some linear scale. For instance, since the
numbers $x_{ij}$ indicate how much better the ith team is than the jth team, we may assign the last
team value 1 and then for each i we assign the ith team the unique value $a_i$ so that $x_{ij} = a_i/a_j$ for
all i and j. In our example we get


(4.5)
   A   9.78
   B   6.69
   C   2.56
   D   1
(For instance, C is 2.56 times better than D, and A is 9.78/2.56 = 3.82 times better than C.)

More practical is to use a logarithmic scale, that is, replace the numbers in (4.5) by their 
logarithms. The following is the ranking of the teams in our example on the logarithmic scale base 
2. The significance of base 2 is that a difference of one point on the ranking scale signifies twice as 
good performance. 


A 3.29 
B 2.74 
C 1.36 
D 0 
(C is $2^{1.36} = 2.56$ times better than D).
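
The passage between the linear scale (4.5), the base-2 logarithmic scale, and the expected results can be sketched in a few lines. The numbers are the ones quoted in Example 2; the dictionary names and the helper `expected_result` are hypothetical and serve only this illustration.

```python
import math

# Linear-scale values a_i from (4.5), with the last team D assigned the value 1.
linear = {"A": 9.78, "B": 6.69, "C": 2.56, "D": 1.0}

# Base-2 logarithmic scale: a difference of one point means "twice as good".
log_scale = {team: math.log2(a) for team, a in linear.items()}


def expected_result(a_i, a_j):
    """p_ij = x_ij / (1 + x_ij) with x_ij = a_i / a_j, i.e. a_i / (a_i + a_j)."""
    return a_i / (a_i + a_j)


p_AC = expected_result(linear["A"], linear["C"])  # expected result of the missing game A vs C

for team in linear:
    print(team, round(log_scale[team], 2))
print("p_AC", round(p_AC, 3))
```

Adding p_AC to A's actual score of 1.5 gives the expected score of about 2.3 listed for A, and 1 − p_AC added to C's score of 1 gives about 1.2.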
Example 3. Another tournament of four teams 
        A     B     C     D     W.    L.    Pct.
   A                            4     2     .667
   B                            4     2     .667
   C                            3     3     .500


Here A has played three times against B, winning 2 and losing 1, etc. Both A and B have the same 
number of wins and losses but A should be ranked higher, because it had a more difficult playing 
schedule (three games against C compared with B’s three games against D). This is confirmed by 
the actual ranking: 


The ranking on a logarithmic scale (base 2) is as follows: 


A 3.49 
B 3.12 
C 1.75 
D O 


indicating that A has done $2^{3.49 - 3.12} = 1.3$ times better than B.
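
For readers who want to recompute a ranking such as the one in Example 3, here is a hedged sketch. It does not use the map T of Section 3; instead it swaps in a standard fixed-point iteration, a_i ← s_i / Σ_j m_ij/(a_i + a_j), whose fixed points satisfy Σ_j m_ij p_ij = s_i with p_ij = a_i/(a_i + a_j). The schedule below is partly an assumption: the text gives three games A–B (A winning two), three games A–C and three games B–D; the three games C–D, and the scores 3 for C and 1 for D, are inferred from the win-loss records and are not stated in the source. The function name `solve_ranking` is hypothetical.

```python
import math


def solve_ranking(M, s, iterations=5000):
    """Iterate a_i <- s_i / sum_j m_ij / (a_i + a_j); rescale so the last team has strength 1."""
    n = len(s)
    a = [1.0] * n
    for _ in range(iterations):
        new = [
            s[i] / sum(M[i][j] / (a[i] + a[j]) for j in range(n) if M[i][j])
            for i in range(n)
        ]
        a = [value / new[-1] for value in new]
    return a


# Teams in the order A, B, C, D.
# ASSUMPTION: the C-D entries (3 games) are inferred, not given in the source.
M = [
    [0, 3, 3, 0],
    [3, 0, 0, 3],
    [3, 0, 0, 3],
    [0, 3, 3, 0],
]
s = [4, 4, 3, 1]  # wins; the values for C and D follow from the assumed schedule

a = solve_ranking(M, s)
log_scale = [math.log2(value) for value in a]
expected_scores = [
    sum(M[i][j] * a[i] / (a[i] + a[j]) for j in range(4)) for i in range(4)
]

for name, v in zip("ABCD", log_scale):
    print(name, round(v, 2))
```

Under these assumptions the computed logarithmic values agree with the ones quoted above to within about 0.01, which is the accuracy one would expect from a pocket-calculator computation.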

Often, performance in a tournament is measured by the winning percentage. In this case, both 
A and B have the same percentage .667, but since their schedules are different, this is not an 
accurate measure of their performance. This suggests yet another way in which the results of the 
ranking method can be expressed: we call the weighted percentage of a team $T_i$ the expected
percentage of $T_i$ in a round-robin tournament where all the results of $T_i$ are the expected results
$p_{ij}$, i.e.,

$$P_i = \frac{1}{n-1} \sum_{j \ne i} p_{ij}.$$

In the present example we get:
$P_A = \tfrac{1}{3}(.565 + .770 + .918) = .751$
$P_B = \tfrac{1}{3}(.435 + .721 + .897) = .684$
$P_C = \tfrac{1}{3}(.230 + .279 + .770) = .426$
$P_D = \tfrac{1}{3}(.082 + .103 + .230) = .138$
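
The weighted percentages are simply row averages of the ranking matrix, as the following short sketch shows. The entries are the p_ij values quoted in the four lines above; the names `P` and `weighted_percentage` are hypothetical.

```python
# Expected results p_ij for Example 3, read off the weighted-percentage computation.
P = {
    "A": {"B": 0.565, "C": 0.770, "D": 0.918},
    "B": {"A": 0.435, "C": 0.721, "D": 0.897},
    "C": {"A": 0.230, "B": 0.279, "D": 0.770},
    "D": {"A": 0.082, "B": 0.103, "C": 0.230},
}


def weighted_percentage(row):
    """Average expected result against every other team (one game each, round-robin)."""
    return sum(row.values()) / len(row)


weighted = {team: weighted_percentage(row) for team, row in P.items()}
for team, value in weighted.items():
    print(team, round(value, 3))
```

Unlike the raw percentages .667 and .667, the weighted percentages separate A from B, reflecting A's harder schedule.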
Example 4. Example of equal ranking 


All four teams are comparable (similar). Teams A and B have the same schedule and the same
score; similarly C and D. It follows that $p_{AB} = .5 = p_{CD}$, and for all i, $p_{iA} = p_{iB}$, $p_{iC} = p_{iD}$. Since

$$p_{AC} + p_{AD} = 1.5, \text{ etc.}$$

the ranking is


and so A = B > C = D.


Comparison with Other Methods. Theorems 1 and 2 should themselves be enough to convince 
the reader that the ranking method presented here is the only mathematically correct way of 
ranking tournaments. So I will only briefly touch upon the subject of comparison of our method 
with the methods used in practice. 

In college football, the ranking is done by opinion polls. In the Appendix, I give as an example 
the ranking of the 1978 season and compare it with the two (incompatible) rankings obtained by
the two opinion polls. 

In major league baseball, the division champions are the teams with the greatest percentage. 
This is fine as long as all the teams in the same division have the same schedule (see (2.3)). If this 
method is used for an incomplete season as it happened in the summer of this year (1981), the 
ranking so obtained need not agree with the ranking given by our method (see the example in 
Appendix). 

I shall now give an example showing that ranking by percentage is not at all a reasonable 
method. Consider a tournament of three teams, A, B and C. A plays B six times, winning 4 and 
losing 2, and plays C twice, beating it both times. Common sense dictates that A is the best team 
of the three, no matter how B does against C. (This is confirmed by our ranking method as shown 
below.) However, consider the three following cases: 


Percentage:
    A  .750        B  .765        A  .750
    B  .737        A  .750        B  .750
    C  .067        C  .000


In the first case A has a better percentage, in the second case B has a better percentage, and in the 
third case they both have .750. The correct ranking in each case is: 


with A best in all three cases. 


Tiebreaking. In some instances our method does not determine the winner of the given
tournament. This is when there are two teams with a perfect record, i.e., winning all their games.
In such a case, the method does not assign the two teams $T_i$, $T_j$ any number $p_{ij}$, because as proved
in Section 2, every number $p_{ij}$ between 0 and 1 satisfies the equations. A similar case is when there
are two teams which tie their game, and both win all other games. In this case they have equal
ranking (tie for first place).

Since these two cases are of considerable practical significance, one might desire to have a 
method that would decide which of the two teams is the champion. One method that suggests 
itself is to declare the winner the team with a more difficult playing schedule. (Another possibility 
would be to take into account the number of points scored in each game. This amounts to 
replacing the result matrix R by a different matrix, in effect, solving a different mathematical 
problem.) This can be determined either by considering the arithmetical mean of the opponents’
ranking on the logarithmic scale, or by comparing the medians of the two teams’ opponents, or 
possibly by some other, equally plausible, procedure. However, since the teams are actually 
incomparable (in the first case) or actually have equal ranking (in the other case), mathematics 
alone can’t decide which tiebreaking method is preferable. 


Appendix 1 The 1978 College Football Season 


In the adjoining table we give the list of the top forty teams in the ranking of the 177 college 
football teams in Divisions I-A and I-AA in the 1978 season. 


The 1978 College Football Season 


     Team (AP, UPI)                  W   L   T   log scale   x_{n,1}   x_{n,20}   x_{n,50}   x_{n,100}
 1.  Alabama (1, 2)                 11   1   0     8.833                37:1       290:1      2550:1
 2.  USC (2, 1)                     12   1   0     8.784     1:1.03     35         280        2450
 3.  Penn State (4, 4)              11   1   0     7.510     2.5        15         116        1000
 4.  Oklahoma (3, 3)                11   1   0     7.319     2.9        13         101        890
 5.  Michigan (5, 5)                10   2   0     6.259     6.0        6.1        49         430
 6.  Notre Dame (7, 6)               9   3   0     5.810     8.1        4.5        36         310
 7.  Nebraska (8, 8)                 9   3   0     5.804     8.2        4.5        35         310
 8.  Clemson (6, 6)                 11   1   0     5.505     10         2.6        29         250
 9.  Texas (9, 9)                    9   3   0     5.010     14         2.6        20         180
10.  Michigan State (12, -)          8   3   0     4.828     16         2.3        18         160
11.  Purdue (13, 13)                 9   2   1     4.685     1:18       2.1:1      16:1       144:1
12.  Arkansas (11, 10)               9   2   1     4.670     18         2          16         142
13.  Houston (10, 11)                9   3   0     4.628     18         2          16         138
14.  Missouri (15, 14)               8   4   0     4.541     20         1.9        15         130
15.  Maryland (20, -)                9   3   0     4.151     26         1.4        11         99
16.  Stanford (17, 16)               8   4   0     4.108     26         1.4        11         96
17.  Georgia (16, 15)                9   2   1     3.922     30         1.2        9.6        85
18.  Texas Tech                      7   4   0     3.882     31         1.18       9.3        82
19.  UCLA (14, 12)                   8   3   1     3.785     33         1.10       8.7        77
20.  Washington                      7   4   0     3.641     37                    7.9        70
21.  North Carolina St. (18, 19)     9   3   0     3.641     1:37       1:1        7.9:1      70:1
22.  Arizona State (-, 19)           9   3   0     3.530     39         1.08       7.3        65
23.  Texas A & M (19, 18)            8   4   0     3.321     46         1.2        6.3        56
24.  Ohio State                      7   4   1     3.134     52         1.4        5.6        49
25.  Iowa State                      8   4   0     2.901     61         1.7        4.7        42
26.  Louisiana State                 8   4   0     2.646     73         2          4          35
27.  Florida State                   8   3   0     2.595     75         2.1        3.8        34
28.  Pittsburgh                      8   4   0     2.431     85         2.3        3.4        30
29.  California                      6   5   0     2.426     85         2.3        3.4        30
30.  Southern Methodist              4   6   1     2.301     93         2.5        3.1        27
31.  North Texas St.                 9   2   0     2.207     1:99       1:2.7      2.9:1      26:1
32.  Georgia Tech                    7   5   0     1.977     116        3.2        2.5        22
33.  Navy (-, 17)                    9   3   0     1.775     133        3.6        2.2        19
34.  Mississippi State               6   5   0     1.726     138        3.8        2.1        18
35.  Auburn                          6   4   1     1.698     140        3.8        2.1        18
36.  Florida A & M                  12   1   0     1.678     142        3.9        2          18
37.  Arizona                         5   6   0     1.445     167        4.6        1.7        15
38.  Colorado                        6   5   0     1.400     173        4.7        1.7        15
39.  Tennessee                       5   5   1     1.332     181        5          1.6        14
40.  Tulsa                           9   2   0     1.214     197        5.4        1.5        13
The numbers in parentheses are the rankings by the Associated Press and the United Press
International opinion polls. The first three columns present the win-loss-tie record. The ranking
itself is presented in the next column. The numbers represent the ranking on a logarithmic scale
base 2 (the scale is calibrated so that the team ranked number 60 has ranking value 0). The p’s and
x’s can be computed from this scale. For instance


$$x_{1,2} = 2^{8.833 - 8.784} = 2^{0.049} = 1.034$$


and

$$p_{1,2} = \frac{x_{1,2}}{1 + x_{1,2}} = .508.$$
For each team in the table we also give four selected “odds” x;;. For the nth team we list the 
numbers 


$$x_{n,1},\quad x_{n,20},\quad x_{n,50} \quad\text{and}\quad x_{n,100}$$


comparing the team with the teams ranked number 1, 20, 50 and 100. 
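
As a minimal sketch of how the odds columns relate to the logarithmic scale, the snippet below recomputes a few entries from the ranking values in the table. The function names `odds` and `win_probability` are hypothetical, and small differences from the printed figures are to be expected, since the table values are rounded.

```python
def odds(v_i, v_j):
    """x_ij = 2 ** (v_i - v_j) for ranking values on the base-2 logarithmic scale."""
    return 2.0 ** (v_i - v_j)


def win_probability(v_i, v_j):
    """p_ij = x_ij / (1 + x_ij)."""
    x = odds(v_i, v_j)
    return x / (1.0 + x)


alabama, usc, washington = 8.833, 8.784, 3.641  # teams ranked 1, 2 and 20

print(round(odds(alabama, usc), 3))         # close to the quoted value 1.034
print(round(odds(alabama, washington)))     # the "37:1" entry in Alabama's row
print(round(win_probability(alabama, usc), 3))
```

The same two functions convert any pair of ranking values from the table into odds and expected results.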

Two technical remarks: 

1. All the 177 teams are comparable. One team had a perfect losing record, all other teams 
form one equivalence class. 

2. Some teams played against opponents from Divisions II or III. For the purpose of this 
ranking, these lower division opponents are identified and treated as one single team. This 
approximation introduces a slight error into the ranking of the bottom teams; it has, however, no 
discernible effect on the high ranked teams. [This hypothetical “average Division II /III opponent” 
is ranked number 155 (with ranking value — 3.685). The highest ranked team that played a lower 
division opponent is number 46, and we have p(46, 155) = .985, x(46, 155) = 67.] 


Appendix 2 The Split 1981 Baseball Season 


Due to a strike in the middle of the season it was decided that the 1981 season would be 
divided into two halves: the games played before the strike and the games played after the strike. 
The teams with the best winning percentage in each division were declared first-division winners. 
At the time of this writing, the second half is still being played. 

In the table on p. 265 we give the ranking of both leagues, based on their results in the first half 
of the 1981 season. In each league, we list the Eastern division teams first, in decreasing rank, 
followed by the Western division teams. The numbers displayed next to each team represent 

(1) the schedule, i.e., the number of games played against other teams in the league; 

(2) the won-lost record; 

(3) the ranking; the numbers $p_{ij}$ rounded off to three decimal places;

(4) the expected score: this is calculated from the $p_{ij}$, and is based on the complete schedules
of both leagues as displayed in (2.3).
In three cases the highest-ranked team in its division is the declared winner. Note, however, that in 
American League West, the declared winner (the team with the highest win-loss ratio) is ranked 
second by our method. 


ANSWER TO PHOTO ON PAGE 245 


Kurt Gödel.
Telegraphic Reviews


Edited by Lynn Arthur Steen, with the assistance of the Mathematics Departments 
of Carleton, Macalester, and St. Olaf Colleges. Books submitted for review 
should be sent to Book Review Editor, American Mathematical Monthly, St. Olaf
College, Northfield, Minnesota 55057. 


Telegraphic reviews are designed to give prompt notice of all new books in the mathematical 
sciences. Certain of these books will be selected for more extensive review in the Reviews section 
of the Monthly. 


Special Codes: 


T: Textbook 13-18: Grade Level 
S: Supplementary Reading 1-4 : Time in Semesters 
P: Professional Reading ** : Special Emphasis 
L: Undergraduate Library ?? : Questionable


General, P, L. The Voices of Time: A Cooperative Survey of Man's Views of Time as Expressed by the
Sciences and by the Humanities, Second Edition. Ed: J.T. Fraser. U of Massachusetts Pr, 1981, 1x +
710 pp, $12.50 (P). [ISBN: 0-87023-377-8] Though dealing largely with our idea and experience of
time in a cultural, psychological and philosophical sense, these essays also probe with technical
accuracy the problems inherent in scientific accounts of time. GHM


General, P, L. Humor in der Mathematik. Friedrich Wille. Vandenhoeck & Ruprecht, 1982, 120 pp, DM 
19,80 (P). [ISBN: 3-525-40726-2] An amusing collection of mathematical jokes, anecdotes, parodies 
and instructional stories, having fun with and poking fun at higher (and lower) mathematics and 
mathematicians. Includes some traditional and some original material including musical pieces like 
the Hauptsatzkantata (Fundamental Theorem of Calculus Cantata). GHM 


Precalculus, T(13: 1, 2). Concepts of Algebra & Trigonometry. Howard E. Campbell. Prindle, Weber 
& Schmidt, 1982, x + 645 pp. [ISBN: 0-87150-332-8] Standard precalculus topics plus complex numbers, 
elementary linear programming, sequences, series, and counting techniques. Early emphasis on graph- 
ing. Chapter summaries and review exercises. Many "word problems." JRG 


Precalculus, T(13: 1). Concepts of College Algebra. Howard E. Campbell. Prindle, Weber & Schmidt, 
1982, ix + 453 pp. [ISBN: 0-87150-325-5] An attractive and lucid presentation of the standard topics 
of college algebra including a chapter on systems of equations and inequalities, matrices and linear 
programming, and a chapter on sequences, series and probability. Includes optional calculator exam- 
ples and exercises and chapter reviews. JNC 


Education, T(15-17: 2), P. Basic Concepts of Elementary Mathematics, Fourth Edition. John M. Peter-
son, James E. Smith. Prindle, Weber & Schmidt, 1982, vii + 502 pp. [ISBN: 87150-333-6] For pre-service
and in-service elementary teachers; this edition integrates problem solving throughout the
text, adds several geometric topics and a new chapter on consumer mathematics. (First Edition, TR, 
May 1971; Second Edition, TR, February 1975; Third Edition, TR, March 1979.) JNC 


History, P, L. The Boole-De Morgan Correspondence 1842-1864. G.C. Smith. Clarendon Pr, 1982, 156
pp, $44. [ISBN: 0-19-853183-4] Chronologically arranged, slightly edited texts of 90 letters between 
two of the grandfathers of modern symbolic logic. Includes thumbnail biographies of Boole and 
DeMorgan and introductions and commentary on the subject matter of each group of letters (which 
ranged widely). GHM 


History, L*. Books IV to VII of Diophantus' Arithmetica. Jacques Sesiano. Transl: Qusta ibn Luqa.
Sources in History of Math. & Physical Sci., V. 3. Springer-Verlag, 1982, xii + 502 pp, $72. [ISBN:
0-387-90690-8] Text, English translation, and commentary on this document extant only in a recently
discovered Arabic translation. Lengthy introduction discusses the history of the Arithmetica and 
the Arabic text. JRG 


History, L*. Descartes on Polyhedra: A Study of the De Solidorum Elementis. P.J. Federico. Sources 
in History of Math. & Physical Sci., V. 4. Springer-Verlag, 1982, ix + 145 pp, $36. [ISBN: 0-387-
90760-2] First English translation of Descartes' general treatment of polyhedra, arrived at by anal-
ogy with plane figures. Contents include a facsimile of the manuscript with transcription, transla-
tion and commentary, both on this work and comparisons with Euler. JRG


History, P, L*. The History of Combinatorial Group Theory: A Case Study in the History of Ideas. 
Bruce Chandler, Wilhelm Magnus. Studies in History of Math. & Physical Sci., V. 9. Springer- 
Verlag, 1982, viii + 234 pp, $46. [ISBN: 0-387-90749-1] A detailed survey of the development of the 
theory of groups defined by generators and relations, beginning with the 1882 paper by Walther von 
Dyck and extending into the mid-1970's. Includes a comprehensive bibliography, an invaluable
resource for anyone searching for the roots of modern group theory. Concludes with chapters 
discussing in general terms the sociology of mathematics research--migration, collaboration, publi-
cation patterns. LAS


Foundations, P. Logic Colloquium '80. Ed: D. van Dalen, D. Lascar, T.J. Smiley. Stud. in Logic &
Found. of Math., V. 108. Elsevier North-Holland, 1982, x + 342 pp, $55.75. [ISBN: 0-444-86465-2] 
Research papers originally intended for 1980 European summer meeting of Association for Symbolic 
Logic in Prague, Czechoslovakia. The local organizers cancelled the meeting at the last minute due 
to unexplained "circumstances lying beyond the control of the organizing committee." GHM 


Foundations, P. Proceedings of the Herbrand Symposium: Logic Colloquium '81. Ed: J. Stern. Stud.
in Logic & Found. of Math., V. 107. Elsevier North-Holland, 1982, xi + 384 pp, $60.50. [ISBN: 0- 
444-86417-2] Invited papers on impact of work and ideas of J. Herbrand (1908-1931). Contributed 
research papers range from set theory to proof theory and computer science. GHM 


Combinatorics, P. Lecture Notes in Mathematics-829: Combinatorial Mathematics VII. Ed: R.W. Robin-
son, G.W. Southern, W.D. Wallis. Springer-Verlag, 1980, x + 256 pp, $18 (P). [ISBN: 0-387-10254-X]
Proceedings of the Seventh Australian Conference on Combinatorial Mathematics. JRG 


Number Theory, T(17: 1), S*, P, L*. Lectures on the Theory of Algebraic Numbers. Erich Hecke.
Grad. Texts in Math., No. 77, Springer-Verlag, 1981, xii + 239 pp, $36. [ISBN: 0-387-90595-2] A 
recent translation of the author’s 1923 classic. It is yet a precise and fresh introduction to the 
subject. Does not include exercises. CEC 


Number Theory, S(18), P. Lecture Notes in Mathematics-927: The Trace Formula and Base Change for 
GL(3). Yuval Z. Flicker. Springer-Verlag, 1982, xii + 204 pp, $12.50 (P). [ISBN: 0-387-11500-5] A 
new way of writing the non-elliptic terms of the trace formula of Arthur, Jacquet and Langlands is
applied to the case of GL(3) in order to study the base change problem. CEC


Algebra, T(18: 1), S, P. The Structure of Real Semisimple Lie Groups. Ed: T.H. Koornwinder. MC 
Syllabus 49. Math Centrum, 1982, v + 141 pp, Dfl. 17,85 (P). [ISBN: 90-6196-239-0] A joint effort 
of five contributors. The principal aim is an introduction to the structure theory of real semisim-
ple Lie groups for one who is already fairly well-versed in the general complex theory. Three
chapters on groups include Cartan, Iwasawa, Bruhat decompositions, Tits systems, Furstenberg boun-
dary. Two chapters on semisimple algebras include classification for the real case. References,
index, diagrams. JS 


Calculus, T(13: 2), S. Calculus by Calculator: Solving Single-Variable Calculus Problems with the
Programmable Calculator. Maurice D. Weir. Prentice-Hall, 1982, ix + 387 pp, $15.95 (P). [ISBN: 0- 
13-111922-2] Introductory calculus text designed to be used with the Texas Instruments TI-59 pro- 
grammable calculator. Fully utilizes the calculator’s capabilities, but in a rather cookbook style. 
RSK 


Real Analysis, T**(17: 1, 2), S*, L. Foundations of Modern Analysis. Avner Friedman. Dover Pub, 
1982, vi + 250 pp, $5.50 (P). [ISBN: 0-486-64062-0] Welcome unabridged, corrected republication of 
the 1970 edition (TR, January 1971). Excellent buy for use as text or as reference. Self-contained 
treatment of Lebesgue integration. Measure, metric spaces, functional analysis in Banach spaces and 
spectral theory in Hilbert spaces. Applications to topics in differential equations. JK 


Complex Analysis, T(17: 1), S, P, L. Starting with the Unit Circle: Background to Higher Analysis. 
Loo-keng Hua. Trans: Kuniko Weltin. Springer-Verlag, 1981, xi + 179 pp, $32. [ISBN: 0-387-90589-8] 
An introduction to harmonic analysis in several variables. The exposition is explicit and elemen- 
tary. Includes discussions of the Dirichlet problem on the unit ball in n-dimensions, the Lorentz 
group, the axioms of special relativity, and partial differential equations of mixed type. Very few 
exercises. CEC 


Complex Analysis, T*(15: 1, 2), L. Complex Analysis. Joseph Bak, Donald J. Newman. Undergrad. 
Texts in Math. Springer-Verlag, 1982, x + 244 pp, $19.80. [ISBN: 0-387-90615-0] A carefully written 
and illustrated text containing all the essential topics for a first course plus enough additional 
topics and applications for a solid one-year treatment. Abundant, well chosen exercises. TAV 


Complex Analysis, T??(18: 2), P. Lectures on Riemann Surfaces. Otto Forster. Transl: Bruce Gilli-
gan. Grad. Texts in Math., No. 81. Springer-Verlag, 1981, viii + 254 pp, $35. [ISBN: 0-387-90617-
7] A formal treatment in the definition-theorem-proof format. A few exercises have been added to
this translation along with minor revisions. The translation is very smooth, although the treatment 
is a bit lifeless and sterile. TAV 


Differential Equations, S(18), P. Nonlinear Partial Differential Equations and their Applications. 
Collége de France Seminar, Volume III. Ed: H. Brezis, J.L. Lions. Research Notes in Math., No. 70. 
Pitman Pub, 1982, 432 pp, $24.95 (P). [ISBN: 0-273-08568-9] Twenty written lectures from the 1980- 
81 weekly Seminar on Applied Mathematics at the Collége de France. The applications are to various 
fields: control theory, physics, dynamical systems, etc. PZ 


Differential Equations, P*. Volterra and Functional Differential Equations. Ed: Kenneth B. 
Hannsgen, et al. Lect. Notes in Pure & Appl. Math., V. 81. Dekker, 1982, xii + 333 pp, $45 (P). 
[ISBN: 0-8247-1721-X] Full contents of the 24 invited lectures at the conference on Volterra and 
functional differential equations held at Virginia Polytechnic Institute and State University in 
Blacksburg, June 1981. Ideal guide and unique reference for current knowledge for researchers,
mathematicians, and computer scientists. JK 


Differential Equations, T(17: 1, 2), S*, P*. Introduction to the Theory of Linear Partial Differen- 
tial Equations. Jacques Chazarain, Alain Piriou. Stud. in Math. & Its Appl., V. 14. Elsevier 
North-Holland, 1982, xiv + 559 pp, $81.50. [ISBN: 0-444-86452-0] Introduction to a variety of recent 
methods. Applications to numerous classic problems. Detailed proofs. Flexible. Suitable for
introductory course as well as for advanced students and research workers. Formerly journal- 
dispersed results are included. JK 


Differential Equations, T(16), S. Equations Différentielles. Hervé Reinhard. Gauthier-Villars,
1982, xiv + 446 pp, (P). [ISBN: 2-04-015431-0] Attempts to bridge the gaps between the classical 
"great works" in differential equations and introductory texts in the subject. Emphasis on stabil- 
ity theory and operator theory. Appropriate for a senior-level seminar or as supplementary reading. 


Differential Equations, S(18), P. Local Bifurcation and Symmetry. A. Vanderbauwhede. Research
Notes in Math., No. 75. Pitman Pub, 1982, 350 pp, $23.95 (P). [ISBN: 0-273-08569-7] A study of the
influence of symmetric properties of mappings on bifurcation properties. Applications to O(2) and
SO(2) symmetry, including Hopf bifurcation. An extensive bibliography of 273 references. JG


Differential Equations, P, L. The Lorenz Equations: Bifurcations, Chaos, and Strange Attractors. 
Colin Sparrow. Appl. Math. Sci., No. 41. Springer-Verlag, 1982, xii + 269 pp, $19.80 (P). [ISBN: 
0-387-90775-0] An informative case study in the mathematics of turbulence: a detailed, computer- 
assisted analysis of the equations x' = s(y - x), y' = rx - y - xz, z' = xy - bz (introduced in 1963
by E.N. Lorenz in the study of atmospheric dynamics) as an archetype of important current research 
into chaotic behavior and strange attractors. LAS 


Analysis, T(18: 2), P. An Introduction to Ergodic Theory. Peter Walters. Grad. Texts in Math., 
No. 79. Springer-Verlag, 1982, ix + 250 pp, $28. [ISBN: 0-387-90599-5] A revision and expansion of 
Springer Lecture Notes No. 458. The material is updated and chapters on ergodic theory of continuous
transformations of compact metric spaces and other recent results are added. TAV


Analysis, T(18: 3), L. Ergodic Theory. I.P. Cornfeld, S.V. Fomin, Ya. G. Sinai. Grund. der math. 
Wissenschaften, B. 245. Springer-Verlag, 1982, x + 486 pp, $48. [ISBN: 0-387-90580-4] In four
parts: Part 1 presents elementary (but not simple) dynamical systems and investigates their ergodi- 
city properties; Part 2 is devoted to abstract ergodic theory emphasizing entropy; Part 3 treats 
spectral theory; and Part 4 discusses approximation. Contains appendices and a substantial bibliog- 
raphy. TAV 


Analysis, P. Classification Problems in Ergodic Theory. William Parry, Selim Tuncel. London Math. 
Soc. Lect. Note Ser., No. 67. Cambridge U Pr, 1982, 101 pp, $14.95 (BP). [ISBN: 0-521-28794-4]
Presumes measure and integration and a familiarity with functional analysis through Hilbert spaces, 
as well as the classical ergodic theory. By classifying measure-preserving transformations in terms 
of how they "respect" structure, the authors investigate their applications. TAV


Analysis, S(18), P. Differential Calculus and Holomorphy: Real and Complex Analysis in Locally Con- 
vex Spaces. Jean Francois Colombeau. Math. Stud., No. 64. Elsevier North-Holland, 1982, xii + 455 
pp, $65 (P). [ISBN: 0-444-86397-4] Presents aspects of real and complex analysis in locally convex 
spaces, usually either Banach or satisfying some nuclearity assumption. Many classical results and 
ideas, e.g., Taylor’s theorem, mean value theorem, holomorphy, pseudoconvexity, convolution, are 
generalized from the finite-dimensional setting, with new and different proofs. Hitherto unavailable
in book form; accessible to nonspecialists. PZ


Analysis, S*(18), L. Problems in Analysis. Bernard Gelbaum. Springer-Verlag, 1982, vii + 228 pp,
$28. [ISBN: 0-387-90692-4] For students with a background in real analysis, measure, topology, and
topological vector spaces; 518 assorted nontrivial problems chosen from these areas followed by 
careful complete solutions. The third volume in this fascinating series. TAV 


Algebraic Geometry, P. Lecture Notes in Mathematics—961: Algebraic Geometry. Ed: J.M. Aroca, et 
al. Springer-Verlag, 1982, x + 500 pp, $25 (P). [ISBN: 0-387-11969-8] Proceedings of a 1981
conference held in Spain containing 19 papers on various aspects of algebraic geometry. SG 


Geometry, T**(14-17: 1), L. Transformation Geometry: An Introduction to Symmetry. George E. Martin.
Undergrad. Texts in Math. Springer-Verlag, 1982, xii + 237 pp, $28. [ISBN: 0-387-90636-3]
This study of the automorphisms of the Euclidean plane and space is based on elementary high school 
geometry but done with a mathematical thoroughness that makes this an excellent pre-requisite to a 
standard abstract algebra course and outstanding preparation for prospective high school teachers. 
Teachers manual available. JNC 


Geometry, S(15-17), L. The Fifty-Nine Icosahedra. H.S.M. Coxeter, et al. Springer-Verlag, 1982,
29 pp, $12 (P). [ISBN: 0-387-90770-X] A reprinting of the 1938 edition which was originally published
as No. 6 of University of Toronto Studies (Mathematical Series); consists of an enumeration
and description of the polyhedra that can be derived from the five Platonic solids by stellation. 
JNC 


Geometry, T(15-17: 1), L. The Foundations of Geometry and the Non-Euclidean Plane. George E. Martin.
Undergraduate Texts in Math. Springer-Verlag, 1975, xvi + 509 pp, $24. [ISBN: 0-387-90694-0]
A reprinting of an excellent geometry text (TR, October 1976; Extended Review, June-July 1977). JNC 


Algebraic Topology, P. The Symplectic Cobordism Ring II. Stanley O. Kochman. Memoirs No. 271.
AMS, 1982, vii + 170 pp, $10 (P). 


Topology, S(18), P. Dynamical Systems on Surfaces. C. Godbillon. Transl: H.G. Helfenstein. 
Universitext. Springer-Verlag, 1983, 201 pp, $19.80 (P). [ISBN: 0-387-11645-1] An analysis of local 
behavior of singular points and periodic orbits, as well as Poincaré-Bendixson theory and direction 
fields on surfaces. JG 


Topology, T*(15-17: 1), S, L. Surface Topology. P.A. Firby, C.F. Gardiner. Series in Math. & Its 
Applic. Ellis Horwood Pub, 1982, 216 pp, $54.95. [ISBN: 0-85312-483-3] An appealing intuitive 
approach to many sophisticated aspects of two and three dimensional geometric topology, including 
the Euler characteristic, complexes, graphs, vector fields, index theorem, tessellations and the 
fundamental group. Based on a course at Exeter University, this book is distinguished both by a 
wealth of diagrams and by extensive linkages among topics. LAS 


Operations Research, P, L. Foundations of Decision Support Systems. Robert H. Bonczek, Clyde W. 
Holsapple, Andrew B. Whinston. Oper. Res. & Indus. Eng. Academic Pr, 1981, xvii + 393 pp, $29.50. 
[ISBN: 0-12-113050-9] Interdisciplinary and generalized approach to a theory of decision support 
systems. Integrates concepts and tools from various fields, particularly data base management, 
linguistics and artificial intelligence. JRG 


Operations Research, P. Deterministic and Stochastic Scheduling. Ed: M.A.H. Dempster, J.K. Lenstra, 
A.H.G. Rinnooy Kan. Math. & Physical Sci. D Reidel Pub, 1982, xii + 419 pp, $48. [ISBN: 90-277- 
1397-9] Proceedings of NATO Advanced Study and Research Institute, 1981. Part I reviews the state 
of the art with respect to scheduling models and the interface between deterministic and stochastic 
models. Part II presents recent results and directions for further research. JRG 


Operations Research, T(17: 2), P, L*. The Single Server Queue, Revised Edition. J.W. Cohen. Appl.
Math. & Mech., V. 8. Elsevier North-Holland, 1982, xiv + 694 pp, $85.25. [ISBN: 0-444-85452-5] In 
four parts: Part 1 covers the basic ideas of stochastic processes needed for the rest; Part 2 
describes the model and the variety of methods used to analyze the queue; Part 3 covers variants, 
bulk queues, priorities, finite waiting space, etc.; and Part 4 covers recent developments. Exten- 
sive bibliography and update on literature. A massive, definitive treatment. (First Edition, TR, 
February 1970.) TAV 


Operations Research, T(14-15: 2). Discrete and Dynamic Decision Analysis. J.T. Buchanan. Wiley, 
1982, xii + 260 pp, $36.95. [ISBN: 0-471-10130-3] Covers both decision analysis and dynamic program- 
ming. Does not assume knowledge of probability or statistics. Demonstrates methods for structuring 
and solution of decision problems. Modest number of exercises. JRG 


Optimization, T(15-16). Méthodes explicites de l’optimisation. Jean-Pierre Aubin, Pierre 
Nepomiastchy, Anne-Marie Charles. Dunod, 1982, 287 pp, 200 FF. [ISBN: 2-04-015402-7] Extensive 
treatment of quadratic programming with applications to economics; introduction to convex analysis. 
JRG 


Probability, P. The Multiple Stochastic Integral. David Douglas Engel. Memoirs No. 265. AMS,
1982, v + 82 pp, $5 (P). The author’s doctoral thesis (Yale 1979). The main result represented is 
that the multiple stochastic integral as introduced by Ito is a countably additive extension of the 
L2.Asvalued set function xX). TAV 


Probability, T(18: 2), P*. Lectures from Markov Processes to Brownian Motion. Kai Lai Chung.
Grund. der math. Wissenschaften, B. 249. Springer-Verlag, 1982, viii + 239 pp, $34. [ISBN: 0-387- 
90618-5] An impressive piece of work. Chung begins with the familiar properties of Markov 
processes, extends and generalizes through martingales, Feller and Hunt processes to Brownian 
motion, ending with areas of future study. Challenging but rewarding reading. TAV 


Probability, P. Markov Random Fields. Yu. A. Rozanov. Transl: Constance M. Elson. Springer- 
Verlag, 1982, ix + 201 pp, $42. [ISBN: 0-387-90708-4] In an attempt to formulate the Markov property 
for generalized random functions of several variables, the author introduces and develops the theory 
of random fields on continuous time domains. TAV 


Statistics, P. Lecture Notes in Statistics-10: Fitting Linear Models: An Application of Conjugate
Gradient Algorithms. Allen McIntosh. Springer-Verlag, 1982, vi + 200 pp, $12 (P). [ISBN: 0-387- 
90746-7] Presents theory and illustrates by examples (with computer output) the use of conjugate 
gradient algorithms to fit a variety of linear models. These algorithms require much less computer 
memory than more commonly used algorithms and so are more appropriate for small computers. RSK 


Statistics, T(15-16: 2). Elements of Probability and Mathematical Statistics. Frederick H. Steen. 
Duxbury Pr, 1982, xiii + 505 pp. [ISBN: 0-87872-299-8] Fairly traditional text with a broad coverage 
of theoretical topics. Includes chapters on Bayesian inference and nonparametric statistics. RSK 


Statistics, T*(16-17: 1, 2). Linear Models: An Introduction. Irwin Guttman. Wiley, 1982, x + 358
pp, $36.95. [ISBN: 0-471-09915-5] In the Wiley Series in Probability and Mathematical Statistics. 
Clearly written theoretical presentation of the "essential concepts of regression, least squares, 
and linear models." Presumes courses in statistics and linear algebra. RSK 


Computer Programming, S, P, L*. Basic BASIC-English Dictionary for the Apple, PET and TRS-80. Larry
Noonan. Dilithium Pr, 1982, 150 pp, $10.95 (P). [ISBN: 0-918398-54-1] A convenient aid for 
translating among the three Basic languages used by Apple II, PET and TRS-80. Part I is a diction- 
ary of commands, functions, etc., giving English definitions and the corresponding terms in each of 
the three dialects; Part II is a summary of terms; appendices explain similarities and differences 
of ASCII codes, key words, graphics and Boolean operators. JNC 


Computer Programming, S(13-14). Some Common BASIC Programs: IBM Personal Computer Edition. Lon
Poole, Mary Borchers, Peter M. Burke. Osborne/McGraw-Hill, 1982, vii + 212 pp, $14.99 (P). [ISBN:
0-931988-83-7] A collection of 76 practical programs written in Basic for the I.B.M. Personal Computer;
includes programs in consumer mathematics, elementary statistics, matrix calculations and
numerical integration. JNC


Computer Science, T(17: 1), S. Algorithmic Language and Program Development. F.L. Bauer, H. 
Wössner. Texts & Mono. in Comp. Sci. Springer-Verlag, 1982, xvi + 497 pp, $29. [ISBN: 0-387-
11148-4] This book presents a formal description of the fields of problem solving and computer pro- 
gramming. Rather than looking at a particular language and its accompanying syntax, it takes a 
language-independent, algorithmic approach to programming. It investigates the mathematical foundations
of such common language constructs as subprograms, variables, data types, and control structures.
It assumes a very high level of mathematical sophistication on the part of the reader, especially
in the areas of logic, algebra, and the predicate calculus. MS


Control Theory, T(17-18: 1). Introduction to Optimal Control Theory. Jack Macki, Aaron Strauss. 
Undergrad. Texts in Math. Springer-Verlag, 1982, xiii + 165 pp, $24. [ISBN: 0-387-90624-X]
Emphasis on motivation and explanation with many examples. One simple canonical example (the rocket 
car) is carried through the text. Discussion includes linear autonomous time-optimal control problems,
existence theorems, and necessary conditions for optimal controls. Modest number of exercises.
JRG


Applications (Actuarial Science), P?. Limits of Insurability of Risks. Baruch Berliner. 
Prentice-Hall, 1982, x + 118 pp, $14.95. [ISBN: 0-13-536789-1] Mostly non-mathematical discussion of 
what risks are still insurable. RSK 


Applications (Biology), P. Lecture Notes in Medical Informatics-17: Biomedical Images and Comput- 
ers. Ed: J. Sklansky, J.-C. Bisconte. Springer-Verlag, 1982, vii + 329 pp, $21 (P). [ISBN: 0-387-
11579-X] Proceedings of United States-France Seminar on Biomedical Image Processing. Papers on 
microscopic image analysis, radiological image analysis, tomography, image processing technology. 
JRG 


Applications (Engineering), T(16-17: 1), P**. Problems in Structural Analysis by Matrix Methods. 
P. Bhatt. Construction Pr, 1981, 465 pp, $22.50 (BP). [ISBN: 0-86095-881-7] Unified approach via 
matrix methods. Digital-computer-oriented. Numerous examples with complete solutions paralleling 
steps in a computer program. Covers elastic analysis, elastic-plastic analysis, elastic stability 
plus natural frequency calculations. Presupposes a first course in structural analysis. Exercises 
with answers. JK 


Applications (Engineering), T*(17). Variational Methods in Theoretical Mechanics, Second Edition. 
J.T. Oden, J.N. Reddy. Universitext. Springer-Verlag, 1983, xi + 309 pp, $20 (P). [ISBN: 0-387- 
11917-5] Revised and corrected edition (First Edition, TR, June-July 1977). Out-of-date sections
deleted. Textbook for graduate students in mathematics and engineering science. Covers essentials 
of modern variational theory. JK 


Reviewers 


RJA: Richard J. Allen, St. Olaf; PB: Peder Bolstad, St. Olaf; RB: Richard Brown, Carleton; JNC: 
Judith N. Cederberg, St. Olaf; CEC: Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Carleton;
JRG: Jennifer R. Galovich, St. Olaf; SG: Steven Galovich, Carleton; JG: Jack Goldfeather, Carleton; 
PH: Paul Humke, St. Olaf; RBK: Roger B. Kirchner, Carleton; RSK: Richard S. Kleber, St. Olaf; JK: 
Joseph Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; GHM: George H. Mills, Carleton; RWN:
Richard W. Nau, Carleton; AO: Arnold Ostebee, St. Olaf; AWR: A. Wayne Roberts, Macalester; MS: 
Michael Schneider, Macalester; JS: John Schue, Macalester; SS: Seymour Schuster, Carleton; JAS: J. 
Arthur Seebach, Jr., St. Olaf; KS: Kay Smith, St. Olaf; LAS: Lynn Arthur Steen, St. Olaf; TAV: 
Theodore A. Vessey, St. Olaf; MW: Martha Wallace, St. Olaf; FLW: Frank L. Wolf, Carleton; PZ: Paul 
Zorn, St. Olaf. 


The Mathematical Association of America
The Sixty Sixth Annual Meeting of the Association 
Denver, Colorado 


The Sixty Sixth Annual Meeting was held in Denver, Colorado during the period January 4-9, 
1983. There were 2326 registrants including 1227 members of the Association. The Program Committee 
consisted of: C.E. Burgess; Ronald L. Graham; Ruth I. Hoffman; Kenneth I. Gross; John H. Hodges; 
Gary Bitter and Stanley P. Gudder. The talks for which abstracts were submitted consisted of the 
following: 


MAA/NCTM Panel Discussion: Please Professor, Let's Work Together on This! William E. Briggs, William
Gox, Helen Holgren, Terry Kret, Billy E. Rhoades, Ronald Schnackenberg, Donald B. Small. 


A survey by the MAA Committee on Secondary School Lectures has shown that there are approxi- 
mately one hundred and twenty existing lectureship programs, suggesting that there is substantial 
interest among college/university mathematics faculty members in such programs. It is likely that 
most faculty members see these programs as a way to interact with high school mathematics teachers 
and students, i.e., to help and support high school teachers in their efforts to stimulate interest 
in mathematics among their students. On the other hand, most of the existing programs are very
small. This suggests that high school teachers do not share the interest and sense of importance of
lectureship programs that are found among college faculty. It also suggests that there may be other
ways in which college and high school mathematics teachers can establish a two-way communication and 
support system that will benefit the high school students. The latest NRC study entitled "The State 
of School Science" delineates a number of serious problems with the present situation in high school 
mathematics. The purpose of this session was to hear directly from a panel of high school and col- 
lege mathematics teachers about their perspectives on the problems to be faced and then discuss, in 
small groups, the issues raised by the panel members.


Nonassociative Algebras, the First 101 Years, by Marvin L. Tomber, Michigan State University. 


In 1847 Cayley published the equation (i(α) i(β)) i(γ) = i(α) (i(β) i(γ)). This is the first
public announcement of the existence of a nonassociative product. The genesis of the Cayley-Graves
numbers is traced from Hamilton’s treatment of the complex numbers. After the beginning of a gen- 
eralized concept of algebra the study of associative and Lie algebras was initiated. In the early 
and mid 1930's, Zorn and Moufang began their study of alternative algebras and Jordan initiated his
investigations which would lead to Jordan algebras. Early work on alternative and Jordan algebras
is described along with other developments. The 101 years end in 1948 with Albert's paper on
power-associative rings which played an important role in subsequent developments.


How to Throw Small Matrices Away, or, Just What Did Brown, Douglas, and Fillmore Do in 1973?, by
Paul R. Halmos, Indiana University.


Operator theory can be viewed as a generalization of finite-dimensional linear algebra. The 
most famous theorem of linear algebra is the diagonalizability of real symmetric matrices--a result 
whose infinite-dimensional extension encounters many difficulties, both technical and conceptual. 
The purpose of this lecture is to place the familiar theorem in its proper algebraic context, extend 
the context to infinite-dimensional spaces, and then "throw small matrices away" (that is, form the 
appropriate quotient algebras). What is left is one of the biggest operator-theoretic steps forward 
in the 1970's, namely the celebrated theorem of Brown, Douglas, and Fillmore.


Nerve Conduction and Cardiac Fibers: Some Qualitative Problems in Differential Equations, by Jane 
Cronin Scanlon, Rutgers University. 


The Hodgkin-Huxley equations of nerve conduction and the Noble equations for the cardiac Purk- 
inje fiber were described and compared. Mathematical and physiological reasons were given for 
studying these equations from the singular perturbation viewpoint, especially studying discontinuous 
solutions of the corresponding degenerate systems. Results for the Noble equations were obtained by 
using this viewpoint and applying work of Mishchenko and Rozov. Unsolved mathematical questions 
which arise were described. 


MAA Workshop: High Level Languages--Why PASCAL?, by John L. Van Iwaarden, Hope College. 


The high-level computer language PASCAL is increasing dramatically in popularity as a first 
language for students in mathematics and computer science as well as engineering and other physical 
sciences. The literature now abounds with references to PASCAL. Mathematicians are well directed 
to learn something of the unique facets of this language. A workshop entitled "High Level 
Languages--Why PASCAL?" was presented by Professor John L. Van Iwaarden of Hope College to explain 
the syntax structure of the language and detail some of its unique data structures and its versatil- 
ity. Comparisons of PASCAL with BASIC, COBOL, and FORTRAN were given. 


A Systematic Method for Teaching Mathematical Proofs, by Daniel Solow, Case Western Reserve Univer- 
sity. 


An effective and systematic approach to teaching mathematical proofs and theoretical reasoning 
was presented. The approach is based on the observation that all proofs repeatedly make use of a 
limited number of different proof techniques, each of which was described. Then it was shown how a
proof can be explained as a sequence of applications of the individual techniques, and how a correct 
technique can often be chosen based on certain key words appearing in the problem under consideration.
Benefits of using the proposed method, and the issue of implementing this material into
current curricula, were addressed.


Mathematical Modeling in Petroleum Reservoir Simulation, by Richard E. Ewing, University of Wyoming 
and Mobil Research and Development, Dallas, Texas. 


Mathematical questions arising in the numerical simulation of enhanced recovery processes are 
discussed. A model for describing miscible displacement in porous media is given by a coupled system
of nonlinear partial differential equations for the concentration of the invading fluid and the
pressure of the total fluid. Mixed finite element methods are presented for the simultaneous
approximation of the pressure and the Darcy velocity from an elliptic equation with point sources
and sinks. The concentration of the invading fluid is described by a nonlinear transport dominated 
parabolic equation with point sources which has many properties of first order hyperbolic equations. 


Homoclinic Bifurcation and Phase Transition in Time Discrete Dynamical Systems (with computer pictures),
by Heinz-Otto Peitgen, University of Bremen and University of Utah.


The author discussed the scenario of homoclinic bifurcation for time discrete dynamical systems,
which are given by one-parameter families of area preserving diffeomorphisms of the plane.
For specially selected models which are derived from numerical approximations of nonlinear elliptic 
boundary value problems he gave some evidence of its occurrence. He indicated how these structural 
changes govern the asymptotic fate of periodic orbits and initiate phase transitions in the global 
dynamics. Particular attention was given to the phenomenon of homoclinic bifurcation as such in the
space of all dynamical systems and its connection to the occurrence of spurious solutions in numeri- 
cal approximation schemes. The results were illustrated by computer-generated phase portraits. 
Subsequently a computer-generated movie was shown, which was produced at the University of Utah in 
Salt Lake City with the support of the Departments of Computer Science and Mathematics. 


VisiCalc and Mathematical Algorithms: Mathematical Applications of an Electronic Spreadsheet, by 
Deane E. Arganbright, Whitworth College. 


VisiCalc is a computer software package designed as an "electronic spreadsheet" for business
and economic modeling and forecasting. This talk presents VisiCalc as a creative and dynamic tool 
for doing and teaching mathematics. Algorithms which are recursive, iterative, or tabular in nature 
can be easily implemented on VisiCalc in a way that allows the user to change parameters, initial 
values and constants, and instantly see the result of the changes. Examples are drawn from numeri- 
cal analysis, linear programming, linear algebra, number theory, differential equations, combinator- 
ics, statistics, calculus, and elementary mathematics. 


The Role of Microcomputers in the Mathematics Curriculum, by Ruth Hoffman, University of Denver. 


Computer Literacy is the fundamental issue of the decade. It has developed in the past 15 
years in the secondary school from computer science courses through microcomputers used in mathematics,
thence to microcomputer usage in mathematics, science and business education and in the '80s to
microcomputers used in all areas of the pre-college curriculum. Colleges have moved more slowly. 
In a survey in the spring of 1982, 14,132 pre-college schools across the nation were officially 
listed as having microcomputers in their curriculum. The conclusion of the survey was that computers
have definitely penetrated the education system marking the beginning of major curricular
changes. A survey was also taken of 100 colleges from a list of 3,308. The conclusion for colleges 
is that computer literacy is ill-defined and that different institutions have widely diverse needs. 


by Katherine P. Layton, Beverly Hills High School.


In response to the current decline in the quality and quantity of precollege mathematics and 
science education in the United States, the National Science Board established the Commission on
Precollege Education in Mathematics, Science and Technology. The purpose of the Commission is to 
define a national agenda for improving mathematics and science education. A progress report was 
given of the work done by the Commission during its first eight months. Included was a discussion 
of the problem statement, "Today's Problems, Tomorrow's Crises"; the Atlanta Public Hearings; Access
to High Quality Education in Mathematics, Science and Technology for All Citizens; and studies 
instituted by the Task Groups on Education, Governments, Facilitators, and Recipients. 


Also included on the program were lectures entitled "Applications in the Undergraduate Curriculum"
by Solomon Garfunkel, and "Geometric Structures for 3-Manifolds with Symmetry" by William P.
Thurston. The Association’s Committee on Corporate Members sponsored a session entitled "Mathematics
Publishing, Copyright, and Software." Finally, there was a session organized by Irwin Kra on
behalf of the MAA/AMS Committee on Employment and Educational Policy entitled "Freshman Mathematics:
Are There Alternatives to the Calculus?"


Special Sessions
Mini-Courses. 


The Association sponsored five mini-courses entitled "Placement Testing," "Introduction to
Microcomputers in Mathematics Instruction," "An Introduction to the Mathematical Foundations of Computer
Graphics," and "Uses of Computers in Undergraduate Mathematics Instruction." Persons involved
in these courses and from whom additional information can be requested are Richard H. Prosl, College
of William and Mary; Klaus E. Eldridge or Donald O. Norris, Ohio University; Gerald J. Porter,
University of Pennsylvania; David A. Smith, Duke University.


Contributed Papers. 


For the first time at an Annual Meeting, the Association sponsored sessions for contributed 
papers. There were two such sessions entitled "Discrete Mathematics in the Undergraduate Curriculum 
Program" organized by Anthony Ralston, and "Computers in Undergraduate Mathematics Instruction"
organized by Ronald Wenger. The list of presenters follows: "Proof by Algorithm," Stephen B.
Maurer; "Some Elementary Applications of the Principle of Inclusion and Exclusion," Kenneth P.
Bogart; "An Alternative to Calculus for Liberal Arts Students," Duane Bailey; "Some Finite Problems
Neither My Computer Nor I Can Solve," Richard V. Andree; "Formal Reasoning in Discrete Mathematics,"
Susanna S. Epp; "Calculus with a Function Preprocessor," R.J. Knill; "3-D Image Generation in Vector
Calculus," Michael Frantz; "A Hidden-Line Algorithm for Three-Dimensional Computer Graphics,"
William F. Rich; "APPLE Graphics Enhanced With an Attached Processor," Roy Myers; "The School of 
Soft Knocks," Richard L. Eisenman; "Using Microcomputers to Illustrate Concepts in Probability and 
Statistics," Elliot A. Tanis; "Some APPLE Computer Programs for Combinatorics Class," Donald R. 
Snow; "A Mathematical Analysis of the Game of Jai Alai," Louise E. Moser; "Applying the Computer to 
Abstract Algebra," George M. Whitson; "Computer Generated Stereo Slides-Movies-Models," Cliff Long 
and Thomas Hern; and "Computer Scaling of Tests," Clifton A. Lando. 


The Local Arrangements Committee consisted of: Jerrold W. Bebernes; William S. Dorn; Gary W.
Grefsrud; Raymond R. Gutzman; Darel W. Hardy; Zenos Hartvigson; Frieda K. Holley; Arne Magnus; 
Richard Osborne; Arlan B. Ramsay; William N. Reinhardt; Nancy Warren Townsend. 


Board and Business Meeting 


The Board of Governors met at 9:00 A.M. on January 6, 1983 in the Forum of the Executive Tower
Inn. The major items of business will be announced to the membership in FOCUS. 


The business meeting of the Association was held at 9:30 A.M. on January 8, 1983. At this 
meeting, Alice Beckenbach accepted the Award for Distinguished Service to Mathematics on behalf of 
her late husband Edwin F. Beckenbach. The citation that accompanied the Award was read by Profes- 
sor Ivan Niven, MAA President-Elect. This citation will be published in the Monthly. 


The Certificate of Merit was presented to H. Hope Daly for her work as Head of Meetings 
Arrangements. The Secretary read a citation and the audience joined in an ovation for Ms. Daly. 


The revisions to the By-Laws made necessary by offering members the choice of any one journal
published by the Association were approved. 


The Lester R. Ford Awards for expository Monthly articles were presented to: 


Philip Davis, Brown University 
"Are There Coincidences in Mathematics?" 
American Mathematical Monthly 


R. Arthur Knoebel, New Mexico State University 
"Exponentials Reiterated." 
American Mathematical Monthly 


Election of Members 
At its meeting on January 6, 1983, the Board elected to membership 802 applicants for individual
membership and 8 applicants for academic membership. The latter follow: Hampden-Sydney College;
Iowa Wesleyan College; University of Maine at Machias; University of Wisconsin at Green Bay;
Drake University; Moorpark College; Liberty Baptist College; and Hamline University. 


Respectfully submitted, 


David P. Roselle 
Secretary 


CONJECTURES ON THE CRITICAL POINTS OF A POLYNOMIAL 


MORRIS MARDEN 
Department of Mathematics, University of Wisconsin-Milwaukee, Milwaukee, WI 53201 


1. Introduction. Let us suppose that p(z) is an nth degree polynomial which has all its zeros in
the unit disk |z| ≤ 1; then all the critical points (zeros of the derivative) of p(z) also lie in the
same disk |z| ≤ 1. This is in fact the well-known theorem which was implied in a note of Gauss
(1836) and stated and proved explicitly by Lucas (1874) (Marden [6], p. 22).

Now, instead of considering the relative positions of all the zeros and all the critical points of 
p(z), let us choose any one zero z_1 of p(z) and ask: At most how far from z_1 does the nearest
critical point lie? A possible answer to this question is given by the following: 


CONJECTURE I. If p(z) is an nth degree polynomial having all its zeros in the unit disk |z| ≤ 1
and if z_1 is any one such zero, then at least one critical point of p(z) lies on the disk |z - z_1| ≤ 1.
(Fig. 1.)


FIG. 1 


This conjecture was included in the collection of Research Problems in Function Theory, 
published in 1967 by Professor Hayman [4]. Since it had been brought to Hayman’s attention by 
Professor Ilyeff, it became known as “Ilyeff’s Conjecture.” Actually, Conjecture I was due to the 
Bulgarian mathematician B. Sendov who had acquainted me and probably others with it in 1962 
at the International Congress of Mathematicians held in Stockholm. 

After publication of the Conjecture, efforts were initiated by a number of very competent 
mathematicians to test the validity of the conjecture. Fourteen papers were published on the 
conjecture between 1968 and 1978. Some of these papers dealt with special cases or used various 
ad hoc devices. Our purpose here is to report primarily upon some general methods and results 
pertaining to Conjecture I. 


Morris Marden received his Ph.D. from Harvard under Joseph L. Walsh in 1928. As a National Research Fellow 
from 1928 to 1930 he did postdoctoral work at Wisconsin under E. B. Van Vleck, at Princeton under Einar Hille, at 
Zurich under George Polya and at Paris under Paul Montel. Later he studied applied mathematics at Brown under 
Stefan Bergman. Since 1930 he has been on the University of Wisconsin-Milwaukee faculty; since 1975 as UWM 
Distinguished Professor Emeritus. During 1975-77 he served as Visiting Distinguished Professor at California 
Polytechnic State University in San Luis Obispo. He has served as a MAA visiting lecturer and has lectured abroad 
extensively. He has two sons: Albert, professor of mathematics at the University of Minnesota; and Philip, a 
pediatrician practicing in Oconomowoc, Wisconsin. 
The Conjecture is obviously true for $p(z) = z^n - 1$ and polynomials of the form 
$p(z) = a_0 + a_2 z^2 + \cdots + a_n z^n$ where $a_0 a_n \ne 0$. For these polynomials p'(0) = 0 and so every zero is within 
unit distance from the origin, which is a critical point. 

The Conjecture may be interpreted physically if we recall Gauss’ Theorem that the critical 
points of a polynomial, which are not multiple zeros of the polynomial, are the equilibrium points 
in a certain force field. The field is due to particles placed at the zeros of the polynomial, the 
particles having masses equal to the multiplicity of the zeros and attracting with a force inversely 
proportional to the distance from the particle. These critical points cannot therefore be too close 
to any one zero since the force due to the particle at the zero would be relatively large. On the 
other hand, the Conjecture would imply that, if all the particles are of unit mass and situated on 
the disk |z| < 1, then at least one equilibrium point will lie within unit distance of each particle. 

The Conjecture may also be given a geometric interpretation. It is known (Marden [6, p. 9]) that 
the critical points $\zeta_1$ and $\zeta_2$ ($\ne z_1, z_2, z_3$) of the polynomial 

\[ p(z) = (z - z_1)^{m_1} (z - z_2)^{m_2} (z - z_3)^{m_3} \]

lie at the foci of the ellipse which touches the line segments $(z_1, z_2)$, $(z_2, z_3)$, and $(z_3, z_1)$ in the 
points that divide these segments in the ratios $m_1/m_2$, $m_2/m_3$ and $m_3/m_1$ respectively. The 
Conjecture implies therefore that, if the vertices of this triangle $z_1, z_2, z_3$ all lie in the unit disk 
$|z| \le 1$, each vertex is within unit distance from one of the foci of the inscribed ellipse (Fig. 2). A 
similar interpretation can be given for the critical points of the polynomial 

\[ p(z) = (z - z_1)(z - z_2) \cdots (z - z_n) \]

as foci of a curve $\Gamma$ of class n (Marden [6, p. 11]), which is tangent to the sides of the polygon with 
vertices at the points $z_j$, j = 1, 2, ..., n. If true, the Conjecture would imply that within unit 
distance of each vertex lies at least one focus of $\Gamma$. 


FIG. 2 


2. Existence of Extremal Polynomials. Let us reformulate Conjecture I as in Phelps-Rodriguez 
[16] by study of the extremal members in the family $\mathcal{P}$ of all nth degree polynomials whose zeros 
all lie on the disk $|z| \le 1$. Let us write p(z) and its derivative p'(z) in the form 

\[ p(z) = \prod_{j=1}^{n} (z - z_j), \qquad p'(z) = n \prod_{k=1}^{n-1} (z - \zeta_k) \tag{2.1} \]

and denote by D(a, r) the disk $|z - a| \le r$. Let us also define 

\[ I(z_j) = \min\{|z_j - \zeta_k|,\ k = 1, 2, \ldots, n-1\}; \qquad I(p) = \max\{I(z_j),\ j = 1, 2, \ldots, n\}; \]
\[ I(\mathcal{P}) = \sup\{I(p),\ p \in \mathcal{P}\}. \]


According to Lucas’ theorem (Marden [6, p. 22]) any circle containing all the zeros of a 
polynomial p(z) also contains the critical points of p(z). It follows that $I(\mathcal{P}) \le 2$. The example 
$p(z) = z^n - 1$ has I(p) = 1 and thus shows that $I(\mathcal{P}) \ge 1$. However, Conjecture I asserts that 
$I(\mathcal{P}) \le 1$. 
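
The quantities I(z_j) and I(p) can be illustrated numerically. The following Python sketch is an illustration only, not part of the original argument: it is restricted to cubics (an assumption made here so that the two critical points follow from the quadratic formula), and the function names are hypothetical.

```python
import cmath

def cubic_critical_points(z1, z2, z3):
    """Zeros of p'(z) for the monic cubic p(z) = (z - z1)(z - z2)(z - z3)."""
    e1 = z1 + z2 + z3                    # p(z) = z^3 - e1 z^2 + e2 z - e3
    e2 = z1 * z2 + z2 * z3 + z3 * z1
    root = cmath.sqrt(e1 * e1 - 3 * e2)  # p'(z) = 3 z^2 - 2 e1 z + e2
    return [(e1 + root) / 3, (e1 - root) / 3]

def I_of_zero(z, critical_points):
    """I(z_j): distance from a zero to its nearest critical point."""
    return min(abs(z - c) for c in critical_points)

def I_of_polynomial(zeros):
    """I(p): the largest of the distances I(z_j) over the zeros of p."""
    crit = cubic_critical_points(*zeros)
    return max(I_of_zero(z, crit) for z in zeros)

# The example p(z) = z^3 - 1: both critical points coincide at the origin.
cube_roots_of_unity = [cmath.exp(2j * cmath.pi * k / 3) for k in range(3)]
print(I_of_polynomial(cube_roots_of_unity))
```

Up to rounding, the printed value for $z^3 - 1$ agrees with I(p) = 1 as stated in the text.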

Any polynomial $p^* \in \mathcal{P}$ such that $I(p^*) = I(\mathcal{P})$ is called an extremal polynomial of $\mathcal{P}$. Let us 
first determine that $\mathcal{P}$ contains at least one extremal polynomial $p^*$. 

By the definition of $I(\mathcal{P})$ as a supremum, $\mathcal{P}$ contains a sequence $\{p_k\}$ of polynomials such that 

\[ \lim_{k \to \infty} I(p_k) = I(\mathcal{P}). \tag{2.2} \]


Now, $\mathcal{P}$ is a “normal family of functions” since for an arbitrary but fixed R > 0, all polynomials 
$p \in \mathcal{P}$ have, according to (2.1), the property $|p(z)| \le (R + 1)^n$ for $|z| \le R$ and hence are 
uniformly bounded. This means that a subsequence of the $\{p_k\}$ can be found that in $|z| \le R$ 
converges uniformly to an nth degree polynomial $f(z) = (z - Z_1)(z - Z_2) \cdots (z - Z_n)$ and 
simultaneously the $p_k'$ converge uniformly to the derivative of f(z): 

\[ f'(z) = n(z - W_1)(z - W_2) \cdots (z - W_{n-1}). \]


By Hurwitz’ Theorem (Marden [6, p. 4]) the zeros of f(z) and f'(z) are the limits of the zeros of $p_k$ 
and $p_k'$, respectively. Hence, we infer that $|Z_j| \le 1$, j = 1, 2, ..., n; $f(z) \in \mathcal{P}$, and that $I(f) \le I(\mathcal{P})$. 
To show in fact that $I(f) = I(\mathcal{P})$, let us assume on the contrary that $I(f) < I(\mathcal{P})$ and write 

\[ I(f) = I(\mathcal{P}) - 4\varepsilon, \qquad \varepsilon > 0. \tag{2.3} \]


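Let us choose $\delta$, $0 < \delta < \varepsilon$, so that $f(z) \ne 0$ for $0 < |z - Z_j| < \delta$, j = 1, 2, ..., n. If k is 
sufficiently large, only one zero $z_{k,j}$ of $p_k(z)$ lies in disk $D(Z_j, \delta) \subset D(Z_j, \varepsilon)$. From (2.3) we conclude 
that at least one zero $W_h$ of f'(z) satisfies the inequality 

\[ |Z_j - W_h| \le I(f) = I(\mathcal{P}) - 4\varepsilon. \tag{2.4} \]

Choose $\delta$ so that also $f'(z) \ne 0$ for $0 < |z - W_h| < \delta$ and thus for k sufficiently large only one 
zero $\zeta_{k,h}$ of $p_k'$ lies in $D(W_h, \delta) \subset D(W_h, \varepsilon)$. From 

\[ z_{k,j} - \zeta_{k,h} = (z_{k,j} - Z_j) + (Z_j - W_h) + (W_h - \zeta_{k,h}) \]

it follows that 

\[ |z_{k,j} - \zeta_{k,h}| \le |z_{k,j} - Z_j| + |Z_j - W_h| + |W_h - \zeta_{k,h}| < \delta + I(\mathcal{P}) - 4\varepsilon + \delta. \]

Hence, $\lim_{k \to \infty} I(p_k) \le I(\mathcal{P}) - 2\varepsilon$ which contradicts (2.2). Therefore $I(f) = I(\mathcal{P})$ and hence 
$f(z) = p^*(z)$, an extremal member of $\mathcal{P}$. 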


3. Properties of Extremal Polynomials. We now show that a necessary condition on each 
extremal polynomial $p^*(z)$ is that at least one of its zeros $Z_j$ lies on the unit circle C(0, 1). This 
condition is to be expected in view of the fact that each zero $W_k$ of $p^{*\prime}(z)$ is an analytic function 
of the zeros $Z_j$ of $p^*(z)$ due to the equation 

\[ p^{*\prime}(W_k)/p^*(W_k) = \sum_{j=1}^{n} [1/(W_k - Z_j)] = 0. \]

Since a neighborhood of any $Z_j$ with $|Z_j| < 1$ would be mapped onto a neighborhood of $W_k$, a 
small increment on $Z_j$ in a suitable direction should increase $I(Z_j)$ contrary to the hypothesis that 
$p^*(z)$ is extremal. Hence, we may expect at least one $Z_j$ to lie on circle C(0, 1). 
That $p^*(z)$ actually has at least one zero on the unit circle is proved in Phelps-Rodriguez [16] as 
follows. Assuming on the contrary that $|Z_j| < 1$, j = 1, 2, ..., n, and labeling $Z_1$ so that 
$I(Z_1) = I(p^*)$, we note that by Lucas’ Theorem $|W_k| < 1$ and that $|Z_1 - W_k| \ge I(Z_1)$ for 
k = 1, 2, ..., n - 1. Let us choose a sequence $\zeta_{k,\nu}$, $\nu = 1, 2, \ldots$, with $|\zeta_{k,\nu}| < 1$, 
$\lim_{\nu \to \infty} \zeta_{k,\nu} = W_k$, and $|Z_1 - \zeta_{k,\nu}| > I(Z_1)$. Then the polynomials 

\[ f_\nu(z) = n(z - \zeta_{1,\nu}) \cdots (z - \zeta_{n-1,\nu}) \]

converge uniformly to $p^{*\prime}(z)$ and $p_\nu(z) = \int_{Z_1}^{z} f_\nu(t)\,dt$ converge uniformly to $p^*(z)$ for $|z| \le R$, 
R > 0. By Hurwitz’ Theorem all the zeros of $p_\nu(z)$ lie interior to D(0, 1) and therefore $p_\nu \in \mathcal{P}$ if $\nu$ is 
sufficiently large. However, since $Z_1$ is a zero of $p_\nu$, 

\[ I(p_\nu) \ge \min\{|Z_1 - \zeta_{k,\nu}|,\ 1 \le k \le n-1\} > I(Z_1) = I(p^*), \]

contradicting $I(p^*) = I(\mathcal{P})$, the property of extremal polynomials. 

Furthermore, Phelps-Rodriguez show that not only one zero but at least two zeros of $p^*$ must 
lie on the unit circle C(0, 1). For, if there were only one such zero, it may be taken without loss of 
generality as $Z_1 = 1$ with $|Z_j| < 1$ and $\max |Z_j| = r$, j = 2, 3, ..., n. We now introduce the polynomial 
$q(z) = p^*(z + s)$, where s = (1 - r)/2. The polynomial q(z) has all its zeros $(Z_j - s)$ interior to 
D(0, 1) and its critical points are $(W_k - s)$. Thus $I(q) = I(p^*) = I(\mathcal{P})$, but q(z) cannot be an 
extremal polynomial since it has no zero on the unit circle. Hence, $p^*$ must have at least two zeros 
on the unit circle. In fact, from the above reasoning, Phelps-Rodriguez infer that, on the unit 
circle, every arc of length at least $\pi$ contains at least one zero $Z_j$ of $p^*$ and, hence, unless 
$Z_1 = -Z_2$ with $|Z_1| = |Z_2| = 1$, $p^*$ would have at least three zeros on the unit circle. 

Regarding any zero $Z_1$ of $p^*$ interior to D(0, 1), Phelps-Rodriguez establish the inequality 

\[ [I(Z_1)]^{n-1} \le (1/n)(1 + |Z_1|^2)(1 + |Z_1|)^{n-3}. \tag{3.1} \]

Thus, if $|Z_1| < 1$, then $I(Z_1) < (2/3)^{1/2}$ for n = 3 and $I(Z_1) < 1$ for n = 4. Thus for n = 3 and 4, 
Conjecture I is now confirmed. However, if $n \ge 5$, inequality (3.1) will not lead to the result 
$I(Z_1) \le 1$ unless $|Z_1|$ is taken sufficiently small. 
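
The behavior of the bound (3.1) for small n can be tabulated directly. The sketch below is illustrative only; the function name `bound_3_1` and the sample radii are choices made here, not taken from Phelps-Rodriguez. It solves (3.1) for I(Z_1) and evaluates the result at the limiting radius |Z_1| = 1, where the bound is largest because the right-hand side increases with |Z_1|.

```python
def bound_3_1(n, r):
    """Upper bound on I(Z_1) implied by (3.1), where r = |Z_1| and n is the degree."""
    rhs = (1.0 / n) * (1.0 + r * r) * (1.0 + r) ** (n - 3)
    return rhs ** (1.0 / (n - 1))

# Limiting value of the bound as |Z_1| approaches 1, for several degrees.
for n in (3, 4, 5, 6):
    print(n, bound_3_1(n, 1.0))

# For n = 5 the bound drops below 1 only once |Z_1| is small enough.
print(bound_3_1(5, 0.5))
```

For n = 3 and n = 4 the limiting values are $(2/3)^{1/2}$ and 1, matching the text; for n = 5 the limiting value exceeds 1, which is why (3.1) alone does not settle that case.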


4. Critical Point Nearest a Boundary Zero. Whether or not p(z) is extremal in $\mathcal{P}$, let us assume 
that it has a zero $z_1$ with $|z_1| = 1$. Let us now show, not merely that $I(z_1) \le 1$ but the sharper 
result (Fig. 3) that at least one critical point of p lies in $D(z_1/2, 1/2)$. This fact was discovered by 
Goodman-Rahman-Ratti [3]. We give a modified version of the inductive proof as given in 
Schmeisser [12]. 


FIG. 3 
For n = 2, $p(z) = (z - z_1)(z - z_2)$ and $p'(z) = 2(z - \zeta_1)$ with $\zeta_1 = (z_1 + z_2)/2$. In view of 
$|z_1| = 1$ and $|z_2| \le 1$, $|\zeta_1 - (z_1/2)| \le 1/2$. Let us now assume the theorem to have been proved 
for $n \le N$; namely that the derivative of $p(z) = (z - z_1) \cdots (z - z_N)$, where $|z_1| = 1$ and $|z_j| \le 1$ 
for $j \ge 2$, has at least one zero in $D(z_1/2, 1/2)$, and consider $p(z) = (z - z_1) \cdots (z - z_{N+1})$ 
where $|z_1| = 1$ and $|z_j| \le 1$ for $j \ge 2$. We introduce the polynomial (polar derivative of p with 
respect to $z_1$): 

\[ q(z) = (N + 1)p(z) - (z - z_1)p'(z) \tag{4.1} \]

which has degree N and a zero at $z = z_1$. If q(Z) = 0 but $Z \ne z_1$, we infer from (4.1) that 

\[ \frac{N + 1}{Z - z_1} = \frac{p'(Z)}{p(Z)} = \sum_{j=1}^{N+1} \frac{1}{Z - z_j}. \]

Thus 

\[ \frac{1}{z_1 - Z} = \frac{1}{N} \sum_{j=2}^{N+1} \frac{1}{z_j - Z}. \tag{4.2} \]

If |Z| > 1, the function $w = 1/(z - Z)$ maps the disk D(0, 1) conformally upon the closed 
interior of a circle $\Gamma$. The points $w_j = 1/(z_j - Z)$, $j \ge 2$, are in the closed interior of $\Gamma$ but $w_1$ lies 
on $\Gamma$. According to (4.2), 

\[ w_1 = (w_2 + w_3 + \cdots + w_{N+1})/N. \]

However, $w_1$ as a point on $\Gamma$ cannot be the centroid of points in or on $\Gamma$. Thus q(z) has a zero $z_1$ 
with $|z_1| = 1$ and all its remaining zeros lie in D(0, 1) so that $q \in \mathcal{P}$ and at least one zero of q'(z) 
lies in $D(z_1/2, 1/2)$. 

Finally, suppose all zeros of p'(z) lie in the circular region $K: |z - (z_1/2)| > 1/2$. By 
differentiating (4.1), we find 

\[ q'(z) = N p'(z) - (z - z_1) p''(z). \tag{4.3} \]

Thus, q' has the form 

\[ F(z, \lambda) = N f(z) - (z - \lambda) f'(z), \]

called by Laguerre the “polar derivative” of the Nth degree polynomial f(z) with respect to $\lambda$. 
Laguerre proved (Marden [6, pp. 49-51]) that, if all the zeros of f(z) lie in a circular region K and 
if $\lambda \in K$, then all the zeros of $F(z, \lambda)$ also lie in K. It follows therefore, since $|z_1| = 1$ and hence 
$z_1 \in K$, that all the zeros of q'(z) lie in K. Since this contradicts our earlier result that q'(z) has at 
least one zero in $D(z_1/2, 1/2)$, we infer that not all zeros of p'(z) lie in K and consequently at 
least one zero of p'(z) lies in $D(z_1/2, 1/2)$, as was to be proved. 

Since $D(z_1/2, 1/2) \subset D(z_1, 1)$, we have shown $I(z_1) \le 1$ for any $p \in \mathcal{P}$ for which $p(z_1) = 0$, 
$|z_1| = 1$. If we now apply inequality (3.1) to zeros $Z_j$, $|Z_j| < 1$, we infer that Conjecture I is 
valid for n = 2, 3 and 4. The case n = 5 was established by a different method by Meier-Sharma 
[8]. 
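
The result of this section lends itself to a numerical spot check. The sketch below is illustrative only and is no substitute for the proof: it is restricted to cubics (an assumption made here so that the critical points follow from the quadratic formula), it samples zeros at random with a fixed seed, and its function names and tolerance are hypothetical choices.

```python
import cmath
import random

def cubic_critical_points(z1, z2, z3):
    """Zeros of p'(z) for p(z) = (z - z1)(z - z2)(z - z3)."""
    e1 = z1 + z2 + z3
    e2 = z1 * z2 + z2 * z3 + z3 * z1
    root = cmath.sqrt(e1 * e1 - 3 * e2)
    return [(e1 + root) / 3, (e1 - root) / 3]

def random_point_in_unit_disk(rng):
    # Taking the square root of the radius makes the sample uniform in area.
    return cmath.rect(rng.random() ** 0.5, 2 * cmath.pi * rng.random())

def has_critical_point_near_boundary_zero(z1, z2, z3, tol=1e-9):
    """True if some zero of p' lies in D(z1/2, 1/2), up to a rounding tolerance."""
    return any(abs(c - z1 / 2) <= 0.5 + tol
               for c in cubic_critical_points(z1, z2, z3))

rng = random.Random(0)  # fixed seed so that the run is reproducible
trials = 5000
failures = 0
for _ in range(trials):
    z1 = cmath.rect(1.0, 2 * cmath.pi * rng.random())  # boundary zero, |z1| = 1
    z2 = random_point_in_unit_disk(rng)
    z3 = random_point_in_unit_disk(rng)
    if not has_critical_point_near_boundary_zero(z1, z2, z3):
        failures += 1
print(failures)
```

A count of zero failures is what the theorem predicts; a nonzero count would point to a rounding problem in the sketch rather than to a counterexample.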


5. Convex Hull of the Zeros. Let us denote by H(p) the convex hull of the zeros of p(z); that 
is, the smallest convex polygon containing all the zeros of p(z). Since the zeros of p(z) lie in 
D(0, 1), so does H(p): $H(p) \subset D(0, 1)$. (Fig. 4.) 

We begin as in Schmeisser [12] with the case that all the vertices of H(p) lie on the unit circle 
|z| = 1. If $z_1, z_2, \ldots, z_m$ ($m \le n$) are at the vertices of H(p) and $z_{m+1}, z_{m+2}, \ldots, z_n$ are interior to 
H(p), we already know from §4 that at least one critical point $\zeta_j$ lies in $D(z_j/2, 1/2)$ for 
$1 \le j \le m$. Let us now show that 

\[ H(p) \subset T = \bigcup_{j=1}^{m} D(z_j/2, 1/2). \]


FIG. 4 


That is, the region H(p) is completely covered by the set of discs $D(z_j/2, 1/2)$ with j = 1, 2, ..., m. 
First, any point z on the side of H(p) joining vertices $z_1$ and $z_2$ has the representation 
$z = \lambda z_1 + (1 - \lambda) z_2$ where $0 \le \lambda \le 1$. If point z is closer to $z_1$ than to $z_2$, then $(1/2) < \lambda \le 1$ 
and so this point lies in $D(z_1/2, 1/2)$ since 

\[ |z - (z_1/2)| = |[\lambda - (1/2)] z_1 + (1 - \lambda) z_2| \le [\lambda - (1/2)] + (1 - \lambda) = 1/2. \]

By symmetry the points of this side closer to $z_2$ lie in $D(z_2/2, 1/2)$. Hence the entire side joining 
$z_1$ to $z_2$ lies in T. Similarly all other sides of H(p) and interior points of H(p) lie in T. But, as each 
disk $D(z_j/2, 1/2)$, j = 1, 2, ..., m contains at least one critical point $\zeta_j$ of p(z), and any two points 
of a disk of radius 1/2 are at most unit distance apart, it follows that Conjecture I is valid for p(z) 
if its H(p) has all vertices on the unit circle |z| = 1. 
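
The chord estimate above can be spot-checked numerically. In the sketch below, which is illustrative only, the two vertices on the unit circle are arbitrary sample choices and the helper name `chord_point` is hypothetical. It samples the half of the side nearer $z_1$ and records the largest distance from $z_1/2$.

```python
import cmath

def chord_point(z1, z2, lam):
    """The point lam * z1 + (1 - lam) * z2 on the segment joining z1 and z2."""
    return lam * z1 + (1 - lam) * z2

# Two sample vertices on the unit circle (arbitrary angles, chosen for illustration).
z1 = cmath.exp(0.3j)
z2 = cmath.exp(2.1j)

# Sample the half of the chord nearer z1, that is 1/2 <= lam <= 1.
worst = max(abs(chord_point(z1, z2, 0.5 + 0.5 * t / 100) - z1 / 2)
            for t in range(101))
print(worst <= 0.5 + 1e-12)
```

The largest distance is attained at the midpoint of the side, where the point is $(z_1 + z_2)/2$ and its distance from $z_1/2$ is $|z_2|/2 = 1/2$.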

The case that not all vertices of H( p) lie on the unit circle has been investigated by Schmeisser 
[14] only where H(p) is a triangle ABC with points A and B on circle |z| = 1 and C interior to the 
circle. He also treats the case that at most one vertex of triangle ABC lies on circle |z| = 1, by 
applying a linear transformation w = az + b which reduces this case to that in which two or three 
vertices of the triangle are on circle |z| = 1. 


6. Application of Classical Composition Theorems. In attempting to validate Conjecture I, 
some authors have successfully used certain theorems that are well known in the Analytic Theory 
of Polynomials. We have in §4 already applied Laguerre’s Theorem involving the polar derivative 
of a polynomial. Another is the following theorem due to Szegö (Marden [6, pp. 65-66]). 


Given the polynomials 

\[ f(z) = \sum_{k=0}^{m} \binom{m}{k} A_k z^k, \qquad g(z) = \sum_{k=0}^{m} \binom{m}{k} B_k z^k, \]

where $\binom{m}{k} = m(m - 1) \cdots (m - k + 1)/k!$, we form the polynomial $h(z) = \sum_{k=0}^{m} \binom{m}{k} A_k B_k z^k$. 
If all the zeros of f(z) lie in a circular region A, then every zero of h(z) has the form $\gamma = -\alpha\beta$ where 
$\alpha$ is a suitably chosen point in A and $\beta$ is a zero of g(z). 
In Schmeisser [12], Szegö’s Theorem is applied to polynomials p(z) which have a zero at the 
origin. Thus, 

\[ p(z) = \sum_{k=0}^{n-1} a_k z^{k+1}, \qquad p'(z) = \sum_{k=0}^{n-1} (1 + k) a_k z^k. \tag{6.1} \]

Here the polynomial p(z) may be regarded as h(z), being obtained from p'(z) as the g(z), by 
composition with the polynomial f(z) = F(z), where 

\[ F(z) = \sum_{k=0}^{n-1} \binom{n-1}{k} z^k / (k + 1). \]

But 

\[ zF(z) = \sum_{k=0}^{n-1} \binom{n-1}{k} \int_0^z t^k \, dt = \int_0^z (1 + t)^{n-1} \, dt = (1/n)[(1 + z)^n - 1]. \tag{6.2} \]

The zeros of F(z), being $\beta_j = -1 + \exp(2\pi i j/n)$, j = 1, 2, ..., n - 1, all lie on the disk $|z + 1| \le 1$. 
Hence, taking F(z) as the f(z) of Szegö’s Theorem, we infer that each zero $z_j$ of p(z) has the form 
$z_j = -\alpha\zeta_j$, where $\alpha \in D(-1, 1)$ and $\beta = \zeta_j$ is some zero of p'(z). Thus 

\[ |z_j - \zeta_j| = |-\alpha\zeta_j - \zeta_j| = |\alpha + 1| \, |\zeta_j| \le 1. \]

This proves that Conjecture I is true for polynomials of form (6.1). 
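
The closed form (6.2) and the resulting location of the zeros of F(z) are easy to confirm numerically. The sketch below is illustrative only: the degree n = 6 and the test point are arbitrary choices made here.

```python
import cmath
from math import comb

def F(z, n):
    """F(z) = sum over k = 0..n-1 of C(n-1, k) z^k / (k + 1)."""
    return sum(comb(n - 1, k) * z ** k / (k + 1) for k in range(n))

n = 6  # sample degree, chosen for illustration

# Claimed zeros of F: beta_j = -1 + exp(2 pi i j / n), j = 1, ..., n - 1.
betas = [-1 + cmath.exp(2j * cmath.pi * j / n) for j in range(1, n)]
residuals = [abs(F(b, n)) for b in betas]   # each should vanish up to rounding
radii = [abs(b + 1) for b in betas]         # each beta_j satisfies |z + 1| = 1

print(max(residuals))
print(radii)
```

Each $\beta_j$ lies on the circle |z + 1| = 1, which is the boundary of the disk named in the text.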

A more elaborate application of Szegö’s Theorem is given in Rubinstein [10] in order to verify 
Conjecture I for polynomials of degree n = 3 and 4. Due to the result in §4 for boundary zeros, it 
suffices to show that for any zero $x_0$ of p(z) with $|x_0| < 1$, at least one critical point lies in 
$D(x_0, 1)$. Without loss of generality we may take $x_0$ as real with $0 \le x_0 < 1$. Writing 
$p(z) = (z - x_0) q(z)$, we translate the origin to $x_0$ and introduce the Taylor expansion about $x = x_0$. Thus, 

\begin{align*}
f(Z) &= p'(Z + x_0) = (d/dZ)[Z q(Z + x_0)] \\
     &= q(Z + x_0) + Z q'(Z + x_0) \\
     &= \sum_{k=0}^{n-1} [q^{(k)}(x_0)/k!] Z^k + \sum_{k=0}^{n-2} [q^{(k+1)}(x_0)/k!] Z^{k+1},
\end{align*}

\[ f(Z) = \sum_{k=0}^{n-1} [(1 + k) q^{(k)}(x_0)/k!] Z^k, \tag{6.3} \]

\[ g(Z) = \sum_{k=0}^{n-1} \left[ \binom{n-1}{k} \Big/ (1 + k) \right] Z^k, \tag{6.4} \]

\[ h(Z) = q(Z + x_0) = \sum_{k=0}^{n-1} [q^{(k)}(x_0)/k!] Z^k. \tag{6.5} \]


Let us assume, contrary to Conjecture I, that p'(z) has no zeros within unit distance of the zero 
$x_0$; that is, $f(Z) = p'(Z + x_0) \ne 0$ for $|Z| \le 1$. Thus, all the zeros of f(Z) lie in the circular 
region A: |Z| > 1. According to (6.2), the zeros of g(Z) are 

\[ \beta_k = -1 + \exp(2\pi k i/n), \qquad k = 1, 2, \ldots, n - 1. \]

Applying Szegö’s Theorem, we infer that the zeros $Z_j$ of h(Z) have the form 

\[ Z_j = -\alpha\beta_k = \alpha[1 - \exp(2\pi k i/n)] \]

where $|\alpha| > 1$ so that $|Z_j| > 2\sin(\pi/n)$. Since $Z h(Z) = p(Z + x_0)$, the zeros $z_j = Z_j + x_0$ of 
p(z), other than $z = x_0$, lie in the intersection of the two circular regions of the Z-plane: 

\[ |Z + x_0| \le 1, \qquad |Z| > 2\sin(\pi/n); \]

that is, in the z-plane: 

\[ |z| \le 1, \qquad |z - x_0| > 2\sin(\pi/n). \tag{6.6} \]


Continuing further, Rubinstein [10] introduces a third plane, the w-plane, and the polynomial 

\[ R(w) = p(w - 1 + x_0) = (w - 1) \prod_{j=1}^{n-1} [w - (1 + z_j - x_0)]. \]

Hence, 

\[ R'(w) = p'(w - 1 + x_0) = n \prod_{j=1}^{n-1} [w - (1 + \zeta_j - x_0)]. \]

The assumption that no $\zeta_j$ lies in disk $D(x_0, 1)$ is equivalent to saying that no zero of R'(w) lies in 
the disk $|w - 1| \le 1$. The restrictions (6.6) on the zeros of R(w) other than w = 1 now become for 
n = 3 or 4 

\[ |w - 1 + x_0| \le 1, \qquad |w - 1| > 2\sin(\pi/n) \ge \sqrt{2}. \tag{6.7} \]

Setting w = u + iv, we obtain from (6.7) by squaring both sides: 

\[ u^2 + v^2 - 2(1 - x_0)u + x_0^2 - 2x_0 \le 0, \tag{6.8} \]

\[ u^2 + v^2 - 2u > 1. \tag{6.9} \]

Multiplying (6.9) by $(1 - x_0)$ and subtracting the product from (6.8), we obtain 

\[ x_0(u^2 + v^2) < 2x_0 - x_0^2 - 1 + x_0 = 3x_0 - x_0^2 - 1, \]

\[ u^2 + v^2 < 3 - [x_0 + (1/x_0)] \le 1 \tag{6.10} \]

due to $x_0 + 1/x_0 \ge 2$, since $0 < x_0 < 1$. 
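
The passage from (6.7) to (6.10) can be checked by random sampling. The sketch below is illustrative only: the sampling box, the range of $x_0$, the seed, and the function name `satisfies_6_7` are choices made here. It draws points w = u + iv, keeps those satisfying both constraints of (6.7) for n = 3 or 4, and counts any that violate (6.10).

```python
import math
import random

def satisfies_6_7(u, v, x0, n):
    """Both constraints of (6.7) for w = u + iv."""
    inside = (u - (1 - x0)) ** 2 + v ** 2 <= 1                # |w - 1 + x0| <= 1
    far = math.hypot(u - 1, v) > 2 * math.sin(math.pi / n)    # |w - 1| > 2 sin(pi/n)
    return inside and far

rng = random.Random(1)  # fixed seed so that the run is reproducible
checked = 0
violations = 0
for _ in range(50000):
    x0 = rng.uniform(0.01, 0.99)
    n = rng.choice((3, 4))
    u, v = rng.uniform(-2, 2), rng.uniform(-2, 2)
    if satisfies_6_7(u, v, x0, n):
        checked += 1
        # (6.10) asserts u^2 + v^2 < 3 - (x0 + 1/x0); allow a tiny rounding margin.
        if u * u + v * v >= 3 - (x0 + 1 / x0) + 1e-12:
            violations += 1
print(checked, violations)
```

Every sampled point that satisfies (6.7) should also satisfy (6.10), so the second printed count is expected to be zero.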

Thus R(w) is a polynomial which by definition has a zero at w = 1 and by (6.10) has all its 
other zeros in the disk |w| < 1. Thus $R(w) \in \mathcal{P}$ and by §4 it has at least one critical point in the 
disk $|w - 1| \le 1$. But, as this contradicts the assumption that $R'(w) \ne 0$ in $|w - 1| \le 1$, it follows 
that the earlier assumption $f(Z) = p'(Z + x_0) \ne 0$ for $|Z| \le 1$ was wrong and thus p(z) has at 
least one critical point in disk $D(x_0, 1)$ when n = 3 or 4. Together with the results of §4, this 
again validates Conjecture I when n = 3 or 4. 

The Szegö Theorem is also used in Rahman [9] to determine the shortest distance from point 
z = a to a critical point of the polynomial [(z - a)p(z)] where p(z) has all its zeros in the disk 
$|z| \le 1$. 

As an alternative to using the Szegö Composition Theorem, there is Grace’s Theorem from 
which the Szegö Theorem is derived. Grace’s Theorem states the following (Marden [6, pp. 
60-63]): 

Let $f(z) = \sum_{k=0}^{n} \binom{n}{k} A_k z^k$ and $g(z) = \sum_{k=0}^{n} \binom{n}{k} B_k z^k$ be apolar polynomials; that is 

\[ \sum_{k=0}^{n} (-1)^k \binom{n}{k} A_k B_{n-k} = 0. \tag{6.11} \]

Then any circular region containing all the zeros of f or g contains at least one zero of the other. 


There being only one condition (6.11) on the coefficients of f and g, an infinite number of 
polynomials g(Z) can be found apolar to the polynomial f(Z) given by (6.3). If such a 
polynomial g can be found that has all its zeros in the disk |Z| < 1, Conjecture I would be 
established. However, so far in this search the present author has succeeded only in the cases 
n = 3 and 4, already treated by Rubinstein [10]. 


7. Further Conjectures. All the evidence so far produced has been favorable to Conjecture I; 
no contrary example has as yet been found. The evidence includes, not only the situations 
discussed in the preceding sections, but the validation of the Conjecture when n = 5 
(Meier-Sharma [8]), and for the lacunary polynomials 

\[ z^{n_0} + a_1 z^{n_1} + a_2 z^{n_2} + a_3 z^{n_3} \quad \text{with } n_0 > n_1 > n_2 > n_3; \]


and 
>) a,z"* withn, > 3k—2,np9<n,<--: <n, 
k=0 
(Schmeisser [13], [14]) as well as for real polynomials of the form 
$p(z) = z^n - a_1 z^{n-1} - \cdots - a_n$ with $a_k \ge 0$ for all k (Schmeisser [13]). 

In the general case, Conjecture I may hinge upon the determination of the extremal polynomials 
$p^* \in \mathcal{P}$. In §3, it is proved that at least two zeros of $p^*(z)$ must lie on the unit circle |z| = 1. 
Phelps-Rodriguez [16] have further conjectured that all the zeros of $p^*(z)$ must lie on the unit 
circle and in fact be equally spaced on the unit circle. That is, they have proposed the following: 


CONJECTURE II. The extremal polynomials of the family $\mathcal{P}$ have the form 
$p^*(z) = z^n - \exp(i\alpha)$, $0 \le \alpha < 2\pi$. 


Another conjecture is the following: 


CONJECTURE III. If $p(z) = \prod_{j=1}^{n} (z - z_j) \in \mathcal{P}$, then at least one critical point of p(z) lies in 
each disk $D(z_j/2, 1 - (|z_j|/2))$, j = 1, 2, ..., n. (Fig. 5.) 


FIG. 5 


This conjecture was made both in Goodman-Rahman-Ratti [3] and Schmeisser [12]. Conjecture 
III reduces to the result of §4 when $|z_j| = 1$. If correct, Conjecture III would give a sharper result 
than Conjecture I in that $D(z_j/2, 1 - (|z_j|/2)) \subset D(z_j, 1)$. 

Conjecture III has, however, been proved in Vernon [15] for zeros $z_j$ such that $|z_j| \le \delta(n)$ 
where $\delta(n) = 1$ for $n \le 4$ but decreases with increasing n, approaching zero as $n \to \infty$. This 
result is not surprising since Conjecture III is true for $z_j = 0$ (see §6) and since the critical points 
are continuous functions of the $z_j$. 

As an example illustrating the three conjectures, let us take the polynomial $p(z) = 1 + z + z^2 + z^3$ 
which has the zeros $z_1 = i$, $z_2 = -i$ and $z_3 = -1$ and for which the derivative 
$p'(z) = 1 + 2z + 3z^2$ has the zeros $\zeta_1 = (-1 + \sqrt{2}\,i)/3$ and $\zeta_2 = (-1 - \sqrt{2}\,i)/3$. A calculation shows that 

\[ |z_1 - \zeta_1| = |z_2 - \zeta_2| = 0.6249 \quad \text{and} \quad |z_3 - \zeta_1| = |z_3 - \zeta_2| = 0.8165, \]

verifying Conjecture I and indicating that this p(z) is not an extremal polynomial since both 
numbers are less than one. A further calculation shows that 

\[ |\zeta_1 - (z_1/2)| = |\zeta_2 - (z_2/2)| = |-2 + (2\sqrt{2} - 3)i|/6 < 1/2 = 1 - (|z_1|/2) = 1 - (|z_2|/2), \]

\[ |\zeta_1 - (z_3/2)| = |\zeta_2 - (z_3/2)| = |1 + 2\sqrt{2}\,i|/6 \le 1/2 = 1 - (|z_3|/2), \]

verifying Conjecture III. 
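
The numbers in this example can be reproduced with a few lines of Python. The sketch below is illustrative only and its variable names are hypothetical; it takes the zeros and critical points exactly as stated in the text and recomputes the distances used for Conjectures I and III.

```python
import math

z = [1j, -1j, -1 + 0j]                      # zeros of 1 + z + z^2 + z^3
zeta = [(-1 + math.sqrt(2) * 1j) / 3,       # zeros of p'(z) = 1 + 2z + 3z^2
        (-1 - math.sqrt(2) * 1j) / 3]

# Conjecture I: distance from each zero to its nearest critical point.
nearest = [min(abs(zj - c) for c in zeta) for zj in z]

# Conjecture III: distance from z_j / 2 to the nearest critical point,
# to be compared with the radius 1 - |z_j| / 2.
half_dist = [min(abs(c - zj / 2) for c in zeta) for zj in z]
radius = [1 - abs(zj) / 2 for zj in z]

print([round(d, 4) for d in nearest])
print([round(d, 4) for d in half_dist], radius)
```

For $z_3 = -1$ the distance $|\zeta_1 - (z_3/2)|$ equals $|1 + 2\sqrt{2}\,i|/6 = 3/6$, so the critical points lie on the boundary of the disk $D(z_3/2, 1/2)$.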
As yet, none of the three conjectures has been fully established. It is clear that, if correct, either 
Conjecture II or III would imply the correctness of Conjecture I. However, even if Conjecture I 


were validated independently of Conjectures II and III, it would still be interesting and important 
to prove or disprove Conjectures II and III. 


The author is grateful to his engineering colleague, Professor Emeritus Webster Christman, for the drawings 
accompanying this paper. 
THE RIESZ REPRESENTATION THEOREM REVISITED 


DONALD G. HARTIG 


Mathematics Department, California Polytechnic State University, San Luis Obispo, CA 93407 


The purpose of this note is to give a formulation of the celebrated Riesz Representation 
Theorem (RRT) using the language of categories. Once this has been done we will outline a proof 
due to D. J. H. Garling [2] that is especially appropriate in this context and deserves to be better 
known. Garling’s proof is mainly topological as opposed to the more standard arguments as found 
for example in [7]. While the measure theory is reduced to a bare minimum, it is assumed that the 
student has been exposed to enough measure theory to make the statement of RRT both 
meaningful and significant. The few topological facts that are required should be accessible to 
most students after a first course in general topology. 


1. Categorical Formulation of RRT. The theorem under consideration states that any bounded 
linear form $\varphi: C(X) \to R$ can be represented by a unique signed Baire measure $\mu$ on X via the 
equation 

\[ \varphi(f) = \int f \, d\mu, \qquad f \in C(X). \]

Here, as always, X is a compact, Hausdorff space and C(X) is the Banach space of continuous, 
real-valued functions on X. In fact much more is actually true and the exact nature of the 
relationship between the linear form $\varphi$ and the Baire measure $\mu$ is more readily described from the 
vantage point provided by category theory. Before continuing the reader might profit from a 
glance at [3]. 

The category of compact, Hausdorff spaces and continuous maps between them will be 
denoted by Comp. The category of Banach spaces and continuous (i.e., bounded) linear maps 
between them will be designated by Ban. Given an object X in Comp there are three objects in Ban 
that are relevant to the present discussion: C(X) described above, its Banach space dual C(X)* 
and M(X), the Banach space of signed Baire measures on X. (The $\sigma$-algebra of Baire sets in X will 
be denoted by $\mathcal{B}_X$; it is the smallest $\sigma$-algebra rendering each $f \in C(X)$ a measurable function 
and is generated by the compact $G_\delta$-sets in X.) Given a morphism $\alpha: X \to Y$ in Comp there are 
also three morphisms in Ban that are immediately available: 

\begin{align*}
\alpha^* &: C(Y) \to C(X) && \text{defined by } \alpha^*(g) = g \circ \alpha \\
\alpha^{**} &: C(X)^* \to C(Y)^* && \text{defined by } \alpha^{**}(\psi) = \psi \circ \alpha^* \\
\alpha_* &: M(X) \to M(Y) && \text{defined by } \alpha_*(\mu) = \mu \circ \alpha^{-1}
\end{align*}

At this point it can be easily verified that each of the three maps $\alpha \mapsto \alpha^*$, $\alpha \mapsto \alpha^{**}$ and $\alpha \mapsto \alpha_*$ is 
functorial. In the earlier note [3] we were interested in the first functor; RRT is a statement about 
the latter two, which we will denote by $C^*$ and M respectively, i.e., $C^*(\alpha) = \alpha^{**}$ and $M(\alpha) = \alpha_*$. 


Now given a signed Baire measure $\mu$ in M(X) we define $\varphi_\mu: C(X) \to R$ as 

\[ \varphi_\mu(f) = \int f \, d\mu. \]


Donald Hartig has a bachelor’s degree from Rensselaer Polytechnic Institute and a Master’s from the University 
of Wisconsin-Milwaukee. His doctorate, earned in 1970, is from the University of California at Santa Barbara under 
The resulting map $\tau_X: M(X) \to C(X)^*$ given by $\tau_X(\mu) = \varphi_\mu$ is, to use Mac Lane’s terminology [5], 
“natural in X.” That is to say, for any morphism $\alpha: X \to Y$ in Comp, the diagram induced in 
Ban: 

\[
\begin{array}{ccc}
C(X)^* & \xrightarrow{\ \alpha^{**}\ } & C(Y)^* \\
{\scriptstyle \tau_X} \uparrow & & \uparrow {\scriptstyle \tau_Y} \\
M(X) & \xrightarrow{\ \alpha_*\ } & M(Y)
\end{array}
\tag{$*$}
\]

is commutative. In the language of category theory the family of maps $\tau = \{\tau_X \mid X \in \text{Comp}\}$ 
constitutes a natural transformation from the functor M to the functor $C^*$. (To verify that (*) 
commutes, it must be shown that 

\[ \int_X g \circ \alpha \, d\mu = \int_Y g \, d\alpha_*(\mu) \]

for each $g \in C(Y)$ and $\mu \in M(X)$. In view of the fact that this equality reduces to the definition 
of $\alpha_*(\mu)$ when $g = \chi_B$, the characteristic function of a Baire set B in Y, the verification is quite 
straightforward and is left as an exercise.) 
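
The identity behind the commutativity of (*) can be seen concretely when X and Y are finite discrete spaces, where a signed measure is a table of point masses and an integral is a finite sum. The sketch below is illustrative only: the spaces, the map `alpha`, the measure `mu`, the function `g`, and the helper names are hypothetical sample data, and exact rational arithmetic is used so that the two sides can be compared for equality.

```python
from fractions import Fraction

def pushforward(mu, alpha):
    """Image measure mu o alpha^{-1} on a finite space, stored as point masses."""
    nu = {}
    for x, mass in mu.items():
        nu[alpha[x]] = nu.get(alpha[x], Fraction(0)) + mass
    return nu

def integrate(f, mu):
    """Integral of f with respect to a finite signed measure given by point masses."""
    return sum(f[x] * mass for x, mass in mu.items())

# Hypothetical finite discrete spaces X = {0, 1, 2, 3} and Y = {'a', 'b'}.
alpha = {0: 'a', 1: 'b', 2: 'a', 3: 'b'}                  # a map X -> Y
mu = {0: Fraction(1, 2), 1: Fraction(-1, 3),              # a signed measure on X
      2: Fraction(2), 3: Fraction(1, 5)}
g = {'a': Fraction(3), 'b': Fraction(-7, 2)}              # a function on Y

g_after_alpha = {x: g[alpha[x]] for x in mu}              # g o alpha, a function on X
lhs = integrate(g_after_alpha, mu)                        # integral of g o alpha over X
rhs = integrate(g, pushforward(mu, alpha))                # integral of g against the image measure
print(lhs == rhs)
```

Both sides regroup the same finite sum, once by points of X and once by points of Y, which is the discrete form of the reduction to characteristic functions mentioned in the text.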

Since a measure $\mu$ is determined by the integrals it produces and C(X) is dense in each space 
$L^1(\mu)$ the connecting morphisms $\tau_X$ are seen to be one-to-one. With a little more work it can even 
be shown that each $\tau_X$ is norm-preserving and, of course, our initial statement of RRT simply says 
that $\tau_X$ is onto. A natural transformation having the property that each connecting morphism is an 
isomorphism (one-to-one and onto for the category Ban) is called a natural equivalence. The full 
story of RRT can now be stated as the 


THEOREM. The natural transformation $\tau = \{\tau_X\}$ is a natural equivalence from M to $C^*$. 


2. Proof of RRT. In view of our remarks preceding the statement of the theorem, its proof 
boils down to showing that each $\tau_X$ is an onto map. The key idea is to observe that this is quite 
easy to do for spaces X that are very fragmented. For example, the argument is trivial when X is 
discrete (hence finite) and we will see that it is only slightly more difficult for the so-called 
“extremally disconnected” spaces. It turns out that such spaces are plentiful enough in Comp to 
allow us to easily extend the proof to all $\tau_X$’s via the diagram displayed in (*). The only result we 
will need that is of a purely measure theoretic nature is Caratheodory’s Extension Theorem as 
found, for example, in Chapter 12 of [6]. 

A topological space is called extremally disconnected when each open set has an open closure. 
This is equivalent to requiring that each pair of disjoint open sets have disjoint closures. In such 
spaces the algebra $\mathcal{A}$ of subsets that are open and closed (the so-called clopen sets) is very large 
and plays a decisive role in both topological and measure-theoretic considerations. This rather 
broad generalization is made precise in the following lemma. 


LEMMA 1. Let X be extremally disconnected in Comp. Then the Baire sets in X are generated by 
the algebra $\mathcal{A}$ of clopen sets and the simple functions based on clopen sets are uniformly dense in 
C(X). 


Proof. Let $\mathcal{S}$ denote the $\sigma$-algebra generated by $\mathcal{A}$. Since each set in $\mathcal{A}$ is a compact $G_\delta$, 
$\mathcal{S} \subset \mathcal{B}_X$. To show that $\mathcal{B}_X \subset \mathcal{S}$ it will suffice to establish that each $f \in C(X)$ is an $\mathcal{S}$-measurable 
function. Given $f \in C(X)$ and real number c let $A = \{x \in X \mid f(x) \le c\}$. For n = 1, 2, ... define 
$A_n = \{x \in X \mid f(x) < c + 1/n\}$. Because $A_n$ is open, its closure $\bar{A}_n$ is clopen and from 

\[ A_n \subset \bar{A}_n \subset \{x \in X \mid f(x) \le c + \tfrac{1}{n}\} \]

it follows that $A = \bigcap_n \bar{A}_n$, implying that $A \in \mathcal{S}$. 
To prove the second assertion choose $f \in C(X)$ and let n be a fixed positive integer. For 
$k = 0, \pm 1, \pm 2, \ldots$ let 

\[ A_k = \{x \in X \mid \tfrac{k}{n} < f(x) < \tfrac{k+1}{n}\}. \]

As above, the closure $B_k$ of each set $A_k$ is a clopen set and since at most finitely many of the $B_k$’s 
are nonempty, the sets $B = \bigcup_k B_k$ and $X \setminus B$ are also clopen. It follows easily that the sets 

\[ C_k = \{x \in X \setminus B \mid f(x) = \tfrac{k}{n}\}, \qquad k = 0, \pm 1, \pm 2, \ldots \]

are clopen and clearly the simple function 

\[ g = \sum_k \frac{k}{n} [\chi_{B_k} + \chi_{C_k}] \]

is within 1/n of f uniformly on X. End of proof. 


Lemma 1 provides a basis for the proof of RRT for the subcategory of Comp consisting of the 
extremally disconnected spaces. Let X be such a space and let $\varphi$ be a nonnegative linear form on 
C(X), that is $\varphi(f) \ge 0$ whenever $f \ge 0$. We immediately obtain a nonnegative finitely additive set 
function $\mu$ on the algebra $\mathcal{A}$ of clopen sets by defining $\mu(A) = \varphi(\chi_A)$. This measure is even 
countably additive on $\mathcal{A}$ since compactness implies that any countable family of disjoint clopen 
sets whose union is also clopen can have at most a finite number of nonempty members. 
Caratheodory’s Extension Theorem, together with Lemma 1, allows us to extend $\mu$ to a Baire 
measure on X. The second part of Lemma 1 shows that this extension of $\mu$ represents the linear 
form $\varphi$. Since an arbitrary member of $C(X)^*$ can be obtained as the difference of nonnegative 
forms, this establishes our assertion. 
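
In the simplest case, a finite discrete space, the construction $\mu(A) = \varphi(\chi_A)$ can be carried out by hand, since every subset is clopen and no extension step is needed. The sketch below is illustrative only: the space `X`, the linear form `phi` (given here as a weighted sum, which is an assumption made for the example), and the test function `f` are hypothetical sample data.

```python
from fractions import Fraction

X = ['p', 'q', 'r']  # hypothetical finite discrete space
weights = {'p': Fraction(2), 'q': Fraction(-1, 4), 'r': Fraction(3, 7)}

def phi(f):
    """A sample linear form on C(X), here a weighted sum (an assumption)."""
    return sum(weights[x] * f[x] for x in X)

def indicator(A):
    """Characteristic function of the subset A of X."""
    return {x: Fraction(1 if x in A else 0) for x in X}

# Every subset of a discrete space is clopen, so mu(A) = phi(chi_A) is defined
# on all subsets; the point masses mu({x}) already determine the measure.
mu = {x: phi(indicator({x})) for x in X}

f = {'p': Fraction(5), 'q': Fraction(-2), 'r': Fraction(1, 3)}
integral = sum(f[x] * mu[x] for x in X)   # integral of f with respect to mu
print(integral == phi(f))
```

Here the measure recovered from the form reproduces the form on every function, which is the representation property that Lemma 1 extends to extremally disconnected spaces.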

The extension to arbitrary X in Comp will be accomplished after we state two topological 
lemmas. Lemma 2 says that extremally disconnected compact Hausdorff spaces abound while 
Lemma 3 shows that they appear throughout the category Comp. 


LEMMA 2. The Stone-Čech compactification of a discrete space is extremally disconnected. 


LEMMA 3. Any compact Hausdorff space is the continuous image of an extremally disconnected 
compact Hausdorff space. 


The proof of Lemma 2 is left as an exercise; Lemma 3 follows directly from Lemma 2. 


Proof of RRT. Let Y be compact and Hausdorff. Choose an onto morphism $\alpha: X \to Y$ in 
Comp where X is extremally disconnected. One easily sees that $\alpha^*$ is a norm isomorphism and an 
application of the Hahn-Banach Theorem will show that $\alpha^{**}$ is also onto. We already know that 
$\tau_X$ is onto and (referring to the commutative diagram (*)) it follows that $\tau_Y$ is onto. 


3. Related Results. To my knowledge V. S. Varadarajan first had the idea of shifting the main 
burden of the proof of RRT to topological considerations. His approach is from a slightly 
different angle and the interested reader will find additional information in his paper [9]. A brief 
version of Varadarajan’s proof can also be found in [1]. Garling’s ingenious arrangement of the 
proof of RRT can also be found in the recent text on functional analysis [4] by R. B. Holmes. This 
book is warmly recommended for its wealth of interesting applications and insightful proofs. The 
categorical formulation of RRT is well known. It is one more example of the usefulness of this 
point of view and is described in Semadeni’s book [8], see Proposition 18.4.3. 
FUNCTIONS WITH A PROPER LOCAL MAXIMUM IN EACH INTERVAL


E. E. POSEY AND J. E. VAUGHAN 
Department of Mathematics, University of North Carolina at Greensboro, Greensboro, NC 27412 


Let $f: \mathbf{R} \to \mathbf{R}$ be a real function. We say that $f$ has a proper local maximum at $x \in \mathbf{R}$ provided
there exists an open neighborhood $V$ of $x$ such that $f(y) < f(x)$ for all $y \in V - \{x\}$. In 1900 A.
Schoenflies [3, p. 158] noted that for any function $f$, the set of points at which $f$ has a proper local
maximum is at most countable. (This is easy to see: Let $B$ denote the set of all open intervals in $\mathbf{R}$
with rational end points; so $B$ is a countable base for the usual topology on $\mathbf{R}$. For each proper
local maximum $x$ of $f$, pick $V_x \in B$ such that $f(y) < f(x)$ for all $y \in V_x - \{x\}$. The
correspondence $x \mapsto V_x$ is one-to-one.)

It is trivial to sketch the graph of a continuous function $f: \mathbf{R} \to \mathbf{R}$ which has a proper local
maximum at infinitely many points in [0, 1], say at each point in the set $\{1/n : n = 1, 2, \ldots\} \cup \{0\}$.
This brings up the question: Does there exist a continuous function $f: \mathbf{R} \to \mathbf{R}$ which has a proper
local maximum at each point of a countable dense set? It is not easy to visualize the graph of such
a continuous function, but the answer to the question is known to be "yes." In [1, p. 63, Thm. 3] it
was proved that each parameter function of a completely wild arc in $\mathbf{R}^3$ will be a continuous
function having a proper local maximum at each point of a countable dense set. This result
provides an "existence theorem" for such functions, but the concept of such arcs is not very
intuitive.

The purpose of this note is to construct, in a straightforward way, a simple example of a 
continuous function f: R — R which has a proper local maximum at each rational point. This 
function will be the uniform limit of functions which are easy to visualize. In this sense the 
construction is analogous to several well-known constructions of continuous nowhere differentia- 
ble functions (e.g. [2, p. 141, Thm. 7.18]), and provides a reasonable exercise for a real analysis 
course. 


EXAMPLE: A function $f: \mathbf{R} \to \mathbf{R}$ with a proper local maximum at each point of a set
everywhere dense in $\mathbf{R}$.

Let $\{r_n\}_{n=1}^{\infty}$ be a set (of distinct points) that is everywhere dense in the real numbers, let

$$g(x) = 1 - \min\{1, |x|\},$$

and let

$$f(x) = \sum_{n=1}^{\infty} g_n(x),$$

where $g_n(x) = A_n\, g((x - r_n)/w_n)$, and the positive real numbers $A_n$ and $w_n$ will be specified.

The definition of $g_n$ will be by induction on $n$. It is clear that $g_n(x) = 0$ for $|x - r_n| \ge w_n$, and
we denote the interval $|x - r_n| < w_n$ by $I_n$ and call it the interval associated with $g_n$. We also
define

$$f_n(x) = \sum_{j=1}^{n} g_j(x).$$
In general we require (1) $0 < A_n < 2^{-n}$; (2) the end points of $I_n$ be distinct from all points $r_j$,
$j = 1, 2, 3, \ldots$; and (3) if $j < n$, then either (3′) $r_n \notin I_j$ and $w_n$ is chosen so that $I_n$ and $I_j$ are disjoint
or (3″) $r_n \in I_j$ and $w_n$ is chosen so that $I_n$ is strictly interior to that half of $I_j$ in which $r_n$ lies, e.g.,
if

$$r_j - w_j < r_n < r_j,$$

then

$$r_j - w_j < r_n - w_n < r_n + w_n < r_j.$$

Choose by induction positive constants $A_n$ and $w_n$ so that (1), (2), and (3) are satisfied. It is
clear that because of (1) $f$ is well defined and continuous regardless of how the constants $w_n$ are
specified. In the following paragraphs we will change some of the constants $A_n$ and $w_n$; we do so
with the understanding that conditions (1), (2), and (3) are still to be satisfied. The conditions (1),
(2), and (3) on the choice of $A_n$ and $w_n$ will not be repeated at each further stage of the
construction.


Let $A_1$ and $w_1$ be chosen as above, thus defining $f_1$. As a sum of "triangular" functions $g_j$, $f_n$
will be constructed to satisfy:


(a) $f_n$ has a proper local maximum at $r_n$, being linear and increasing on $(r_n - w_n, r_n)$ and linear
and decreasing on $(r_n, r_n + w_n)$.
(b) If $I_n \subset I_j$ then $j < n$ and $f_n(r_n) < f_n(r_j)$.


Suppose $f_{n-1}$ has been defined and satisfies (a) and (b). If $I_n$ is exterior to $I_j$, $j = 1, 2, \ldots, n - 1$,
then $A_n$ and $w_n$ are not further modified and (a) and (b) are valid for $f_n$. Otherwise let $h$ be the
maximum integer not exceeding $n - 1$ such that $r_n \in I_h$. Then by the restrictions imposed
previously $I_n$ is strictly interior to $(r_h - w_h, r_h)$ or to $(r_h, r_h + w_h)$, and $f_h$ is linear on $I_n$. By (a)
$f_h(r_h - w_h)$ and $f_h(r_h + w_h)$ are both less than $f_h(r_h)$. Hence $A_n$ can be taken sufficiently small so
that $f_n(r_n) < f_n(r_h)$. If $r_n \in I_j$ for $j < h$, then $I_h \subset I_j$, and thus by (b) $f_h(r_h) < f_h(r_j)$, so $f_n(r_n) <
f_n(r_j)$ and (b) holds. With $A_n$ fixed we may now decrease $w_n$ until (a) is satisfied. Indeed, the
maximum and minimum slopes of $g_j$ are $\pm A_j/w_j$ and thus it suffices to require

$$A_n/w_n > \sum_{j=1}^{n-1} A_j/w_j$$

to insure that (a) holds. Thus the induction is complete and conditions (1), (2), (3), and (a), (b),
hold generally.

The function $f$ so defined has a proper local maximum at each point $r_n$. Indeed, let $x \in I_n$,
$x \ne r_n$, and we shall prove that $f(x) < f(r_n)$. If $x \notin I_j$ for $j = n + 1, n + 2, \ldots$ then $f(x) = f_n(x)
< f_n(r_n) \le f(r_n)$ by (a). Suppose now that $h$ is the smallest index exceeding $n$ such that $x \in I_h$. If
a largest index $k$ exceeding $n$ for which $x \in I_k$ exists, we have

$$f(x) = f_k(x) \le f_k(r_k) < f_k(r_n) \le f(r_n)$$

by (a) and (b). If no such largest $k$ exists, then $f(x) = \lim_{i \to \infty} f_{k_i}(x)$ where $x \in I_{k_i}$, $i = 1, 2, 3, \ldots$
and all $k_i > h$. Then

$$f_{k_i}(x) \le f_{k_i}(r_{k_i}) < f_{k_i}(r_h) \le f_h(r_h)$$

and hence $f(x) \le f_h(r_h) < f_h(r_n) \le f(r_n)$, and the proof is complete.
A SPACE FILLING CURVE


Liu WEN 
Department of Mathematics, Graduate Center of the City University of New York, New York, NY 10036 
Hebei Institute of Technology, Tianjin, the People’s Republic of China 


The purpose of this paper is to give an example of a space filling curve which is simpler than 
previous examples (cf. [1], 455-458). 
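Divide the interval [0, 1] into ten equal subintervals:

$$\delta_k = [k/10, (k + 1)/10], \quad k = 0, 1, \ldots, 9.$$

Let $f$ and $g$ be continuous real valued functions defined on [0, 1] which satisfy the following
conditions:

$$\text{(i)} \quad f(t) = \begin{cases} 0 & \text{when } t \in \delta_1 \cup \delta_3, \\ 1 & \text{when } t \in \delta_5 \cup \delta_7; \end{cases} \qquad g(t) = \begin{cases} 0 & \text{when } t \in \delta_1 \cup \delta_5, \\ 1 & \text{when } t \in \delta_3 \cup \delta_7; \end{cases}$$

(ii) $f(0) = f(1)$ and $g(0) = g(1)$.

Extending $f$ and $g$ to $(-\infty, +\infty)$ periodically, we get two continuous periodic functions which are
still denoted $f$ and $g$ respectively.

Let

$$\phi(t) = \sum_{k=1}^{\infty} \frac{f(10^{k-1} t)}{2^k}, \qquad \psi(t) = \sum_{k=1}^{\infty} \frac{g(10^{k-1} t)}{2^k}.$$

Obviously $\phi$ and $\psi$ are continuous. We will now show that

$$x = \phi(t), \quad y = \psi(t), \qquad 0 \le t \le 1,$$

is a curve filling the unit square.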
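In fact, let $x$ and $y$ be arbitrary real numbers in (0, 1) with base 2 expansions

$$x = .x_1 x_2 \ldots x_n \ldots, \ \text{each } x_n = 0 \text{ or } 1; \qquad y = .y_1 y_2 \ldots y_n \ldots, \ \text{each } y_n = 0 \text{ or } 1.$$

Let

$$t_n = \begin{cases} 1 & \text{if } x_n = 0 \text{ and } y_n = 0, \\ 3 & \text{if } x_n = 0 \text{ and } y_n = 1, \\ 5 & \text{if } x_n = 1 \text{ and } y_n = 0, \\ 7 & \text{if } x_n = 1 \text{ and } y_n = 1. \end{cases}$$

Now let $t$ be the number whose base 10 expansion is

$$t = .t_1 t_2 \ldots t_n \ldots.$$

It follows immediately from the definitions that

$$x = \phi(t), \quad y = \psi(t),$$

and so the curve passes through every point of the unit square.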
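
The digit argument can be checked with exact rational arithmetic. The Python sketch below is only an illustration under stated assumptions: it takes the series to be $\phi(t) = \sum_k f(10^{k-1}t)/2^k$ and $\psi(t) = \sum_k g(10^{k-1}t)/2^k$, it picks one piecewise linear pair $f$, $g$ satisfying (i) and (ii) (many other choices would do), and the helper names `piecewise_linear`, `curve` and `preimage` are hypothetical.

```python
from fractions import Fraction as F

def piecewise_linear(knots):
    """Return the 1-periodic function interpolating the (t, value) knots on [0, 1]."""
    def h(t):
        t = F(t) % 1  # periodic extension to the whole real line
        for (t0, v0), (t1, v1) in zip(knots, knots[1:]):
            if t0 <= t <= t1:
                return v0 + (v1 - v0) * (t - t0) / (t1 - t0)
        raise ValueError("t outside [0, 1]")
    return h

# Assumed choice: f = 0 on [0, 0.4], f = 1 on [0.5, 0.8], f(0) = f(1) = 0.
f = piecewise_linear([(F(0), F(0)), (F(4, 10), F(0)), (F(5, 10), F(1)),
                      (F(8, 10), F(1)), (F(1), F(0))])
# Assumed choice: g = 0 on [0, 0.2] and [0.5, 0.6], g = 1 on [0.3, 0.4] and [0.7, 0.8].
g = piecewise_linear([(F(0), F(0)), (F(2, 10), F(0)), (F(3, 10), F(1)),
                      (F(4, 10), F(1)), (F(5, 10), F(0)), (F(6, 10), F(0)),
                      (F(7, 10), F(1)), (F(8, 10), F(1)), (F(1), F(0))])

def curve(t, terms):
    """Partial sums of phi(t) and psi(t) with the given number of terms."""
    x = sum(f(10 ** (k - 1) * t) / 2 ** k for k in range(1, terms + 1))
    y = sum(g(10 ** (k - 1) * t) / 2 ** k for k in range(1, terms + 1))
    return x, y

def preimage(x_bits, y_bits):
    """Decimal t whose n-th digit is 1, 3, 5 or 7 according to (x_n, y_n)."""
    digits = [1 + 4 * xb + 2 * yb for xb, yb in zip(x_bits, y_bits)]
    return sum(F(d, 10 ** n) for n, d in enumerate(digits, start=1))

# x = .101 (base 2) = 5/8 and y = .011 (base 2) = 3/8 give the digits 5, 3, 7.
t = preimage([1, 0, 1], [0, 1, 1])
print(t, curve(t, 10))
```

Only finitely many binary digits are supplied here, so the remaining decimal digits of `t` are 0 rather than 1; with this particular choice both functions vanish at 0, so the later terms contribute nothing and the partial sums equal $x$ and $y$ exactly.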
SUMMING POWER SERIES WITH POLYNOMIAL COEFFICIENTS


JOHN KLIPPERT 
Department of Mathematics and Computer Science, James Madison University, 
Harrisonburg, Virginia 22807 


Third semester calculus students quickly learn that

$$\sum_{n=0}^{\infty} x^n = \frac{1}{1 - x} \quad \text{for } |x| < 1$$

by showing that the sequence of partial sums


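But by rewriting a power series in terms of finite differences, we can almost as easily sum any
power series with polynomial coefficients. Consider the series $f(x) = \sum_{n=0}^{\infty} a_n x^n$ with positive
radius of convergence. Then

$$f(x) = a_0 + x\phi(x) \quad \text{where} \quad \phi(x) = \sum_{n=1}^{\infty} a_n x^{n-1} = \sum_{n=0}^{\infty} a_{n+1} x^n.$$

So

$$\begin{aligned}
(1 - x)\phi(x) &= \sum_{n=0}^{\infty} a_{n+1} x^n - \sum_{n=0}^{\infty} a_{n+1} x^{n+1} \\
&= \sum_{n=0}^{\infty} a_{n+1} x^n - \sum_{n=1}^{\infty} a_n x^n = \sum_{n=0}^{\infty} a_{n+1} x^n - \sum_{n=0}^{\infty} a_n x^n + a_0 \\
&= a_0 + \sum_{n=0}^{\infty} (a_{n+1} - a_n) x^n = a_0 + \sum_{n=0}^{\infty} \Delta a_n x^n,
\end{aligned}$$

with $\Delta a_n = a_{n+1} - a_n$, $n = 0, 1, 2, \ldots$.
Therefore if $x \ne 1$ and $x \in D$, the domain of $f$, we have

$$\phi(x) = \frac{a_0}{1 - x} + \frac{1}{1 - x} \sum_{n=0}^{\infty} \Delta a_n x^n.$$

Thus

$$f(x) = a_0 + x\phi(x) = a_0 + \frac{a_0 x}{1 - x} + \frac{x}{1 - x} \sum_{n=0}^{\infty} \Delta a_n x^n.$$

That is,

$$\sum_{n=0}^{\infty} a_n x^n = \frac{a_0}{1 - x} + \frac{x}{1 - x} \sum_{n=0}^{\infty} \Delta a_n x^n, \quad x \ne 1, \ x \in D. \tag{1}$$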


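Putting $\Delta^2 a_n = \Delta(\Delta a_n) = \Delta a_{n+1} - \Delta a_n$ (and inductively,
$\Delta^p a_n = \Delta(\Delta^{p-1} a_n) = \Delta^{p-1} a_{n+1} - \Delta^{p-1} a_n$, $p \in \mathbf{N}$)
and applying (1) to $\sum_{n=0}^{\infty} \Delta a_n x^n$, we have

$$\sum_{n=0}^{\infty} \Delta a_n x^n = \frac{\Delta a_0}{1 - x} + \frac{x}{1 - x} \sum_{n=0}^{\infty} \Delta^2 a_n x^n$$

and so

$$\begin{aligned}
\sum_{n=0}^{\infty} a_n x^n &= \frac{a_0}{1 - x} + \frac{x}{1 - x} \left[ \frac{\Delta a_0}{1 - x} + \frac{x}{1 - x} \sum_{n=0}^{\infty} \Delta^2 a_n x^n \right] \\
&= \frac{a_0}{1 - x} + \frac{x \Delta a_0}{(1 - x)^2} + \left( \frac{x}{1 - x} \right)^2 \sum_{n=0}^{\infty} \Delta^2 a_n x^n.
\end{aligned}$$

Repeating $p$ times in succession, we get

$$\sum_{n=0}^{\infty} a_n x^n = \frac{a_0}{1 - x} + \frac{x \Delta a_0}{(1 - x)^2} + \cdots + \frac{x^{p-1} \Delta^{p-1} a_0}{(1 - x)^p} + \left( \frac{x}{1 - x} \right)^p \sum_{n=0}^{\infty} \Delta^p a_n x^n.$$

That is,

$$\sum_{n=0}^{\infty} a_n x^n = \sum_{k=0}^{p-1} \Delta^k a_0 \frac{x^k}{(1 - x)^{k+1}} + \left( \frac{x}{1 - x} \right)^p \sum_{n=0}^{\infty} \Delta^p a_n x^n, \quad x \ne 1, \ x \in D, \tag{2}$$

where we define $\Delta^0 a_0 = a_0$.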
Now if $a_n = P(n)$, a polynomial of degree $\le p - 1$, we have

$$\Delta^p P(n) = 0, \quad n = 0, 1, 2, \ldots.$$

Observing that the radius of convergence of $\sum_{n=0}^{\infty} P(n) x^n$ is 1, we have according to (2) that

$$\sum_{n=0}^{\infty} P(n) x^n = \sum_{k=0}^{p-1} \Delta^k P(0) \frac{x^k}{(1 - x)^{k+1}}, \quad |x| < 1. \tag{3}$$

For example, if $P(n) = n^2 + n + 1$, then $\Delta a_n = 2n + 2$, $\Delta^2 a_n = 2$ and $\Delta^3 a_n = 0$. In particular,
$P(0) = 1$, $\Delta P(0) = 2$, $\Delta^2 P(0) = 2$. Putting $p = 3$ into (3), we have

$$\sum_{n=0}^{\infty} P(n) x^n = \sum_{k=0}^{2} \Delta^k P(0) \frac{x^k}{(1 - x)^{k+1}}$$

and so

$$\begin{aligned}
\sum_{n=0}^{\infty} (n^2 + n + 1) x^n &= P(0) \frac{1}{1 - x} + \Delta P(0) \frac{x}{(1 - x)^2} + \Delta^2 P(0) \frac{x^2}{(1 - x)^3} \\
&= \frac{1}{1 - x} + \frac{2x}{(1 - x)^2} + \frac{2x^2}{(1 - x)^3}.
\end{aligned}$$

Finally, we remark that the transform (2) can also be used to accelerate the convergence of a
series when $\Delta^p a_n$ converges to 0 faster than $a_n$ (assuming that $a_n$ does converge to 0).
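
Formula (3) is easy to try numerically. The following Python sketch, offered as an illustration rather than as part of the note, builds the forward differences $\Delta^k P(0)$ from the values $P(0), \ldots, P(p-1)$ and compares the closed form with a long partial sum; the function names are hypothetical, and exact rationals are used so that the closed form is evaluated without rounding.

```python
from fractions import Fraction

def forward_differences_at_zero(P, p):
    """Return [P(0), ΔP(0), ..., Δ^(p-1)P(0)] from the values P(0), ..., P(p-1)."""
    row = [Fraction(P(n)) for n in range(p)]
    diffs = []
    while row:
        diffs.append(row[0])                          # leading entry is Δ^k P(0)
        row = [b - a for a, b in zip(row, row[1:])]   # next difference row
    return diffs

def closed_form_sum(P, p, x):
    """Right-hand side of (3): sum over k < p of Δ^k P(0) x^k / (1 - x)^(k + 1)."""
    x = Fraction(x)
    return sum(d * x ** k / (1 - x) ** (k + 1)
               for k, d in enumerate(forward_differences_at_zero(P, p)))

def P(n):
    return n * n + n + 1

x = Fraction(1, 2)
exact = closed_form_sum(P, 3, x)                   # closed form with p = 3
partial = sum(P(n) * x ** n for n in range(200))   # 200-term partial sum
print(exact, float(partial))
```

For $x = 1/2$ the three terms of the closed form are $2 + 4 + 4 = 10$, which the partial sum approaches.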
E 2993. Proposed by Michael Larsen, Student, Harvard University.


Let $a_1, a_2, \ldots, a_n$ be complex numbers such that $\sum_i a_i^m$ is an integer for every positive integer
$m$. Show that the polynomial $\prod_i (x - a_i)$ has integer coefficients.


E 2994. Proposed by John E. Wetzel, University of Illinois. 


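Let $\mathbf{Z}^n$ be the set of lattice points in $\mathbf{R}^n$, the set of points all of whose coordinates are integers.
For any subset $A$ of $\mathbf{Z}^n$ write $\mathcal{D}(A)$ for the distance set of $A$, the set of distances $XY$ for $X$, $Y$ in $A$.

(a) Show that $\mathbf{Z}^n$ can be partitioned into two subsets $A$ and $B$ so that neither $\mathcal{D}(A)$ nor $\mathcal{D}(B)$
equals $\mathcal{D}(\mathbf{Z}^n)$.

(b) Show that if $A \cup B = \mathbf{Z}^n$ but $A \cap B \ne \emptyset$, then either $\mathcal{D}(A)$ or $\mathcal{D}(B)$ must equal $\mathcal{D}(\mathbf{Z}^n)$.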


SOLUTIONS OF ELEMENTARY PROBLEMS 
Fair Division of a Cake 


E 2895 [1981, 444]. Proposed by M. Vulis, CUNY Graduate Center. 


The children at a birthday party sit around a table and divide a homogeneous cake according 
to the following rules: Initially the birthday child divides the cake into two equal portions, giving 
one portion to one of the two neighboring children and keeping the other. At the next and 
succeeding steps, some pair of adjacent children combine their portions and divide equally. (Thus 
if two adjacent children have portions $a$ and $b$ before division and share them, each has a portion
$\frac{1}{2}(a + b)$ after the division. Note that either $a$ or $b$ can be zero.)

The process terminates in a finite number of steps with a fair division of the cake. How many 
children are at the party? 


Solution by David M. Wells, Pennsylvania State University, New Kensington. There are 4
children at the party. To establish this, identify the children with the group of integers (mod $n$).
Let $f_j(k)$ denote the amount of cake held by child $k$ after $j$ steps. The proof depends on two
propositions, each established by induction on $j$.


PROPOSITION 1: There are integers $k_1$ and $k_2$ for which $f_j(k_1) \le f_j(k_1 + 1) \le \cdots \le f_j(k_2)$ and
$f_j(k_2) \ge f_j(k_2 + 1) \ge \cdots \ge f_j(k_1)$.


PROPOSITION 2: If $n > 4$ there are at most two consecutive values of $k$ for which $f_j(k) \ge 1/n$.


The proof of the first is tedious but straightforward. To prove the second, suppose the 
statement is true after j steps. We may assume with no loss of generality that f(1) = 1/n, 
f,(2) > 1/n, and f,(3) < 1/n. Proposition 1 then guarantees that f(k) < 1/n for 3 <k <n, and 
by assumption f(k) < 1/n for 3 <k <n — 1. Thus f, (2) = (3) > 1/n, since 2f,4 (k) = 1. 

Proposition 2 insures that $n \le 4$, and the statement of the problem implies that $n \ge 3$.
However, $f_j(k)$ is always a rational number whose denominator is a power of 2, so $n \ne 3$.
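
That four children can in fact reach a fair division is not spelled out in the solution; the short Python sketch below exhibits one sequence of three sharing steps that does it. It is an illustration only: the seating order, the function name `share`, and the choice of steps are assumptions.

```python
from fractions import Fraction

def share(portions, i):
    """Children i and i + 1 (mod n) pool their portions and split them equally."""
    n = len(portions)
    j = (i + 1) % n
    half = (portions[i] + portions[j]) / 2
    new = list(portions)
    new[i] = new[j] = half
    return new

# Four children around the table; child 0 is the birthday child holding the whole cake.
state = [Fraction(1), Fraction(0), Fraction(0), Fraction(0)]
history = [state]
for first in (0, 1, 3):   # pairs (0, 1), then (1, 2), then (3, 0)
    state = share(state, first)
    history.append(state)

for step in history:
    print([str(portion) for portion in step])
```

Every portion produced this way has a power of 2 as denominator, which is the observation that excludes $n = 3$.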


Also solved by O. P. Lossers (Netherlands), K. Wayland, and the proposer. 
An Eulerian Circuit with No Crossings 


E 2897 [1981, 537]. Proposed by David Singmaster, Polytechnic of the South Bank, London, 
UK. 


Consider a planar graph $G$ and a path $P$ in it. If the path visits a vertex two or more times, we
say there is a crossing at the vertex if the path would cross itself when the vertex is viewed as a
road intersection; e.g., in the figure on page 288, a path containing ..., 1, 7, 4, ..., 2, 7, 5, ... has a
crossing, while a path containing ..., 1, 7, 2, ..., 3, 7, 6, ..., 4, 7, 5, ... has no crossing. Prove or
disprove: A connected planar graph with every vertex of even degree has an Eulerian circuit with
no crossings.

FIG. 1


Solution by Jerrold W. Grossman, Oakland University, and E. M. Reingold, University of 
Illinois. This problem is solved in the affirmative in “Construction of Planar Eulerian Multi- 
graphs,” in Proc. Tenth Southeastern Conf. on Combinatorics, Graph Theory and Computing, 1979, 
pp. 123-130, by J. Abraham & A. Kotzig. Here is a simpler proof, by induction on the maximum 
degree d of a vertex and, within the class of graphs with the same maximum degree, the number of 
vertices with the maximum degree. Clearly if d = 2, then every Euler circuit has no crossings. 
Assume the inductive hypothesis and let v be a vertex of the graph G of degree d. Split v into two 
vertices, one with degree two and incident to two neighboring edges of G, the other incident to all 
its other edges of G. If the resulting graph is connected, then its Euler circuit with no crossings 
clearly induces such a circuit in G. If not, then the Euler circuits without crossings in the two 
components can be spliced together at v to form the desired Euler circuit in G. See the subjoined 
figure, in which the dotted sectors contain no edges. 
A related and I believe still unsolved problem is due to H. Fleischner: if the plane, Eulerian
graph G is 3-connected (i.e., removal of two vertices cannot disconnect the graph) and has no 
loops or multiple edges, then is there necessarily an Euler circuit which at each vertex leaves along 
one of the two edges neighboring the edge from which it entered? 


Also solved by R. B. Eggleton & D. K. Skilton (Australia), M. L. Marx, E. Scheinerman, R. Statman, and the 
proposer. 

Marx cited his article (with L. Lovasz), A forbidden substructure characterization of Gauss codes, Szeged Acta, 38 
(1976) 115-119. 
ADVANCED PROBLEMS


Solutions of these Advanced Problems should be mailed in duplicate to Professor G. L. Alexanderson, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by August 31, 1983. The solver’s full 
post-office address should be on each sheet. 


6423. Proposed by P. R. Halmos, Indiana University. 


If $A$ is a bounded linear operator on a Hilbert space, and if

$$\|1 - A\| < 1,$$

then $A$ is invertible. That assertion, not only for operators on Hilbert spaces, but for arbitrary
elements of Banach algebras, is usually proved by consideration of the infinite series $\sum_{n=0}^{\infty} (1 - A)^n$,
and, therefore, the proof uses completeness. Is the assertion true without completeness? Is there, in
other words, a bounded linear operator $A$ on an inner product space, such that $\|1 - A\| < 1$ but $A$
is not invertible?


6424. Proposed by Michael J. Dixon, California State University, Chico. 


Let $m$ and $k$ be positive integers. Show that

$$S(m, k) = \sum_{n=1}^{\infty} \frac{1}{n(mn + k)}$$

is rational if and only if $k \equiv 0 \pmod{m}$.


6425. Proposed by J. Miles and L. Rubel, University of Illinois. 


For the Koebe function $K(z) = z/(1 - z)^2$, show that there exists a bounded analytic function
B(z) in the unit disc D such that F(z) = K(z) + B(z) maps D onto the whole complex plane C. 
Is the same still true for an arbitrary analytic function K(z) in D that maps D onto a dense subset 
of C? 


6426. Proposed by J. L. Brenner, Palo Alto, California. 


(a) Show that every finite group can be embedded in one or another of the finite groups 
PSL(n, q). (b) The same for O*(n, q). (c)* For fixed $q$, find the smallest $n = n(k)$ such that
the symmetric group $S_k$ can be embedded in PSL(n, q). (Take representative values of $q$, $k$.)


SOLUTIONS OF ADVANCED PROBLEMS 
The Hankel Matrix Associated with a Power Series 


6313 [1980, 675]. Proposed by L. W. Tu, University of Michigan. 


Let $f(t) = \sum a_i t^i$ be a monic power series ($a_0 = 1$; $a_i = 0$ if $i < 0$). Let $A_{p,q}(f)$ be the $q \times q$
Hankel matrix with $(i, j)$ element $a_{p-i+j}$. Prove that $\det A_{p,q}(1/f) = (-1)^{pq} \det A_{q,p}(f)$.


[Note that, with $q = 1$, this gives a determinantal formula for the coefficients of $1/f$. In the
special case that $f(t)$ is the quotient of polynomials of respective degrees $p$, $q$, the result seems to
be known.]
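
The identity can be spot-checked on integer coefficients. The Python sketch below is an illustration under an explicit assumption about the indexing, namely that $A_{p,q}(f)$ is the $q \times q$ matrix with $(i, j)$ element $a_{p-i+j}$ (with $a_k = 0$ for $k < 0$) and that the claim reads $\det A_{p,q}(1/f) = (-1)^{pq} \det A_{q,p}(f)$. The sample coefficients and the helper names are arbitrary choices.

```python
def inverse_series(a, n):
    """First n coefficients of 1/f for f = sum a[i] t^i with a[0] == 1."""
    b = [1]
    for m in range(1, n):
        # Coefficient of t^m in f * (1/f) must vanish.
        b.append(-sum(a[i] * b[m - i] for i in range(1, m + 1)))
    return b

def matrix(c, p, q):
    """q x q matrix with (i, j) entry c[p - i + j] (1-based), zero for negative index."""
    def coef(k):
        return c[k] if k >= 0 else 0
    return [[coef(p - i + j) for j in range(1, q + 1)] for i in range(1, q + 1)]

def det(m):
    """Determinant by cofactor expansion along the first row (fine for small sizes)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

a = [1, 2, -1, 3, 5, -2, 4]          # arbitrary monic sample series
b = inverse_series(a, len(a))        # coefficients of 1/f
checks = {(p, q): det(matrix(b, p, q)) == (-1) ** (p * q) * det(matrix(a, q, p))
          for p in range(1, 4) for q in range(1, 4)}
print(checks)
```

With $q = 1$ the check reduces to the classical determinant formula for the coefficients of $1/f$ mentioned in the note.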


I. Solution by J. G. Horne, University of Georgia. Associate with any power series $f(t) = \sum a_i t^i$
the infinite matrix $A(f)$ whose $(i, j)$ entry is $a_{j-i}$, with the agreement that $a_k = 0$ if $k < 0$. This
association is an isomorphism from the ring of such power series under convolution to the ring of
infinite upper triangular matrices. Now assume $a_0 = 1$ and let $1/f(t) = \sum b_i t^i$. Let $r$, $s$ be any two
integers with $r > s \ge 1$. Let $A$, $B$ be the $r \times r$ "upper left hand corner" submatrices of $A(f)$ and
$A(1/f)$ respectively. Then $\det A = \det B = 1$ and the relation $f \cdot 1/f = 1$ guarantees that $B$ =
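$A^{-1}$. If $k$ is any positive integer, $n = \binom{r}{k}$ and $C$ is any square matrix, let $C^{(k)}$ denote the $n \times n$
matrix consisting of the $k \times k$ minors of $C$, arranged lexicographically. Let $\operatorname{Adj}^{(k)} C$ denote the
transpose of the matrix obtained by replacing each entry of $C^{(k)}$ by its complementary cofactor.
Now $(A^{(k)})^{-1} = (A^{-1})^{(k)}$, and when $\det A = 1$, $(A^{(k)})^{-1} = \operatorname{Adj}^{(k)} A$. Therefore,

$$B^{(s)} = (A^{-1})^{(s)} = (A^{(s)})^{-1} = \operatorname{Adj}^{(s)} A.$$

This implies the following set of identities whenever $j_1, \ldots, j_s$, $j'_1, \ldots, j'_{r-s}$ are complementary
subsets of the indices $\{1, \ldots, r\}$ with $1 \le j_1 < \cdots < j_s \le r$ and $1 \le j'_1 < \cdots < j'_{r-s} \le r$:

$$\det A_s = (-1)^v \det B_s \tag{1}$$

where

$$A_s = \begin{bmatrix} a_{j'_1 - 1} & a_{j'_2 - 1} & \cdots & a_{j'_{r-s} - 1} \\ \vdots & \vdots & & \vdots \\ a_{j'_1 - (r-s)} & a_{j'_2 - (r-s)} & \cdots & a_{j'_{r-s} - (r-s)} \end{bmatrix},$$

$$B_s = \begin{bmatrix} b_{r-s-(j_1-1)} & b_{r-s-(j_1-2)} & \cdots & b_{r-j_1} \\ b_{r-s-(j_2-1)} & b_{r-s-(j_2-2)} & \cdots & b_{r-j_2} \\ \vdots & \vdots & & \vdots \\ b_{r-s-(j_s-1)} & b_{r-s-(j_s-2)} & \cdots & b_{r-j_s} \end{bmatrix},$$

and

$$v = \frac{(r-s)(r-s+1)}{2} + j'_1 + \cdots + j'_{r-s}.$$

The equality (1) follows from the observation that

$$A_s = A \begin{pmatrix} 1 & 2 & \cdots & r-s \\ j'_1 & j'_2 & \cdots & j'_{r-s} \end{pmatrix}$$

and

$$B_s = B \begin{pmatrix} j_1 & j_2 & \cdots & j_s \\ r-s+1 & r-s+2 & \cdots & r \end{pmatrix}$$

and the fact that $B^{(s)} = \operatorname{Adj}^{(s)} A$.

To see that these identities include those in the proposed problem, let $p$, $q$ be arbitrary positive
integers. Set $r = p + q$, $s = p$. Take $j_i = i$, $i = 1, \ldots, p$, $j'_i = p + i$, $i = 1, \ldots, q$, then note that $A_s$
reduces to $A_{p,q}(f)$, and $B_s$ reduces to $A_{q,p}(1/f)$ and $(-1)^{pq} = (-1)^v$. The set of identities (1) is
much larger than this set.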


II. Solution by D. M. Jackson, University of Waterloo, Canada. We set $f^{-1} = 1 + \sum_{i \ge 1} b_i t^i$, and
let $B$ [$A$] be $(p + q) \times (p + q)$ Hankel matrices with constant diagonals

$$0, \ldots, 0, b_0, b_1, \ldots, b_{p+q-1} \quad [0, \ldots, 0, a_0, a_1, \ldots];$$

$b_0$ [$a_0$] is the principal diagonal. ($b_0 = a_0 = 1$.) Assume without loss of generality that $p \ge q$, and
partition $B$ [$A$] into a $3 \times 3$ matrix of boxes $B_{i,j}$ [$A_{i,j}$], of respective principal sizes $q \times q$, $(p - q)
\times (p - q)$, $q \times q$. Then since $BA = I$, we have

$$B_{11}A_{13} + B_{12}A_{23} + B_{13}A_{33} = 0, \ \text{etc.} \tag{*}$$

We set

$$C_{p \times p} = \begin{bmatrix} B_{11} & B_{12} \\ B_{21} & B_{22} \end{bmatrix}, \qquad D_{p \times p} = \begin{bmatrix} A_{12} & A_{13} \\ A_{22} & A_{23} \end{bmatrix}.$$

In this notation, we have to prove $\det D = (-1)^{pq} \det B_{13}$. (Note $\det C = 1$ since $B_{21} = 0$.) Using
(*) and similar relations, and also $A_{32} = 0$, we find

$$CD = \begin{bmatrix} 0 & -B_{13}A_{33} \\ I & -B_{23}A_{33} \end{bmatrix}.$$

But $\det A_{33} = 1$, and elementary transformations show that

$$\det CD = [(-1)^q]^{p-q} \det(-B_{13}A_{33}).$$

To see this, we compute $\begin{bmatrix} 0 & I_{(p-q) \times (p-q)} \\ I_{q \times q} & 0 \end{bmatrix} CD$. Since $-B_{13}A_{33}$ is a $q \times q$ matrix, this gives

$$\det CD = (-1)^{q(p-q)+q} \det B_{13}.$$

Thus $\det CD = (-1)^{pq} \det B_{13}$, from which the result follows.

The proposer notes that the formula appears to have a long history. He refers to formula (36)
in J. Hadamard's Essai sur l'étude des fonctions données par leur développement de Taylor, J.
Math., 8 (1892) 101-186.


Collections of Topologies 


6357 [1981, 623]. Proposed by Zachary Robinson, student, Massachusetts Institute of Technol- 
ogy. 


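Let $X$ be a nonempty set. Let $\mathcal{S}$ be a collection of topologies on $X$, well ordered by strict
inclusion. Let $\tau$ be the topology generated by the base $\bigcup \mathcal{S}$. Prove or disprove: If $\tau$ is second
countable, then $\mathcal{S}$ is countable.


Solution by Douglas E. Cameron, The University of Akron. The assertion is false. Let $X$ be the
set of real numbers $\mathbf{R}$, and let $\le$ be a well ordering of $\mathbf{R}$. For $\alpha \in \mathbf{R}$, let $\tau_\alpha$ be the least topology
on $\mathbf{R}$ such that $\mathbf{R} - \{\beta\} \in \tau_\alpha$ for $\beta \le \alpha$. Then $\bigcup_{\alpha \in \mathbf{R}} \tau_\alpha$ is a base for $\mathcal{F}$, the topology of finite
complements on $\mathbf{R}$. Also $\tau_\alpha \subset \tau_\beta$ if $\alpha \le \beta$ and $\alpha \ne \beta$. Let $\tau$ be the usual topology on $\mathbf{R}$. Then

$$\mathcal{S} = \{\tau\} \cup \{\tau_\alpha \mid \alpha \in \mathbf{R}\}$$

is well ordered by strict set inclusion, $\bigcup \mathcal{S} = \tau$ is second countable and $|\mathcal{S}| = |\mathbf{R}|$.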
Also solved by F. S. Cater, A. K. Desai (India), K. P. Hart (The Netherlands), and the proposer. 


A Dirichlet Character Sum 


6359. Proposed by Ronald J. Evans, University of California, San Diego. 


Let $\chi$ be a nontrivial Dirichlet character, and let $a$ be complex. Show that $\sum \chi(p)(\log p)^a p^{-s}$
converges for $\operatorname{Re}(s) \ge 1$, where the sum is over all primes $p$.


Solution by the proposer. Let $A(x) = \sum_{p \le x} \chi(p) \log p$ for $x \ge 1$. After grouping the primes
$p \le x$ into congruence classes, the prime number theorem for arithmetic progressions with error
term can be applied to show that $A(x) = O(x/(\log x)^B)$ for any constant $B > 0$. Thus

$$\begin{aligned}
\sum \chi(p)(\log p)^a p^{-s} &= \int_1^{\infty} (\log x)^{a-1} x^{-s} \, dA(x) \\
&= \int_1^{\infty} A(x)\, x^{-s-1} \{ s(\log x)^{a-1} - (a - 1)(\log x)^{a-2} \} \, dx,
\end{aligned}$$

and the last integral converges absolutely for $\operatorname{Re}(s) \ge 1$. (The above equality, of course, arises
after letting a finite upper index of summation approach infinity.)


Conjugates in Division Rings 


6361 [1981, 623]. Proposed by Paul R. Chernoff, University of California, Berkeley. 


Let $D$ be a division ring, $a \in D^*$, the group of nonzero elements. Suppose that $a$ has only
finitely many conjugates. Prove that $a$ is in $k$, the center of $D$. (Application: a polynomial $p$ with
coefficients in $k$, irreducible over $k$ but with a root in $D$, has infinitely many roots in $D$.)


Solution by Fergus J. Gaines, University College, Belfield, Dublin. Since, by Wedderburn's
theorem, a finite division ring is a field, we may assume that $D$ is infinite. Let $a \in D^*$ have finitely
many conjugates. Let $C(a) = \{x \in D^* : ax = xa\}$. Then $C(a)$ is a subgroup of $D^*$ and its index,
which equals the number of conjugates of $a$ in $D^*$, is finite. Suppose there exists $x \in D^*$ with
$x^{-1}ax \ne a$. We claim that $C(a)$ is finite. To see this, note that there are only finitely many
elements of the form $(b + x)^{-1} a (b + x)$ where $b \in C(a)$. Now let $b, c \in C(a)$ with

$$(b + x)^{-1} a (b + x) = (c + x)^{-1} a (c + x).$$

Then $(b + x)(c + x)^{-1} = d \in C(a)$. This gives $d = 1$, since $d \ne 1$ implies $x \in C(a)$, because
$(d - 1)x = b - dc$. Thus $b = c$. So $C(a)$ is finite and has finite index in $D^*$. Thus $D$ is finite.
This contradiction implies $a \in k$, the centre of $D$.


Also solved by Jeffrey Bergen, Jerzy Browkin (Poland), F. S. Cater, Thomas C. Craven, Hugh M. Edgar, 
Lorraine L. Foster, Gary R. Greenfield, A. A. Jagers (The Netherlands), Michael Josephy (Costa Rica), Charles 
Lanski, Susan Montgomery, R. W. K. Odoni, A. J. Pianzola (Canada), David R. Richman, Tom Sadler, Rony Teitler 
(U.K.), Ernst Trost (Switzerland), Z. Z. Uoiea, William C. Waterhouse, Gregory P. Wene, and the proposer. 

A number of the solvers observed that a solution appears in I. N. Herstein, Conjugates in division rings, Proc. 
Amer. Math. Soc., 7 (1956) 1021—1022. The solution above is simpler and more elementary than Herstein’s. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER, CARLETON COLLEGE 


Geometry, Particles and Fields. By Bjorn Felsager. Odense University Press, Odense, Denmark, 
1982. xx + 643 pp., Dan. Kr. 110.00. 


CLIFFORD HENRY TAUBES 
Department of Physics, Harvard University, Cambridge, MA 02138 


To write a report for a mathematics journal on a manuscript partly about physics is at best a 
dangerous task. The reason is that, in the century past, mathematics and physics became distinct. 
It is, therefore, useful to explain first what physics is and does. The distinction between 
mathematics and physics is this: while physics explores our natural world, mathematics explores 
all possible worlds. Physics itself is a symbiotic pair. Half of it is experimental physics, which 
seeks to uncover and describe new phenomena: to investigate, map, probe, and quantify the 
events that occur in our natural world. It is a glorious crusade, “going where no man has gone 
before.” Perhaps it is our destiny. To the experimental physicist, “data” is the operative word. 

The other partner in the symbiosis is the discipline called Theoretical Physics. A theoretical 
physicist is the modern theologian. He seeks, in almost a biblical sense, to understand the data, to 
perceive the patterns in the chaos, and ultimately to predict. The operative word is “understand.” 
This, too, is our destiny. (In antiquity, the symbiotic split was less pronounced: the same priests
who nightly recorded the positions of the sun, moon, planets and stars also made predictions of 
eclipses from their data. Now we live in a society of specialists.) 

The symbiosis spins as a cycle; the experimentalist discovers new data; the theorist must then
deduce a new pattern that fits the data. The theorist predicts from the pattern phenomena for
which the experimentalist probes, the probes producing again new data.

Supported in part by Harvard Society of Fellows and the National Science Foundation under Grant PHY79-16812.

The theoretical physicist describes the patterns he sees with mathematics. In order to predict 
new phenomena, a mathematical model is made. Here, one must think, in the simplest example, of 
a simple polynomial expression: y = ax + b. This expression expresses the output, called y, in 
terms of the input, called x, and parameters, called a and b. The experimentalist inputs a string of 
x’s and measures with machines the values of y. Later, the theorist comes to conclude that y is 
related to x, as given by the equation above, with “a” and “b” as “such and such,” and that when 
the experimentalist tries x =“doo,” he will surely record that y = “daa.” 

If the theoretician truly suspects that there is a polynomial modeling our universe, then he will 
certainly learn as much as he can about polynomial equations. The knowledge allows him to 
publish predictions that will test the power of the proposed polynomial model. 

Before you realize it, you have a physicist who is doing abstract algebraic geometry. Indeed, 
one may argue that such a scenario certainly suggests how the human race started to study 
geometry in the first place. The point I am pushing is that theoretical physics and modern 
mathematics are mutually entwined. Indeed, it has seemed, at least in the past, that finding new 
physics leads to learning (or inventing) new math. 

If I may return to my recent example: when the measuring machine, matching x’s with y’s 
starts to suggest a sinusoidal situation, then the theoretical physicist had better discover trigono- 
metric functions, or else retire. 

Mathematics is a tool for the physicist and for this reason a student of physics should be 
encouraged to learn the elegant formulations of his physical theories. (There is always the danger 
of some student finding the tool more tantalizing than the physics, but the world needs young 
mathematicians as well.) Professor Felsager has written a book that will introduce seniors and new 
graduate students in physics to the elegant mathematics one discovers while studying electro-magnetic 
phenomena and quantum mechanics. 

The study of electro-magnetism is the study of electric fields, magnetic fields, and the forces 
they put on passing particles. The subject finds phenomena from fluorescent lights to lightning 
bolts to aurora borealis and electric eels. 

The mathematical description of the electro-magnetic field is most elegantly elucidated by the 
language of modern differential geometry and topology. This is the language that Professor 
Felsager teaches. But he presents more than a language; for a new language opens a new window 
with which to view the physical world. 

The language of differential geometry allows one to ask new questions which were not even 
poseable before. There is such a question whose answer may come to be the fundamental 
discovery of this half century. The question and the reasoning that led to its asking is readily 
recounted. Recall from childhood playing with magnets. They came in all sizes, colors: bar 
magnets, and horseshoe magnets. But no matter which, they all had two ends; all of mine were 
marked on one end with an N and the other with an S: the north pole and the south pole. Poles of 
the same kind repel each other while unlike poles attract. I never saw a magnet with only one pole. 
In fact, every magnet ever encountered or built has two poles; one north and the other south. The 
mathematics of electromagnetism is the mathematics of line bundles, covariant derivatives, and 
curvature. This dipole phenomenon is described completely by the differential geometry of line 
bundles; but remarkably the phenomenon is specific to line bundles, as opposed to vector bundles 
of higher rank. 

Thinking about differential geometry led physicists to speculate about the nature of a world 
whose forces were modeled on vector bundles of higher rank. Indeed, such vector bundle models 
would better describe our world. There are forces in nature other than electricity and magnetism 
and these models unify all the forces of nature as facets of one object: a connection on a vector 
bundle of high rank. This is a beautiful proposition. But if this Grand Unified Theory, as it is 
called, is an accurate model, there most certainly should exist magnetic monopoles. These are the 
as yet unencountered pieces of matter with just one pole, either north or south. The critical 
question is whether there is or is not a real monopole out there, somewhere. The experimental 
physicists are strenuously seeking it, or its telltale traces. 

The second subject of Professor Felsager’s book is quantum mechanics. This is treated with 
considerably less rigor, and perhaps reasonably so: it is a much more difficult subject. Indeed, the 
welding of quantum mechanics with electro-magnetism (called QED, quantum-electrodynamics) 
by a well-defined mathematical model has eluded the theorists today. Theoretical physicists have 
assembled an algorithm to calculate quantities in QED. The quantities calculated using the 
algorithm are amazingly accurate. But today there is little basis for believing the algorithm except 
that it has never yet been wrong. Is there a mathematical model behind it? Investigations have 
suggested an interesting hypothesis: only theories based on the differential geometry of vector 
bundles of high rank can be welded with quantum mechanics in a rigorous mathematical model. 
The suggestion is that Grand Unified Theories are accurate because there is no way to build any 
other kind. 

The construction of the quantum Grand Unified Theories is a formidable task which is only 
just begun. The algorithms for calculating physical parameters are extremely complicated. When 
one considers the Grand Unified Theories, the nonlinearities in the theory are of primary 
importance, and so far, rigorous results are rare. 

These mathematical models are so monstrous that physicists do not know what they predict. 
The physicist has only his intuition as a guide, a large part of which comes through studying the 
differential geometric aspects of these theories. 

The application of differential geometry to calculate quantum amplitudes is mathematically 
justified in simple models; and Professor Felsager presents a number of these calculations. From a 
practical viewpoint, these calculations are the most useful parts of his book, because the intuition 
for Grand Unified Theories comes from studying the role of differential geometry in the simple 
situations. Here I have my major criticism of Professor Felsager’s book: he does not give enough 
references. The format is expository without providing the rigorous background to quantum field 
theory. I think the author should have given us a large selection of references to constructive 
quantum field theory and to the many “instanton” papers and calculations using differential 
geometry. It defeats the purpose to present this thought-provoking subject without also telling the 
reader where to find more detail. Except for this defect, Professor Felsager’s book is wholly 
commendable. 


An Introduction to Classical Real Analysis. By Karl R. Stromberg. Wadsworth, Belmont, Cali- 
fornia, 1981. ix + 576 pp., $29.85. 


ALBERTO TORCHINSKY 
Department of Mathematics, Indiana University, Bloomington, IN 47405 


Skip the bills, exams, reviews and other distractions at hand, play one of Hindemith’s 
wood-wind sonatas (the Beatles will also do), and, most importantly, read An Introduction to 
Classical Real Analysis, Karl Stromberg’s expert work of art and love, in the basic subject 
described by its title. Stromberg condenses over twenty years of teaching experience in an 
attractive treatise meant for those students who, having survived our merciful attempts to teach 
them Calculus without ε’s and δ’s, want to know how things can be really made to work. Such a 
book is especially welcome at this introductory level, for the main texts in print are the second 
editions of Apostol and Rudin, written in the midseventies. Both of these authors made a 
significant contribution to the study of Classical Analysis, or Advanced Calculus as the subject is 
sometimes referred to, and their highly successful texts do provide a solid preparation for 
undergraduate and beginning graduate students learning complex and abstract analysis. In fact, 
the clarity of their presentation affords in a sense an invitation to the students to “read through” 
the book. Stromberg’s book, on the other hand, while just as clear as those of Apostol and Rudin, 
advances a somewhat different point of view: the author insists that to learn Mathematics (and 
this subject in particular) hard work is necessary, and he incorporates into the flow of the 
presentation the solving of sets of exercises, some supplied with copious hints. Most of the 
exercises represent assertions to be proved and lead the students to important and interesting 
results; this is an attractive asset. 

Readers will have no difficulty in deciding that in the balance of rigor vs. intuition, the author 
has chosen an extremely rigorous approach which opens with a set of axioms introducing the real 
numbers and culminates with a discussion of the growth rate of partial sums of Fourier series. A 
safe assumption is made, namely that students know no mathematics, and a point of view is 
advanced, namely that no concept will be used until it is properly introduced; thus π is not 
mentioned until Chapter 5. The emphasis (and in some parts, as in the convergence tests for 
numerical series, it is heavily laid) is on the classical aspect of the theory. For instance, although 
the concepts of abstract topological and metric spaces are indeed introduced in Chapter 1, the 
book promptly reverts to real Euclidean space for examples, motivation, and development. 
Complex valued functions are also considered, but the theory of analytic functions is not 
discussed. As for functions of several real variables, although the Fubini-Tonelli theorem and 
formulas such as the change of variables involving Jacobians are given, the theorems of Stokes and 
Green are not covered. Students in engineering and other applied areas will be pleasantly 
surprised by a significant simplification, if that is the right word, which Stromberg proposes. The 
Lebesgue integral is introduced before other notions of integral. To support this concept 
Stromberg advances three reasons; I add two of my own: it saves time and it works well. 

The different chapters that comprise the book can be read independently, the exercises are 
challenging and fun, and the content is supported by the clear, concrete, and correct mathematical 
thinking (in both level and presentation) that is such a crucial component of the book. Through 
the chapters students can learn about the number system, sequences, and series, limits and 
continuity, differentiation, the elementary transcendental functions, integration (and its main 
applications including L_p spaces, and some differential calculus in several real variables), infinite 
series and products, and trigonometric series. Stromberg indicates to the prospective teacher how 
to program a two-semester, or three-quarter, course by marking in the index with a single asterisk 
sections that may be omitted if time presents a problem and with a double asterisk interesting and 
useful applications of the theory. 

I feel this book will become a source of inspiration for teachers and students of classical 
analysis. It has become one of my reference books and one of my first choices for the kind of 
course where students want to see the proofs of the theorems, and consequently the whole theory, 
come alive. 


LETTERS TO THE EDITOR 


Material for this department should be sent to Professor P. R. Halmos, Department of Mathematics, Indiana 
University, Bloomington, IN 47405. 


Editor: 


This section seems an appropriate place to comment on a recent “conjecture” by the editor. I 
refer to the “conjecture” described by the title of Professor Halmos’s recent essay, “Applied 
Mathematics Is Bad Mathematics,” in Mathematics Tomorrow [1]. In the event you’ve missed it, in 
the lead article of a section titled “What is Mathematics?” Professor Halmos has decided to 
address himself to this question by explaining that we have Pure Mathematics (PM) and Applied 
Mathematics (AM), and that the latter is “usually” “bad mathematics” (BM), i.e., AM = BM a.e. 
PM is not explicitly labeled “good,” but the reader is unlikely to be confused by the omission. 

Actually, we never do find out very clearly just what “bad” means (vague talk of standards of 
clarity, elegance, precision), although a sufficient number of self-serving examples of Pure vs. 
Applied categories is supplied to allow us to make some inferences; these are on the level of: 
Knowing vs. Doing, Mozart vs. military marches, Beauty vs. Boredom, and so on. Guess which is 
which. What we do get seems consciously coy (a kind of mathematical I’m-just-telling-it-like-it-is- 
folks), and largely an exhumation of a point of view amply spoken for by Hardy in A 
Mathematician’s Apology [2]. (Not everyone took Hardy’s “apology” seriously, and the review of 
[2] by F. R. Soddy [3] is well worth looking up; he refers to [2] as “such cloistral clowning,” of 
which “AM is BM” also contains its share.) 

What’s the point? After all, von Neumann (no slouch in either AM or PM) in commenting on 
these matters in “The Mathematician” (in Newman [4]) has long ago resolved the issue in a 
realistic, useful, and positive way, has he not? Von Neumann stresses the “given” character of 
applied problems, their concentrated and circumscribed nature, and the fact that an important 
AM problem represents a difficulty which “must” be resolved—there is often no opportunity to 
exercise aesthetic criteria of selection and success. He also seems uninterested in promulgating 
mathematicians’ myths of their own power and taste, and I dare say he might give the answer 
which can be given to much of the kind of “good-bad” discussion in “AM is BM”: Try it. 

The intermittent sallies by those in PM who would like to dissociate themselves from AM (or 
vice versa), while not always enlightening, often make livelier than usual professional reading. I 
imagine “AM is BM” was fun to write, and, for one opinion of “What is Mathematics?” it’s worth 
reading. How about the MONTHLY running a series of articles by others addressing the same 
question, the editor to work out the details? 


1. L. A. Steen, editor, Mathematics Tomorrow, Springer, NY, 1981. 
2. G. H. Hardy, A Mathematician’s Apology, Cambridge University Press, 1977. 
3. F. R. Soddy, Qui s’accuse s’acquitte, Nature, Jan. 4, 1941, 3-5. 
4. J. R. Newman, editor, The World of Mathematics, vol. 4, Simon and Schuster, NY, 1956. 


Richard C. Bollinger 
Behrend College, Pennsylvania State University 
Erie, PA 16563 


MISCELLANEA 
Continuity: A Do-it-yourself Editorial 
Suggested by S. I. Gass and R. C. Buck; titled and prepared for publication by R. P. Boas. 


102. Here are some definitions of continuity, arranged chronologically. Draw your own 
conclusions. 


1. (1817) According to a correct definition, we are to understand only this much by saying that 
a [real-valued] function f(x) varies according to the law of continuity, for all values of x between 
or outside certain bounds: that if x is any such value, the difference f(x + w) — f(x) can be made 
smaller [in absolute value] than any given number, provided that we can take w as small as we 
please. 


— Bolzano [1], pp. 7-8 


[That the words in brackets were intended is clear from the context and from other writings of Bolzano; cf. 
[7]. Bolzano, like many generations of physical scientists and college freshmen, uses “small” to mean “close 
to zero.” ] 


2. (1857) A continuous function is subject to the two following assumptions: 1st. As the 
variable gradually changes, the function must gradually change. 2nd. The law symbolized by the 
functional character must not abruptly change. When these two conditions are not satisfied, the 
function is discontinuous. 


—Price [6], p. 28 


3. (1863) A function is continuous, when it undergoes a gradual change; it is discontinuous, 
when the change is not gradual, or when the function changes suddenly from one value to another 
very different value. Thus when the difference between f(x) and f(x + h) may, by the continued 
diminuition of h, be made as small as we please, f(x) is a continuous function; but when under 
the same circumstances f(x + h) differs widely from f(x), the latter is a discontinuous function. 


—Hall [3], p. 2 


4. (1891) A function is said to be continuous between certain values of the independent 
variable, when it changes gradually while the variable passes from one value to the other. In other 
words, a continuous function is one that can be represented by a continuous curve. 


— Osborne [4], p. 64 


5. (1922) A function, f(x), is said to be continuous if a slight change in x produces but a slight 
change in the value of the function.... We can now make more explicit the definition given in 
Chapter I by saying: f(x) is continuous at the point x = a if $\lim_{x \to a} f(x) = f(a)$. 


— Osgood [5], pp. 7, 28 


6. (1978) A numerical-valued function f, defined on a set D, is said to be continuous at a point 
$p_0 \in D$ if, given any number $\varepsilon > 0$, there is a neighborhood U about $p_0$ such that 
$|f(p) - f(p_0)| < \varepsilon$ for every point $p \in U \cap D$. The function f is said to be continuous on D if it is continuous at 
each point of D. 


—Buck [2], p. 72 


TRICKS OR TREATS WITH THE HILBERT MATRIX 


MAN-DUEN CHOI 
Department of Mathematics, University of Toronto, Toronto, Ontario, Canada M5S 1A1 


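This article is intended to be a partial account on everything you always wanted to know about 
the Hilbert matrix—namely, the infinite Hilbert matrix $A$ or the finite order $n \times n$ Hilbert matrix 
$A_n$: 

\[
A = \begin{bmatrix}
1 & \frac12 & \frac13 & \frac14 & \cdots \\
\frac12 & \frac13 & \frac14 & \cdots & \\
\frac13 & \frac14 & \cdots & & \\
\frac14 & \cdots & & & \\
\vdots & & & &
\end{bmatrix},
\qquad
A_n = \begin{bmatrix}
1 & \frac12 & \frac13 & \cdots & \frac1n \\
\frac12 & \frac13 & \cdots & & \vdots \\
\frac13 & \cdots & & & \\
\vdots & & & & \\
\frac1n & \cdots & & & \frac{1}{2n-1}
\end{bmatrix}.
\]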


It is natural to ask: What are the natural consequences from such a natural assemblage of (the 
reciprocals of) natural numbers? Hence, here arise ten concrete problems (instead of theorems), 
aimed to reveal the “heart of mathematics” in a broad sense. All solutions provided in this article 
are elementary (requiring only the rudiments of linear algebra or simple operator theory without 
the spectral theorem). However, many tricks or treats associated with the Hilbert matrix may seem 
rather frightening or fascinating. 

The Hilbert matrix has played a prominent role in the structure theory of several branches of 
mathematics. Indeed, it serves as one of the most vivid examples for many unusual aspects (as well 
as usual aspects, of course) in operator theory (see e.g., [1], [4], [7]). Undoubtedly, the infinite 
Hilbert matrix stands out from other single operators, illustrating many soft results by hard 
analysis, discrete features of continuous phenomena, and subtle effects through intricate combina- 
torics. 

This pseudo-expository article is designed to exhibit some of these phenomena. The main body 
consists of ten problems on the Hilbert matrix, showing various aspects of concrete and discrete 
nature. The majority of these problems are apparently new, but many of their variants and 
underpinnings have already appeared in the literature. The second half of the article is devoted to 
solutions and additional notes and references. Much effort has been made to seek the most 
elementary solutions instead of the best answers to the problems. Interested readers will probably 
want to find other solutions that are less elementary but provide still more information. 

It may be appropriate to fix some notations and terms here. The term “matrix” refers to a 
finite-order or infinite square matrix with complex entries (or real entries, without loss of 
generality). The letter $I$ stands for the identity matrix with ones along the main diagonal and zeros 
elsewhere. A matrix $T = [\tau_{ij}]_{i,j}$ is said to be positive, in notation $T \ge 0$, if 
$\sum_{i,j} \tau_{ij} x_j \bar{x}_i \ge 0$ for all complex $k$-tuples $(x_1, x_2, \ldots, x_k)$ with $k > 0$. 
The notation $\|T\|$ refers to the Hilbert-space-operator norm; thus if an operator 
$T: \ell^2 \to \ell^2$ has the matrix representation $[\tau_{ij}]$, then the operator 
norm of $T$ (that is, the operator norm of $[\tau_{ij}]$) is 

\[
\|T\| = \sup\Bigl\{ \Bigl( \sum_i \Bigl| \sum_j \tau_{ij}\alpha_j \Bigr|^2 \Bigr)^{1/2} : \sum_j |\alpha_j|^2 \le 1 \Bigr\}.
\]

There is a useful elementary fact: if $T = [\tau_{ij}]$ and $S = [\sigma_{ij}]$ with $0 \le \sigma_{ij} \le \tau_{ij}$ for all $i$ and $j$, then 
$\|S\| \le \|T\|$. 


Man-Duen Choi was brought up in Hong Kong. He received his Ph.D. degree under the guidance of Chandler 
Davis at Toronto. He taught at the University of California, Berkeley, from 1973 to 1976, and has worked since then 
at the University of Toronto. His research has been mainly in operator algebras, operator theory, and polynomial 
rings. He is particularly interested in examples/counterexamples and two-by-two matrix manipulations. 

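I. Invertibility. By computation, $A_1^{-1} = 1$, 

\[
A_2^{-1} = \begin{bmatrix} 4 & -6 \\ -6 & 12 \end{bmatrix},
\qquad
A_3^{-1} = \begin{bmatrix} 9 & -36 & 30 \\ -36 & 192 & -180 \\ 30 & -180 & 180 \end{bmatrix}.
\]

Thus, the general case becomes irresistible. 
PROBLEM: Write out the inverse of the $n \times n$ Hilbert matrix $A_n$ explicitly. 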


It may be instructive to expose what you were afraid to ask: 

(a) While each entry of $A_n$ is the reciprocal of an integer, what sort of coincidence is it if $A_n^{-1}$ 
has integer entries? 

(b) In particular, why is $\det A_n$ the reciprocal of an integer? 

(c) Since each entry of $A_n$ is a positive real number, does it follow that $A_n^{-1}$ is of the form $[\beta_{ij}]$ 
with $(-1)^{i+j}\beta_{ij} > 0$ for all $i, j$? 

(d) While $A_n$ is a Hankel matrix (i.e., $A_n = [a_{ij}]$ with $a_{ij} = a_{pq}$ whenever $i + j = p + q$), 
should there be any pattern for $A_n^{-1}$ at all? 
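
Questions (a) and (c) can at least be probed by exact arithmetic for small $n$ before the general case is attacked. The following Python sketch is an illustration only and is not taken from the article: the helper names `hilbert`, `inverse`, and `has_integer_checkerboard_inverse` are invented here, and the inverse is obtained by plain Gauss-Jordan elimination over the rationals rather than from any closed formula.

```python
from fractions import Fraction


def hilbert(n):
    """Return the n x n Hilbert matrix with exact entries 1/(i+j-1)."""
    return [[Fraction(1, i + j - 1) for j in range(1, n + 1)]
            for i in range(1, n + 1)]


def inverse(m):
    """Invert a square matrix of Fractions by Gauss-Jordan elimination."""
    n = len(m)
    # Augment each row with the matching row of the identity matrix.
    aug = [row[:] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        # Bring a row with a nonzero pivot into position.
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        # Clear the pivot column in every other row.
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def has_integer_checkerboard_inverse(n):
    """Check questions (a) and (c) for one n: integer entries, alternating signs."""
    inv = inverse(hilbert(n))
    # With 0-based i, j the parity of i + j agrees with the 1-based parity.
    return all(
        inv[i][j].denominator == 1 and (-1) ** (i + j) * inv[i][j] > 0
        for i in range(n) for j in range(n)
    )


if __name__ == "__main__":
    for row in inverse(hilbert(3)):
        print([int(x) for x in row])
    print(all(has_integer_checkerboard_inverse(n) for n in range(1, 8)))
```

Such an experiment can suggest the pattern asked about in (a), (c), and (d), but it proves nothing about general $n$.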


II. Formal Inverse. With all the tractable $A_n^{-1}$ already in hand, can $A^{-1}$ be still far beyond the 
grasp? 

In this section, the notation $\sum$ is used only for sequential summation—absolute convergence is 
not required. There are three intrinsic properties pertinent to each infinite matrix $T = [\tau_{ij}]$: $T$ is 
said to be formally one-to-one if the trivial sequence is the only sequence $(\alpha_j)$ satisfying 
$\sum_j \tau_{ij}\alpha_j = 0$ for each $i$; $T$ is said to be formally onto if for each sequence $(\beta_i)$ there is a sequence 
$(\alpha_j)$ such that $\sum_j \tau_{ij}\alpha_j = \beta_i$ for each $i$; $S = [\sigma_{ij}]$ is called a formal inverse of $T$ if the formal 
products $TS$ and $ST$ are defined and equal to $I$; that is, 

\[
\sum_k \tau_{ik}\sigma_{kj} = \sum_k \sigma_{ik}\tau_{kj} =
\begin{cases} 1 & \text{if } i = j, \\ 0 & \text{if } i \ne j. \end{cases}
\]

Since the formal multiplication of infinite matrices is nonassociative, there is no actual correlation 
among these three properties. 


PROBLEM: (1) Is the infinite Hilbert matrix A formally one-to-one? 
(2) Is A formally onto? 
(3) Does A have a formal inverse? 


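III. Strong Positivity. If $T = [\tau_{ij}]$ is a matrix with $\tau_{ij} \ge 0$ and $r$ is a real number, then let $T^{[r]}$ 
denote the matrix $[\tau_{ij}^{\,r}]$. 
It is worthwhile to praise the $n \times n$ Hilbert matrix $A_n$ for all its positivity in the highest. 


PROBLEM: Prove that $A_n^{[r]} \ge 0$ for each real number $r \in (0, \infty)$. 
Note that in general, “$T \ge 0$” need not imply “$T^{[r]} \ge 0$”; example: 

\[
T = \begin{bmatrix} 1 & 1 & 0 \\ 1 & 2 & 1 \\ 0 & 1 & 1 \end{bmatrix} \ge 0,
\quad\text{but}\quad
T^{[1/2]} = \begin{bmatrix} 1 & 1 & 0 \\ 1 & 2^{1/2} & 1 \\ 0 & 1 & 1 \end{bmatrix} \not\ge 0.
\]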
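
The counterexample can be checked by hand, and the Python sketch below does the same bookkeeping. It is an illustration under stated assumptions, not the article's argument: the factor `R` with $R^{T}R = T$ and the test vector `WITNESS` are choices made here, and the positivity check for the Hilbert matrix covers only integer exponents (where exact rational arithmetic is available) by way of Sylvester's criterion on leading principal minors.

```python
from fractions import Fraction


def hilbert(n):
    """Return the n x n Hilbert matrix with exact entries 1/(i+j-1)."""
    return [[Fraction(1, i + j - 1) for j in range(1, n + 1)]
            for i in range(1, n + 1)]


def entrywise_power(m, r):
    """Return the matrix [t_ij ** r] for a matrix with nonnegative entries."""
    return [[x ** r for x in row] for row in m]


def quad_form(m, x):
    """Return sum over i, j of m[i][j] * x[i] * x[j] for a real vector x."""
    n = len(m)
    return sum(m[i][j] * x[i] * x[j] for i in range(n) for j in range(n))


def gram(rows):
    """Return R^T R for a real matrix R given by its rows (always positive)."""
    n = len(rows[0])
    return [[sum(r[i] * r[j] for r in rows) for j in range(n)]
            for i in range(n)]


def det(m):
    """Exact determinant of a square matrix of Fractions by elimination."""
    m = [row[:] for row in m]
    n = len(m)
    result = Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if m[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            m[c], m[pivot] = m[pivot], m[c]
            result = -result
        result *= m[c][c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            m[r] = [x - f * y for x, y in zip(m[r], m[c])]
    return result


def leading_minors(m):
    """Return the leading principal minors det(m[:k][:k]) for k = 1..n."""
    return [det([row[:k] for row in m[:k]]) for k in range(1, len(m) + 1)]


T = [[1, 1, 0], [1, 2, 1], [0, 1, 1]]
R = [[1, 1, 0], [0, 1, 1]]        # assumed factor: T = R^T R, hence T >= 0
WITNESS = [1.0, -1.4, 1.0]        # assumed test vector for T^[1/2]

if __name__ == "__main__":
    print(gram(R) == T)
    print(quad_form(entrywise_power(T, 0.5), WITNESS))
    print(all(d > 0 for d in leading_minors(entrywise_power(hilbert(5), 2))))
```

A negative value of the quadratic form at a single vector is enough to show that $T^{[1/2]}$ is not positive; the integer-exponent check says nothing about non-integer $r$, which is the real content of the problem.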


IV. Factorization. In view of the positivity of the infinite Hilbert matrix, Paul Halmos (oral 
communication) has asked: Is it possible to write out explicitly $A = B^2$ with $B \ge 0$? This question 
looks very frightening—apparently, it may involve finding the spectral decomposition of $A$. Still, 
there is a more tractable problem. 


PROBLEM: Write out explicitly $A = BB^*$ where $B$ is lower-triangular (i.e., $B = [\beta_{ij}]_{1 \le i,j < \infty}$ with 
$\beta_{ij} = 0$ whenever $i < j$). (Note that such $B$ is unique if, in addition, all diagonal entries of $B$ are 
positive real numbers.) 


There are some merits for the triangular factorization: 

(a) Let $B_n = [\beta_{ij}]_{1 \le i,j \le n}$ be the $n \times n$ upper-left corner of $B$. Then it follows $A_n = B_n B_n^*$. 
Thus an induction to seek $B$ is admissible. 

(b) Furthermore, $\det A_n = |\det B_n|^2$ is completely determined by the diagonal entries of $B$. (Cf. 
remark (b) following the problem in Section I.) 

(c) Because of its very tractable structure, each lower-triangular infinite matrix with nonzero 
diagonal entries always admits a formal inverse which is also lower-triangular. Thus let $C$ be the 
formal inverse of $B$. Then the abstract expression $C^*C = A^{-1}$ makes sense in the following 
manner: If $Av = w$, then $C^*(Cw) = v$—the formal matrix manipulations for $Cw$ and $C^*(Cw)$ can 
be carried out even though the formal matrix product $C^*C$ is not defined. 
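
Merits (a) and (b) are easy to watch numerically on small corners. The sketch below is only an illustration and does not give the explicit answer asked for in the problem: it uses the textbook Cholesky recursion in floating point, the function names are invented here, and $n$ is kept small because the Hilbert matrix is badly conditioned.

```python
import math


def hilbert(n):
    """Return the n x n Hilbert matrix in floating point."""
    return [[1.0 / (i + j - 1) for j in range(1, n + 1)]
            for i in range(1, n + 1)]


def cholesky(a):
    """Return lower-triangular b with positive diagonal and b b^T = a."""
    n = len(a)
    b = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(b[i][k] * b[j][k] for k in range(j))
            if i == j:
                b[i][i] = math.sqrt(a[i][i] - s)
            else:
                b[i][j] = (a[i][j] - s) / b[j][j]
    return b


def det_from_factor(b):
    """Return the product of squared diagonal entries, i.e. det(b b^T)."""
    result = 1.0
    for i in range(len(b)):
        result *= b[i][i] ** 2
    return result


def is_corner(small, big, tol=1e-12):
    """Check that `small` agrees with the upper-left corner of `big`."""
    n = len(small)
    return all(abs(small[i][j] - big[i][j]) <= tol
               for i in range(n) for j in range(n))


if __name__ == "__main__":
    b4, b5 = cholesky(hilbert(4)), cholesky(hilbert(5))
    print(is_corner(b4, b5))                    # merit (a): the factors nest
    print(det_from_factor(cholesky(hilbert(3))), 1 / 2160)  # merit (b)
```

The nesting of the computed factors is what makes the induction in (a) plausible; the closed form of the entries $\beta_{ij}$ still has to be found by other means.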


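V. Companion. A loyal companion of the Hilbert matrix is the matrix 

\[
L = \begin{bmatrix}
1 & \frac12 & \frac13 & \cdots \\
\frac12 & \frac12 & \frac13 & \cdots \\
\frac13 & \frac13 & \frac13 & \cdots \\
\vdots & \vdots & \vdots & \ddots
\end{bmatrix}
\quad\text{or}\quad
L_n = \begin{bmatrix}
1 & \frac12 & \frac13 & \cdots & \frac1n \\
\frac12 & \frac12 & \frac13 & \cdots & \frac1n \\
\frac13 & \frac13 & \frac13 & \cdots & \frac1n \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
\frac1n & \frac1n & \frac1n & \cdots & \frac1n
\end{bmatrix}
\]

whose entries run in a reversed-L-shaped pattern. 

Notice that $L$ (or $L_n$) is an assemblage of reciprocals of positive integers, too. Aside from 
external resemblance, $L$ (or $L_n$) also enjoys all those delightful properties as $A$ (or $A_n$) does in the 
preceding problems. Notably, $L$ admits a triangular factorization $L = CC^*$ where $C$ is the infinite 
Cesàro matrix, 

\[
C = \begin{bmatrix}
1 & 0 & 0 & 0 & \cdots \\
\frac12 & \frac12 & 0 & 0 & \cdots \\
\frac13 & \frac13 & \frac13 & 0 & \cdots \\
\frac14 & \frac14 & \frac14 & \frac14 & \cdots \\
\vdots & \vdots & \vdots & \vdots & \ddots
\end{bmatrix}.
\]

It is very pleasant to call upon the companion $L$ for the proof that $A: \ell^2 \to \ell^2$ is a bounded 
operator. 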


PROBLEM: (1) Observe that $I - (I - C)(I - C)^* \ge 0$. 

(2) Prove that $\|A\| \le \|L\| \le 4$. 

Apparently, this problem provides the quickest elementary way to show that A is a bounded 
operator. 
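
Two ingredients of this section can be confirmed exactly on finite corners: the factorization $L_n = C_n C_n^*$ and the entrywise comparison between $A_n$ and $L_n$ that feeds the elementary fact about norms. The Python sketch below is an illustration with invented helper names; it checks finite sections only and does not touch the bound $\|L\| \le 4$.

```python
from fractions import Fraction


def hilbert(n):
    """Return the n x n Hilbert matrix with exact entries 1/(i+j-1)."""
    return [[Fraction(1, i + j - 1) for j in range(1, n + 1)]
            for i in range(1, n + 1)]


def companion(n):
    """Return L_n, whose (i, j) entry is 1/max(i, j)."""
    return [[Fraction(1, max(i, j)) for j in range(1, n + 1)]
            for i in range(1, n + 1)]


def cesaro(n):
    """Return the n x n corner of the Cesaro matrix: row i is 1/i, i times."""
    return [[Fraction(1, i) if j <= i else Fraction(0)
             for j in range(1, n + 1)]
            for i in range(1, n + 1)]


def times_transpose(c):
    """Return c c^T for a real square matrix c."""
    n = len(c)
    return [[sum(c[i][k] * c[j][k] for k in range(n)) for j in range(n)]
            for i in range(n)]


def entrywise_leq(a, b):
    """Check a[i][j] <= b[i][j] for all i, j."""
    return all(x <= y for ra, rb in zip(a, b) for x, y in zip(ra, rb))


if __name__ == "__main__":
    n = 6
    print(times_transpose(cesaro(n)) == companion(n))
    print(entrywise_leq(hilbert(n), companion(n)))
```

Because $C$ is lower-triangular, the corner of $CC^*$ depends only on the corner of $C$, so the finite check reflects the infinite identity entry by entry.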


VI. Heredity. The immediate offspring of the infinite Hilbert matrix $A$ is 

\[
A' = \begin{bmatrix}
\frac12 & \frac13 & \frac14 & \cdots \\
\frac13 & \frac14 & \cdots & \\
\frac14 & \cdots & & \\
\vdots & & &
\end{bmatrix}
\]

which has shared the name of Hilbert matrix in literature. 
Undoubtedly, $A'$ deserves the title of Hilbert matrix because $A'$ indeed carries on all heredity. 


PROBLEM: Show that $A$ and $A'$ are unitarily equivalent (i.e., there exists a unitary operator $U$ 
such that $A = U^*A'U$). 

It is apparently surprising to see that (a) $A'$ is entrywise strictly smaller than $A$, and (b) $A'$ is a 
proper part of $A$—the first row (or the first column) of $A$ has been erased, yet it turns out that $A'$ 
is unitarily equivalent to $A$. 


VII. Compact Perturbation. Many offspring of the infinite Hilbert matrix $A$ can be obtained 
by simply erasing the first $p_1$ rows and the first $p_2$ columns from $A$. Thus, there are infinitely many 
Hankel matrices 

\[
A(q) = \begin{bmatrix}
\frac{1}{q+1} & \frac{1}{q+2} & \frac{1}{q+3} & \cdots \\
\frac{1}{q+2} & \frac{1}{q+3} & \cdots & \\
\frac{1}{q+3} & \cdots & & \\
\vdots & & &
\end{bmatrix}
\]

with $q = p_1 + p_2$ being any nonnegative integer. In particular, $A(0) = A$, $A(1) = A'$. 

It is a pleasure to report below that all these offspring are essentially the same (i.e., the 
difference of any two of them is a compact operator). Let $T_n = [\tau_{ij}]_{1 \le i,j \le n}$ be the $n \times n$ 
upper-left corner matrix of $T = [\tau_{ij}]_{1 \le i,j < \infty}$; $T_n$ can be regarded as an infinite matrix $[\omega_{ij}]_{1 \le i,j < \infty}$ 
with 

\[
\omega_{ij} = \begin{cases} \tau_{ij} & \text{if } i \le n \text{ and } j \le n, \\ 0 & \text{elsewhere.} \end{cases}
\]

By one of many equivalent definitions, $T$ is said to be compact if $\lim_{n \to \infty}\|T - T_n\| = 0$; $T$ is said 
to be of trace-class if the sum of the main diagonal entries of $(T^*T)^{1/2}$ is finite. It is well known 
that 

\[
\{\text{trace-class operators}\} \subsetneq \{\text{compact operators}\} \subsetneq \{\text{bounded operators}\}
\]

where $\subsetneq$ stands for a proper inclusion as a linear submanifold. 


PROBLEM: (1) Show that the infinite Hilbert matrix $A$ is not a compact operator. 
(2) Show that $A - A(q)$ is a trace-class operator. 

VIII. The Euler Recipe. There are many possible ways to make $\pi$; but it is striking to get $\pi$ 
from the magnificent formula of Euler 

\[
\pi^2/6 = 1 + 1/2^2 + 1/3^2 + 1/4^2 + \cdots.
\]

A countable ordered set $\Gamma$ is called an Euler vector if $\Gamma$ as an unordered set is the same as 
$\{0\} \cup \{1/k : k \in \mathbb{Z} \setminus \{0\}\}$, and furthermore each nonzero element $1/k$ occurs three times exactly in 
$\Gamma$. By Euler’s formula, it follows immediately that each Euler vector is of norm $\pi$ in the $\ell^2$-norm. 
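
Spelled out, the norm computation runs as follows. The zero entries contribute nothing, and each $1/k$ with $k \ne 0$ is counted three times; the symbol $\|\Gamma\|_2$ for the $\ell^2$-norm is introduced here only for this display.

```latex
\[
\|\Gamma\|_2^2
  = 3 \sum_{k \in \mathbb{Z} \setminus \{0\}} \frac{1}{k^2}
  = 6 \sum_{k=1}^{\infty} \frac{1}{k^2}
  = 6 \cdot \frac{\pi^2}{6}
  = \pi^2 ,
\qquad\text{so}\qquad
\|\Gamma\|_2 = \pi .
\]
```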


Next, recall that a matrix $T_1$ is a dilation of a matrix $T_0$ if $T_1$ can be written in the form 

\[
T_1 = \begin{bmatrix} T_0 & X \\ Y & Z \end{bmatrix}.
\]

In other words, if $\mathcal{H}_0$ is a subspace of a Hilbert space $\mathcal{H}_1$ and $P$ is the orthogonal projection from 
$\mathcal{H}_1$ onto $\mathcal{H}_0$, and if $T_1: \mathcal{H}_1 \to \mathcal{H}_1$, $T_0: \mathcal{H}_0 \to \mathcal{H}_0$ are operators satisfying $PT_1|_{\mathcal{H}_0} = T_0$, then $T_1$ is 
called a dilation of $T_0$. 


PROBLEM: Dilate the infinite Hilbert matrix $A$ (or its immediate offspring $A'$) to a real symmetric 
matrix $T$ such that each column-vector of $T$ is an Euler vector, and furthermore any two column-vectors 
of $T$ are orthogonal to each other. 
Consequently, $T^2 = \pi^2 I$ and $\|T\| = \pi$; therefore $\|A\| \le \|T\| = \pi$. 


IX. $\pi$ Again. A close relative of the $n \times n$ Hilbert matrix $A_n$ is 

\[
Z_n = \begin{bmatrix}
1 & \frac12 & \frac13 & \cdots & \frac1n \\
\frac12 & \frac13 & \cdots & \frac1n & 0 \\
\frac13 & \cdots & \frac1n & 0 & 0 \\
\vdots & & & & \vdots \\
\frac1n & 0 & \cdots & 0 & 0
\end{bmatrix},
\]

a finite-order Hankel matrix with all zeros below the main cross-diagonal. Thus “by analogy,” is 
there a corresponding relative of the infinite Hilbert matrix $A$? Answer: $A$ itself (plausible but not 
convincing?); in other words, the infinite-dimensional counterpart of $Z_n$ ought to be $A$ (still 
controversial?). 
Anyhow, there is an interesting connection between the equalities $\pi = 2 \operatorname{Arcsin} 1$ and 
$\|A\| = \lim_{n \to \infty}\|Z_n\|$ (instead of $\|A\| = \lim_{n \to \infty}\|A_n\|$). 


PROBLEM: (1) Show that 

\[
\pi = \lim_{k \to \infty} \bigl\{ (1 \cdot k)^{-1/2} + (2(k-1))^{-1/2} + \cdots + (k \cdot 1)^{-1/2} \bigr\}.
\]

(2) Use (1) or some others to show that $\lim_{n \to \infty}\|Z_n\| \ge \pi$. 
The combination of the results of Problems in Sections VIII and IX brings home the delightful 
fact $\|A\| = \pi$. 
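
Both limits in this section are approached slowly, which a small numerical experiment makes visible. The Python sketch below is an illustration with invented function names. It evaluates the sum in part (1) for a given $k$ and estimates $\|Z_n\|$ from below by power iteration, a standard technique that is assumed here and is not the method of the article.

```python
import math


def arcsine_sum(k):
    """Return (1*k)^(-1/2) + (2*(k-1))^(-1/2) + ... + (k*1)^(-1/2)."""
    return sum(1.0 / math.sqrt(j * (k + 1 - j)) for j in range(1, k + 1))


def z_matrix(n):
    """Return Z_n: 1/(i+j-1) on and above the main cross-diagonal, else 0."""
    return [
        [1.0 / (i + j - 1) if i + j - 1 <= n else 0.0
         for j in range(1, n + 1)]
        for i in range(1, n + 1)
    ]


def norm_lower_bound(m, iterations=500):
    """Estimate the norm of a symmetric, entrywise nonnegative matrix.

    Power iteration from a constant unit vector; the returned Rayleigh
    quotient x^T m x with |x| = 1 cannot exceed the true norm, apart from
    rounding, so it is a lower bound.
    """
    n = len(m)
    x = [1.0 / math.sqrt(n)] * n
    for _ in range(iterations):
        y = [sum(row[j] * x[j] for j in range(n)) for row in m]
        scale = math.sqrt(sum(v * v for v in y))
        x = [v / scale for v in y]
    y = [sum(row[j] * x[j] for j in range(n)) for row in m]
    return sum(xi * yi for xi, yi in zip(x, y))


if __name__ == "__main__":
    for k in (100, 10_000):
        print(k, arcsine_sum(k), math.pi)
    for n in (10, 40):
        print(n, norm_lower_bound(z_matrix(n)))
```

The printed values stay below $\pi$ and creep upward, which is consistent with the claims of the problem but is no substitute for a proof.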


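X. Two Natural Dilations. The Hilbert matrix $A$ is a “one-way-infinite Hankel matrix.” In 
contrast, there is a family of “two-way-infinite Hankel matrices” of the form 

\[
[\alpha_{i+j}]_{-\infty < i,j < \infty}.
\]

Two of these matrices arise as the natural dilations of $A$: the first is denoted by $A^{\#}$ with 

\[
\alpha_k = \begin{cases} 0 & \text{if } k = 0, \\ 1/k & \text{if } k \text{ is a nonzero integer;} \end{cases}
\]

the second is denoted by $A^{\dagger}$ with 

\[
\alpha_k = \begin{cases} 0 & \text{if } k \text{ is a nonpositive integer,} \\ 1/k & \text{if } k \text{ is a positive integer.} \end{cases}
\]


PROBLEM: (1) Show that $\|A^{\#}\| = \pi$. 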
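(2) Show that $\|A^{\dagger}\| = \infty$. 
By (1), it follows again that $\|A\| \le \|A^{\#}\| = \pi$. Apparently, this is the simplest way to show 
$\|A\| \le \pi$. (Cf. problems in Sections V and VIII.) 


The combination of (1) and (2) yields an extremely amazing fact: Although $A^{\dagger}$ is a lower-right-triangular 
part of $A^{\#}$, it turns out that $A^{\dagger}$ blows up while $A^{\#}$ is still bounded. 


Alternatively, rewrite 

\[
A^{\#} = \begin{bmatrix} -A & B^* \\ B & A \end{bmatrix},
\qquad
A^{\dagger} = \begin{bmatrix} 0 & B_0^* \\ B_0 & A \end{bmatrix}
\]

where $B$ and $B_0$ are one-way-infinite Toeplitz matrices 

\[
B = \begin{bmatrix}
0 & -1 & -\frac12 & -\frac13 & \cdots \\
1 & 0 & -1 & -\frac12 & \cdots \\
\frac12 & 1 & 0 & -1 & \cdots \\
\frac13 & \frac12 & 1 & 0 & \cdots \\
\vdots & \vdots & \vdots & \vdots & \ddots
\end{bmatrix},
\qquad
B_0 = \begin{bmatrix}
0 & 0 & 0 & 0 & \cdots \\
1 & 0 & 0 & 0 & \cdots \\
\frac12 & 1 & 0 & 0 & \cdots \\
\frac13 & \frac12 & 1 & 0 & \cdots \\
\vdots & \vdots & \vdots & \vdots & \ddots
\end{bmatrix}.
\]

Then it follows that $\|B\| \le \|A^{\#}\| < \infty$ and $\|B_0\| = \|A^{\dagger}\| = \infty$, yet $B_0$ is the lower-left-triangular 
part of $B$. 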
SOLUTIONS AND NOTES 


I. Solution. Let 

\[
\alpha_{ij} = (-1)^{i+j}\,(i+j-1) \binom{n+i-1}{n-j} \binom{n+j-1}{n-i} \binom{i+j-2}{i-1}^{2}.
\]

The direct but cumbersome computation leading to $[\alpha_{ij}] = A_n^{-1}$ can be justified by means of 
mathematical inductions (on $i$, on $j$, or on $n$) or some general counting principles. 
Alternatively, it may be easier to apply a known fact: 

\[
\det[1/(x_i + y_j)] = \prod_{i<j} \bigl((x_i - x_j)(y_i - y_j)\bigr) \Big/ \prod_{i,j} (x_i + y_j)
\]

(see [6, p. 92, Problem 3]). Let $x_i = i - 1$, $y_j = j$; it follows immediately that 

\[
\det A_n = \prod_{i<j} (i - j)^2 \Big/ \prod_{i,j} (i + j - 1).
\]

Similarly, all cofactors of $A_n$ can be computed out explicitly. Henceforth $A_n^{-1} = [\alpha_{ij}]$ follows. 


Note. The answer to this problem has been known for many years. The solution provided 
above has already appeared in [9]. See [13] for other related computational problems. 
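
For small $n$ the closed form can be checked mechanically. The Python sketch below is an illustration and not part of the cited solutions. It evaluates $\alpha_{ij}$ with `math.comb`, multiplies the result by $A_n$ in exact rational arithmetic, and compares the product formula for $\det A_n$ (obtained above by substituting $x_i = i - 1$, $y_j = j$) with a determinant computed by elimination. All function names are invented here.

```python
from fractions import Fraction
from math import comb, prod


def hilbert(n):
    """Return the n x n Hilbert matrix with exact entries 1/(i+j-1)."""
    return [[Fraction(1, i + j - 1) for j in range(1, n + 1)]
            for i in range(1, n + 1)]


def alpha(n, i, j):
    """Closed-form (i, j) entry of the inverse, with 1-based indices."""
    return ((-1) ** (i + j) * (i + j - 1)
            * comb(n + i - 1, n - j) * comb(n + j - 1, n - i)
            * comb(i + j - 2, i - 1) ** 2)


def alpha_matrix(n):
    """Return the integer matrix [alpha_ij] for a given n."""
    return [[alpha(n, i, j) for j in range(1, n + 1)]
            for i in range(1, n + 1)]


def matmul(a, b):
    """Return the product of two square matrices of equal size."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def identity(n):
    """Return the n x n identity matrix."""
    return [[int(i == j) for j in range(n)] for i in range(n)]


def det_formula(n):
    """Return prod_{i<j} (i-j)^2 / prod_{i,j} (i+j-1)."""
    num = prod((i - j) ** 2 for j in range(1, n + 1) for i in range(1, j))
    den = prod(i + j - 1 for i in range(1, n + 1) for j in range(1, n + 1))
    return Fraction(num, den)


def det(m):
    """Exact determinant of a square matrix of Fractions by elimination."""
    m = [row[:] for row in m]
    n = len(m)
    result = Fraction(1)
    for c in range(n):
        pivot = next((r for r in range(c, n) if m[r][c] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != c:
            m[c], m[pivot] = m[pivot], m[c]
            result = -result
        result *= m[c][c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            m[r] = [x - f * y for x, y in zip(m[r], m[c])]
    return result


if __name__ == "__main__":
    for n in range(1, 7):
        ok_inverse = matmul(alpha_matrix(n), hilbert(n)) == identity(n)
        ok_det = det(hilbert(n)) == det_formula(n)
        print(n, ok_inverse, ok_det)
```

The determinant formula has numerator an integer and the experiment shows its value is the reciprocal of an integer for the sizes tried, in line with remark (b) of Section I.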


II. Solution. (1) Some observations are needed so as to reduce tedious computation.

(a) If $A$ annihilates a sequence $(0,\ldots,0,1,\gamma_1,\gamma_2,\ldots)$ whose leading nonzero entry is 1, then
$1 \le |\gamma_1| + |\gamma_2| + \cdots$.

(b) If $A$ annihilates a sequence $(\alpha_1,\alpha_2,\alpha_3,\ldots)$, then $A$ also annihilates $(0,\alpha_1,\alpha_2/2,\alpha_3/3,\ldots)$.

(c) If $A$ annihilates a nontrivial sequence, then $A$ also annihilates a nontrivial $l^1$-sequence.

(d) If $A$ annihilates a sequence $(0,\ldots,0,1,\beta_1,\beta_2,\ldots)$ whose leading nonzero entry is 1 at the
$p$th position, then $A$ also annihilates a sequence $(0,\ldots,0,0,1,\gamma_1,\gamma_2,\ldots)$ whose leading nonzero
entry is 1 at the $(p+1)$st position and $|\gamma_j| \le p|\beta_j|/(p+1)$ for all $j$.

Statement (a) is obvious. Statement (b) follows from the computation

$$0 = \sum_{j=1}^{\infty} \alpha_j/(i+j-1) \quad\text{for each } i = 1,2,3,\ldots$$

$$\Rightarrow\quad 0 = \sum_{j=1}^{\infty} \alpha_j/j - \sum_{j=1}^{\infty} \alpha_j/(i+j) = i\sum_{j=1}^{\infty} \alpha_j/\bigl(j(i+j)\bigr) \quad\text{for each } i.$$


To get (c), apply (b) three times; thus if $A$ annihilates a nontrivial sequence $(\alpha_1,\alpha_2,\alpha_3,\ldots)$, then
$A$ also annihilates

$$(0,0,0,\ \alpha_1/(1\cdot 2\cdot 3),\ \ldots,\ \alpha_j/(j(j+1)(j+2)),\ \ldots)$$

which is an $l^1$-sequence in virtue of the fact $\sum \alpha_j/j = 0$ (from the assumption). Statement (d) is an
immediate consequence of (b).

Now suppose that $A$ is not formally one-to-one. Then by (c), we may assume that $A$ annihilates
a sequence $(0,\ldots,0,1,\beta_1,\beta_2,\ldots)$ whose leading coefficient is 1 at the $p$th position, and
$|\beta_1| + |\beta_2| + \cdots < \infty$. Application of Statement (d) $n$ times yields that $A$ also annihilates a
sequence $(0,\ldots,0,1,\gamma_1,\gamma_2,\ldots)$ whose leading entry is 1 at the $(p+n)$th position and
$|\gamma_j| \le p|\beta_j|/(p+n)$. Thus

$$|\gamma_1| + |\gamma_2| + \cdots \le p(|\beta_1| + |\beta_2| + \cdots)/(p+n) < 1$$

when $n$ is sufficiently large. But by (a), this leads to a contradiction. Therefore $A$ must be formally
one-to-one.

(2) and (3). Observe that whenever $A$ maps a sequence $(\alpha_1,\alpha_2,\alpha_3,\ldots)$ to $(1,0,0,\ldots)$, then $A$
will automatically annihilate $(0,\alpha_1,\alpha_2,\alpha_3,\ldots)$. Since $A$ is formally one-to-one, it follows that the
sequence $(1,0,0,\ldots)$ is not in the formal range of $A$. This, of course, shows that $A$ does not have a
formal inverse.


Note: It is well known that $A\colon l^2 \to l^2$ is one-to-one. (See e.g. [5, Theorem 1, pp. 703-4] for the
usual proof.) On the other hand, M. Rosenblum [8] has shown that for each complex number $\lambda$
with $\operatorname{Re}\lambda > 0$, there is a sequence $v = (\alpha_n)$ with $\sum|\alpha_n|^2 = \infty$ satisfying the formal equality
$Av = \lambda v$.


III. Solution. If $x$ and $r$ are real numbers and $r > 0$, $|x| < 1$, then by the binomial expansion,

$$(1-x)^{-r} = a_0 + a_1x + a_2x^2 + \cdots$$

where $a_0 = 1$, $a_1 = r$, and each $a_k$ is a positive real number. Thus

$$i+j-1 = ij(1-x) \quad\text{with}\quad x = (i-1)(j-1)/(ij) < 1,$$

and

$$(i+j-1)^{-r} = (ij)^{-r}(1-x)^{-r} = \sum_{k} a_k(i-1)^k(j-1)^k/(ij)^{k+r}.$$

Notice that $[a_k(i-1)^k(j-1)^k/(ij)^{k+r}]_{i,j}$ is a rank 1 positive matrix. Therefore $A^{(r)} =
[(i+j-1)^{-r}]_{i,j}$, as a sum of positive matrices, is also positive.


Note: In particular, let $r = 1$, then $a_k = 1$ for each $k$; thus the paragraph above provides a
simple “discrete” proof for the well-known fact $A \ge 0$ (see e.g., [1, Section (8)] for the usual
proof).


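IV. Solution. As indicated in the text, the construction of $B$ by mathematical induction is
admissible. Thus $A = BB^*$ where $B = [\beta_{ij}]$ with

$$\beta_{ij} = \begin{cases} \dfrac{\sqrt{2j-1}\,\bigl((i-1)!\bigr)^2}{(i+j-1)!\,(i-j)!} & \text{if } i \ge j, \\[2ex] 0 & \text{if } i < j. \end{cases}$$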
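
The factorization $A = BB^*$ can be spot-checked with exact arithmetic. The following sketch assumes the closed form $\beta_{ij} = \sqrt{2j-1}\,((i-1)!)^2/((i+j-1)!\,(i-j)!)$ for $i \ge j$; the helper names `c` and `bb_star` are hypothetical. Since the square roots occur in pairs in each entry of $BB^*$, only the rational part of $\beta_{ij}$ is needed.

```python
from fractions import Fraction
from math import factorial

def c(i, j):
    """Rational part of beta_ij, so that beta_ij = sqrt(2j-1) * c(i, j) for i >= j."""
    if i < j:
        return Fraction(0)
    return Fraction(factorial(i - 1) ** 2, factorial(i + j - 1) * factorial(i - j))

def bb_star(i, j):
    """(B B*)_{ij} = sum_k beta_ik * beta_jk; the two square roots multiply to 2k-1."""
    return sum((2 * k - 1) * c(i, k) * c(j, k) for k in range(1, min(i, j) + 1))

if __name__ == "__main__":
    ok = all(bb_star(i, j) == Fraction(1, i + j - 1)
             for i in range(1, 9) for j in range(1, 9))
    print("B B* agrees with the Hilbert matrix on the 8 x 8 corner:", ok)
```

Because $B$ is lower triangular, each entry of $BB^*$ is a finite sum, so checking a finite corner involves no truncation.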
Notes. (1) The matrix $B$ can also be constructed as follows. Regard $l^2$ as $L^2[0,1]$ with the fixed
orthonormal basis $\{f_1, f_2, \ldots\}$ where $f_n(x)$ is a polynomial of degree $n-1$ carrying a positive
coefficient in the $x^{n-1}$ term. Let $T\colon L^2[0,1] \to L^2[0,1]$ be the continuous linear map sending each
$f_n$ to $x^{n-1}$. Then it can be verified that $B^*$ is just the matrix representation for $T$.

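(2) For a graceful expression, it may be better to write $A = EXDX^*E$ where

$$E = \begin{pmatrix} 1 & & & & \\ & \tfrac12 & & & \\ & & \tfrac13 & & \\ & & & \tfrac14 & \\ & & & & \ddots \end{pmatrix}, \qquad
D = \begin{pmatrix} 1 & & & & \\ & 3 & & & \\ & & 5 & & \\ & & & 7 & \\ & & & & \ddots \end{pmatrix},$$

$$X = \begin{pmatrix} 1 & 0 & 0 & 0 & \cdots \\ 1 & \dfrac{1}{3} & 0 & 0 & \cdots \\ 1 & \dfrac{2}{4} & \dfrac{1\cdot 2}{4\cdot 5} & 0 & \cdots \\ 1 & \dfrac{3}{5} & \dfrac{2\cdot 3}{5\cdot 6} & \dfrac{1\cdot 2\cdot 3}{5\cdot 6\cdot 7} & \cdots \\ \vdots & & & & \ddots \end{pmatrix}.$$

Although $X$ and $D$ are unbounded operators $l^2 \to l^2$, there is no difficulty in carrying out
the formal matrix computation $B = EXD^{1/2}$, $A = BB^*$.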
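(3) $B$ has a formal inverse $[\gamma_{ij}]$ where

$$\gamma_{ij} = \begin{cases} (-1)^{i+j}\sqrt{2i-1}\,\dbinom{i+j-2}{j-1}\dbinom{i-1}{j-1} & \text{if } i \ge j, \\[2ex] 0 & \text{if } i < j. \end{cases}$$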


This result has been noticed by J. Todd [12]. 


V. Solution. (1) is plain since $I - (I - C)(I - C^*)$ is a diagonal operator with positive
diagonal entries $1, 1/2, 1/3, \ldots$.

(2) Here the equality $\|TT^*\| = \|T\|^2$ for the Hilbert-space-operator norm will be used twice.
First from (1),

$$\|I - C\|^2 = \|(I - C)(I - C^*)\| \le 1;$$

i.e., $\|I - C\| \le 1$, $\|C\| \le 2$. Next,

$$\|L\| = \|CC^*\| = \|C\|^2 \le 4.$$

Since each entry of $A$ is a positive real number smaller than or equal to the corresponding entry of
$L$, it follows that $\|A\| \le \|L\| \le 4$ as desired.

Note: This very simple proof that $\|C\| \le 2$ has appeared in [2, pp. 128-129].
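
The two matrix identities used in this solution can be verified on finite sections. The sketch below is illustrative only; it assumes that $C$ is the Cesàro matrix whose $i$th row is $(1/i, \ldots, 1/i, 0, \ldots)$ and that $L = CC^*$, and the function names are hypothetical. Because $C$ is lower triangular, the $n \times n$ corner of $CC^*$ depends only on the $n \times n$ corner of $C$.

```python
from fractions import Fraction

def cesaro(n):
    """n x n section of the Cesaro matrix: row i is (1/i, ..., 1/i, 0, ..., 0)."""
    return [[Fraction(1, i) if j <= i else Fraction(0) for j in range(1, n + 1)]
            for i in range(1, n + 1)]

def c_cstar(n):
    """n x n section of L = C C*."""
    C = cesaro(n)
    return [[sum(C[i][k] * C[j][k] for k in range(n)) for j in range(n)] for i in range(n)]

def defect(n):
    """I - (I - C)(I - C*) = C + C* - C C*, computed on the n x n section."""
    C, L = cesaro(n), c_cstar(n)
    return [[C[i][j] + C[j][i] - L[i][j] for j in range(n)] for i in range(n)]

if __name__ == "__main__":
    n = 6
    D, L = defect(n), c_cstar(n)
    print("diagonal of the defect:", [D[i][i] for i in range(n)])
    print("L dominates A entrywise:",
          all(Fraction(1, i + j + 1) <= L[i][j] for i in range(n) for j in range(n)))
```

The computation also shows that $L$ has entries $1/\max(i,j)$, which makes the entrywise comparison with $1/(i+j-1)$ transparent.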


VI. Solution. Let $C$ be the infinite Cesàro matrix as mentioned in Section V. It is easy to see
that both $C\colon l^2 \to l^2$ and $C^*\colon l^2 \to l^2$ are one-to-one (thus, $C$ is of dense range). Next, a direct
computation yields $CA = A'C$. The rest of the proof is the restatement of the following lemma
which is known to many operator-theorists.


LEMMA: Suppose $T$, $S$, and $C$ are bounded operators satisfying $CT = SC$. If both $T$ and $S$ are
hermitian and if $C$ is one-to-one and of dense range, then $T$ and $S$ are unitarily equivalent.


Proof. By the polar decomposition, there is a unitary $U$ and a positive $P$ such that $C = UP$
and $P$ is one-to-one and of dense range (see e.g. [3, p. 169]). Thus

$$P^2T = PU^*UPT = C^*CT = C^*SC = (SC)^*C = (CT)^*C = TC^*C = TPU^*UP = TP^2,$$

and consequently $PT = TP$. Therefore,

$$UTP = UPT = CT = SC = SUP.$$

Since $P$ is one-to-one and of dense range, it follows $UT = SU$; i.e., $T = U^*SU$ as desired.


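VII. Solution. (1) Clearly, $A - A_n$ contains an $n \times n$ submatrix

$$Y = \begin{pmatrix} \dfrac{1}{2n+1} & \dfrac{1}{2n+2} & \cdots & \dfrac{1}{3n} \\ \dfrac{1}{2n+2} & & & \vdots \\ \vdots & & & \\ \dfrac{1}{3n} & \cdots & & \dfrac{1}{4n-1} \end{pmatrix}.$$

Let $Q$ be the $n \times n$ matrix where every entry is $1/n$; then $Q$ is an orthogonal projection (i.e.,
$Q = Q^* = Q^2$) and thus of norm 1. Since $Y$ is entrywise larger than $Q/4$, it follows $\|A - A_n\| \ge
\|Y\| \ge \|Q/4\| = 1/4$. This proves that $A$ is not compact.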

(2) First observe that $A(q) \ge 0$. This is an immediate consequence of the fact that $A$ (or $A'$) is a
dilation of $A(q)$ if $q$ is an even (or odd) integer. Next, use the notion of Schur product: if $T = [\tau_{ij}]$
and $S = [\sigma_{ij}]$, then the Schur product $T * S = [\tau_{ij}\sigma_{ij}]$; it is an elementary fact that if $T \ge 0$ and
$S \ge 0$ then $T * S \ge 0$. Hence,

$$A - A(q) = \Bigl[\frac{1}{i+j-1} - \frac{1}{q+i+j-1}\Bigr] = \Bigl[\frac{q}{(i+j-1)(q+i+j-1)}\Bigr] = qA * A(q).$$

Since $A \ge 0$ and $A(q) \ge 0$, it follows that $A * A(q) \ge 0$ and thus $A - A(q) \ge 0$. Finally, observe
that the sum of diagonal entries of the positive operator $A - A(q)$ is

$$\sum_{i} q/\bigl((2i-1)(q+2i-1)\bigr) < \infty.$$

Therefore $A - A(q)$ is a trace-class operator.


Note. It is well known (see e.g. [1, Section 8]) that $A$ is not compact. It is even easier to see that
$A - A(q)$ is a Hilbert-Schmidt operator (i.e., the sum of squares of entries of $A - A(q)$ is finite),
as noted in [1, Section 8].


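VIII. Solution. First define an auxiliary function $\Phi\colon \mathbb{Z}^2 \to \mathbb{R}$ by

$$\Phi(k,l) = \begin{cases} 1/k & \text{if } k \ne 0,\ l = 0, \\ 1/l & \text{if } k = 0,\ l \ne 0, \\ -1/k & \text{if } k = l \ne 0, \\ 0 & \text{elsewhere.} \end{cases}$$

It follows that
(a) $\Phi$, regarded as an ordered set, is an Euler vector,
(b) if $(k_0, l_0)$ is a fixed pair of integers different from $(0,0)$, then

$$\sum_{k,l} \Phi(k,l)\,\Phi(k+k_0,\ l+l_0) = 0.$$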


Next, let $\mathcal{H}_1$ be the Hilbert space $l^2$ with the orthonormal basis $\{f_1, f_2, \ldots\}$ and let $\mathcal{H}_2$ be
$l^2(\mathbb{Z}^2)$ with the orthonormal basis $\{e_\mu : \mu \in \mathbb{Z}^2\}$; imbed $\mathcal{H}_1$ into $\mathcal{H}_2$ by identifying each $f_j$ with $e_\mu$
where $\mu = (j,0)$. Thus the infinite Hilbert matrix $A$ (or $A'$) stands for an operator $\mathcal{H}_1 \to \mathcal{H}_1$,
while two matrices $T = [\tau_{\mu\nu}]$, $S = [\sigma_{\mu\nu}]$ ($\mu, \nu \in \mathbb{Z}^2$) to be constructed below are operators
$\mathcal{H}_2 \to \mathcal{H}_2$. Now, define

$$\tau_{\mu\nu} = \Phi(k-1, l), \qquad \sigma_{\mu\nu} = \Phi(k,l) \qquad\text{if } \mu + \nu = (k,l) \in \mathbb{Z}^2.$$

Consequently, if $i$ and $j$ are positive integers and $\mu = (i,0)$, $\nu = (j,0)$, then $\tau_{\mu\nu} = 1/(i+j-1)$,
$\sigma_{\mu\nu} = 1/(i+j)$; i.e., $T$ is a dilation of $A$, and $S$ is a dilation of $A'$. From (a) and (b), it follows that
each column vector of $T$, i.e., $(\tau_{\mu\nu})_{\mu}$ for each fixed $\nu$, is an Euler vector and any two column
vectors of $T$ are orthogonal (the similar result holds for $S$).

Note. Alternatively, the operators $T$ and $S$ in the solution above can be constructed as follows.
Let $\mathbb{T}$ be the unit circle with the normalized Lebesgue measure $\mu$, let $\mathcal{H} = L^2(\mathbb{T} \times \mathbb{T}, \mu \times \mu)$, and
let $M\colon \mathcal{H} \to \mathcal{H}$ be an operator defined by


(2,w) M h(z,w) img(z,w) if Argz+Argw <2zq, 
> = 
She" om —ing(z,w) if Argz+Argw> 2zq, 


subject to the constraint $2\pi > \operatorname{Arg} z \ge 0$, $2\pi > \operatorname{Arg} w \ge 0$. Then it follows immediately that
$M = M^*$, $M^2 = \pi^2 I$. Moreover, there is a natural way (respectively, a quasi-natural way) to
identify $l^2(\mathbb{Z}^2)$ with $L^2(\mathbb{T} \times \mathbb{T})$: let $e_\mu = z^k w^l$ (resp., $e_\mu = z^{k-1}w^l$) whenever $\mu = (k,l) \in \mathbb{Z}^2$.
Then it can be verified that $M$ has the matrix expression $[\sigma_{\mu\nu}]$ (resp., $[\tau_{\mu\nu}]$) as in the solution
above.


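IX. Solution. (1) $\dfrac{1}{k}\sum_{j=1}^{k-1}\Bigl(\dfrac{j}{k}\Bigl(1 - \dfrac{j}{k}\Bigr)\Bigr)^{-1/2} = \sum_{j=1}^{k-1}\bigl(j(k-j)\bigr)^{-1/2}$ is a Riemann sum for the
function $(x(1-x))^{-1/2}$ on $(0,1)$.

$$\therefore\ \lim_{k\to\infty}\sum_{j=1}^{k-1}\bigl(j(k-j)\bigr)^{-1/2} = \int_0^1 (x(1-x))^{-1/2}\,dx = 2\operatorname{Arcsin} 1 = \pi.$$

(2) Let $v$ be the column vector $(1, 2^{-1/2}, \ldots, n^{-1/2})$. Then

$$\|Z_n\| \ge \langle Z_n v, v\rangle/\|v\|^2 = \Bigl(\sum_{k=1}^{n} a_k/k\Bigr)\Big/\Bigl(\sum_{k=1}^{n} 1/k\Bigr)$$

where $a_k = \sum_{j=1}^{k-1}(j(k-j))^{-1/2}$. Since $\lim_{k\to\infty} a_k = \pi$ (from (1)), it follows by general computation
$\lim_{n\to\infty}(\sum_{k=1}^{n} a_k/k)/(\sum_{k=1}^{n} 1/k) = \lim_{k\to\infty} a_k = \pi$; thus $\lim_{n\to\infty}\|Z_n\| \ge \pi$ as desired.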


Note. The main ingredient of the proof above has appeared in [11]. There are other known
proofs of $\|A\| = \pi$ in the literature (see e.g. [4, Chapter 9 and Appendix III], [10, Chapter 9, p.
101]).
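
The convergence in part (1) is slow, because the integrand is unbounded at both endpoints. The short numerical sketch below is an illustration only; the function name `a` mirrors the quantity $a_k = \sum_{j=1}^{k-1}(j(k-j))^{-1/2}$, and the sample values of $k$ are arbitrary choices.

```python
import math

def a(k):
    """a_k = sum_{j=1}^{k-1} (j (k - j))^(-1/2), a Riemann sum for the integral in (1)."""
    return sum(1.0 / math.sqrt(j * (k - j)) for j in range(1, k))

if __name__ == "__main__":
    for k in (10, 100, 1000, 10000):
        print(k, a(k), math.pi - a(k))
```

The printed gap $\pi - a_k$ shrinks roughly like a constant times $k^{-1/2}$, which is consistent with the endpoint singularities of $(x(1-x))^{-1/2}$.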


X. LEMMA. Let T and H be two-way-infinite matrices of the form 


If $\alpha_k \ge 0$ for each integer $k$, then $\|T\| = \|H\| = \sum_{-\infty}^{\infty} \alpha_k$.
Proof. Let $S_k$ be the matrix in the same form as $T$ with ones along the $k$th diagonal and zeros
elsewhere. Then it is obvious that $\|S_k\| = 1$ and $\|T\| = \|\sum \alpha_k S_k\| \le \sum \alpha_k$. On the other hand,
consider the $n \times n$ Toeplitz matrix

$$X_n = \begin{pmatrix} \alpha_0 & \alpha_1 & \alpha_2 & \cdots & \alpha_{n-1} \\ \alpha_{-1} & \alpha_0 & \alpha_1 & & \vdots \\ \alpha_{-2} & \alpha_{-1} & \alpha_0 & \ddots & \\ \vdots & & \ddots & \ddots & \alpha_1 \\ \alpha_{-n+1} & \cdots & & \alpha_{-1} & \alpha_0 \end{pmatrix}$$

(as a submatrix of $T$) and the constant column vector $v = (1,1,\ldots,1) \in \mathbb{C}^n$. Then

$$\|T\| \ge \langle X_n v, v\rangle/\|v\|^2 = \alpha_0 + \Bigl(1 - \frac{1}{n}\Bigr)(\alpha_1 + \alpha_{-1}) + \Bigl(1 - \frac{2}{n}\Bigr)(\alpha_2 + \alpha_{-2}) + \cdots + \frac{1}{n}(\alpha_{n-1} + \alpha_{-n+1}).$$

When $n$ approaches infinity, the right-hand side approaches $\sum \alpha_k$. Thus $\|T\| \ge \sum \alpha_k$.
Therefore the equality $\|T\| = \sum \alpha_k$ is verified.

The proof above can be modified so as to yield the similar result for $H$. Alternatively, let $U$ be
the unitary matrix in the same form as $H$ with ones along the main cross-diagonal and zeros
everywhere else; then it follows that $T = HU$, and thus $\|T\| = \|H\|$ as desired.
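
The lower bound in the proof is a Rayleigh quotient with the constant vector, and it can be evaluated directly. The sketch below is illustrative only: it takes the particular symbol $\alpha_0 = \pi^2/3$, $\alpha_k = 2/k^2$ ($k \ne 0$) from the Solution that follows, whose sum is $\pi^2$, and the function names are hypothetical.

```python
import math

def alpha(k):
    """Symbol from the Solution: alpha_0 = pi^2/3 and alpha_k = 2/k^2 for k != 0."""
    return math.pi ** 2 / 3 if k == 0 else 2.0 / k ** 2

def rayleigh_direct(n):
    """<X_n v, v> / ||v||^2 for the constant vector v: the sum of all entries over n."""
    return sum(alpha(j - i) for i in range(n) for j in range(n)) / n

def rayleigh_formula(n):
    """alpha_0 + sum_{k=1}^{n-1} (1 - k/n) (alpha_k + alpha_{-k})."""
    return alpha(0) + sum((1 - k / n) * (alpha(k) + alpha(-k)) for k in range(1, n))

if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, rayleigh_formula(n), math.pi ** 2 - rayleigh_formula(n))
```

The quotient increases with $n$ and stays below $\sum \alpha_k = \pi^2$; the gap decays only like $(\log n)/n$ for this symbol.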


Solution. (1) Euler’s formula will be used twice here. First by direct computation, $(A^*)^2$ is in
the same form as $T$ in the Lemma with $\alpha_0 = \pi^2/3$ and $\alpha_k = 2/k^2$ if $k$ is a nonzero integer. Since
$A^*$ is hermitian, it follows that

$$\|A^*\|^2 = \|(A^*)^2\| = \sum_{-\infty}^{\infty} \alpha_k = \pi^2.$$

Therefore $\|A^*\| = \pi$ as desired.
(2) follows from the Lemma.


Note. The fact $\|B\| \le \pi$ and $\|B_0\| = \infty$ is well known (see [4, § 9.6]).
A GROUP WITH CONSTANT GROWTH RATE


R. HIRSHON 
Polytechnic Institute of New York, 333 Jay Street, Brooklyn, NY 11201 


1. Introduction. The theory of direct products in groups and related constructions in algebra is 
a rich and varied field. A sample list of some works (by no means complete) is included in the 
references. Problems often center on obtaining certain invariants for the decompositions involved. 
Sometimes the problems that arise are easy to state and comprehend. However, as in elementary 
number theory, some of these problems await fresh insights and new methods for their solutions. 
Sometimes the results obtained are somewhat anomalous insofar as they show that direct products 
do not always behave as one might expect. It is only after looking at this construction for some 
time that one begins to accept a general principle. This is: Almost anything can happen! In this 
paper we will discuss a result which seems to violate the intuition of our formative years. 


2. A Review of Some Basics. Before stating the exact problem to be discussed, a brief review
of the fundamental concepts which occur might be useful. If $A$ and $B$ are isomorphic groups, we
write $A \cong B$.


2.1 Direct Products. We say that a group $G$ is the direct product of its $n$ subgroups

$$G_1, G_2, \ldots, G_n$$

if three conditions hold. In the first place, every element $g$ in $G$ must be expressible as a product

(1) $g = g_1 \cdot g_2 \cdots g_n, \quad g_i \in G_i.$

Secondly, if $u \in G_i$ and $v \in G_j$ and $i \ne j$ then $uv = vu$. Finally the expression in (1) must be
unique. That is, if in addition to (1) we also have

$$g = g_1' \cdot g_2' \cdots g_n', \quad g_i' \in G_i,$$

then $g_i = g_i'$ for all $i$. When these three conditions hold we write

$$G = G_1 \times G_2 \times \cdots \times G_n.$$
The above definition (internal definition) starts with the group $G$. A natural question which arises
is the following. If one is given $n$ arbitrary groups

(2) $G_1, G_2, \ldots, G_n$

which are not necessarily distinct, is there a group $G$ with

$$G = \bar{G}_1 \times \bar{G}_2 \times \cdots \times \bar{G}_n$$

with $\bar{G}_i \cong G_i$ for all $i$? The answer is yes. Define $G$ as the cartesian product of the $G_i$. That is, $G$ is
the set of all $n$-tuples $\alpha$ of the form
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has taught at the Polytechnic Institute of New York since 1967 where he is currently a Professor of Mathematics. He 
is fond of cycling, swimming, reading, the movies and his family (not necessarily in the given order). He is currently 
at work on a study of direct products entitled Some J Replacement Results. 
$$\alpha = (g_1, g_2, \ldots, g_n), \quad g_i \in G_i.$$

We say that $g_i$ is the $i$th coordinate of $\alpha$. If $\beta = (h_1, h_2, \ldots, h_n)$ is in $G$, $\alpha\beta$ is defined by

$$\alpha\beta = (g_1h_1, g_2h_2, \ldots, g_nh_n).$$


It is easy to verify that $G$ is a group under multiplication of $n$-tuples. Furthermore if we define $\bar{G}_i$
as the set of all $n$-tuples of $G$ whose $j$th coordinate for all $j$, $j \ne i$, is the identity of $G_j$, then
$\bar{G}_i \cong G_i$ and $G = \bar{G}_1 \times \bar{G}_2 \times \bar{G}_3 \times \cdots \times \bar{G}_n$. We will be especially concerned with this
construction when all the groups in (2) are identical. That is

$$G_1 = G_2 = G_3 = \cdots = G_n = H.$$

In this case we call the resulting group (or any group isomorphic to the resulting group) the direct
product of $n$ copies of $H$ and write $G = H^n$ for short. In dealing with direct products and in
particular with $H^n$, the construction of $G$ shows that by considering the groups $\bar{G}_i$ we may always
use the internal definition of the direct product.


2.2 Finitely Generated Groups. A group $H$ is said to be finitely generated if $H$ contains a finite
set of $r$ elements $h_1, h_2, \ldots, h_r$ such that every element $h$ in $H$ may be expressed in at least one
way as a product of one or more of the elements

$$h_1, h_2, \ldots, h_r, h_1^{-1}, h_2^{-1}, \ldots, h_r^{-1}.$$

In this case we say that $H$ is generated by $h_1, h_2, \ldots, h_r$ or that $h_1, h_2, \ldots, h_r$ are a set of
generators of $H$. Clearly every finite group is finitely generated. Standard arguments on cardinality
show that a finitely generated group is countable so that for example the group of real numbers
under addition is not finitely generated. If $H$ is a finitely generated group let $d(H)$ designate the
least positive integer $r$ such that $H$ is generated by $r$ of its elements. For example $d(H) = 1$ if and
only if $H$ is a cyclic group. On the other hand if $H$ is the symmetric group on $n$ elements
$d(H) = 2$ ([44], v.1, p. 49). If $H$ is the infinite cyclic group, one can easily show $d(H^n) = n$.


2.3 Finitely Presented Groups. Let us begin with an example. Let $G$ be the symmetric group on
three elements 1, 2, 3. Let $\alpha$ and $\beta$ be the cyclic permutations $\alpha = (1,2)$, $\beta = (1,2,3)$. Then $\alpha$ and $\beta$
generate $G$ and

(3) $\alpha^{-1}\beta\alpha = \beta^2, \quad \alpha^2 = 1, \quad \beta^3 = 1.$

It is possible to show that any other relation involving $\alpha$ and $\beta$ is a consequence of the three
relations (3) and the trivial relations $\gamma\gamma^{-1} = 1$ where $\gamma$ is $\alpha$, $\beta$, $\alpha^{-1}$ or $\beta^{-1}$. One describes this
situation by saying $G$ is generated by the elements $\alpha$ and $\beta$ subject to the defining relations (3).
The elements $\alpha$, $\beta$ together with the relations (3) are said to be a presentation of $G$. For short we
write

(4) $G = \langle \alpha, \beta;\ \alpha^{-1}\beta\alpha = \beta^2,\ \alpha^2 = \beta^3 = 1 \rangle$

and refer to (4) as a presentation of G. One of the nice results of presentation theory tells us that 
anything we write down that looks like a presentation in fact turns out to be the presentation of 
some group. For example if we write four symbols $a$, $b$, $c$, $d$ and if we write

(5) $b^{-1}ab = a^2, \quad c^{-1}bc = b^2, \quad d^{-1}cd = c^2, \quad a^{-1}da = d^2$

and designate these relations by $P_1$, $P_2$, $P_3$, $P_4$, then there is exactly one group (up to an
isomorphism) that has the presentation

(6) $\langle a, b, c, d;\ P_1, P_2, P_3, P_4 \rangle.$
A finitely presented group is one which has a presentation involving a finite number of generators 


and a finite number of relations among these generators. Of course the presentation that one 
writes down may turn out to define the identity group. For example the group on one generator w 
subject to the relations $w^2 = 1$ and $w^3 = 1$ must clearly define the identity group. A little less
obvious is the fact that the group on three generators $a$, $b$, $c$ subject to the relations

$$b^{-1}ab = a^2, \quad c^{-1}bc = b^2, \quad a^{-1}ca = c^2$$


defines the trivial group [18]. However the group (6) turns out to be an infinite group [18]. The 
problem of determining whether or not a given presentation defines the trivial group is often very 
difficult. In fact it has been shown that it is impossible to describe a general method which works 
in a finite number of steps for all presentations [50]. A very clear exposition of the elements of 
presentation theory can be found in the first few chapters of [46]. 
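
The relations (3) for the symmetric group on three elements can be confirmed by direct computation. The sketch below is an illustration only; it writes permutations of the points 0, 1, 2 as tuples, and the helper names (`compose`, `inverse`, `generated`) are hypothetical. It checks that the relations hold and that the two permutations generate all six elements; it does not show that (3) is a complete set of defining relations.

```python
from itertools import permutations

def compose(p, q):
    """Composite permutation: apply q first, then p (tuples acting on 0, 1, 2)."""
    return tuple(p[q[x]] for x in range(len(q)))

def inverse(p):
    inv = [0] * len(p)
    for x, y in enumerate(p):
        inv[y] = x
    return tuple(inv)

identity = (0, 1, 2)
alpha = (1, 0, 2)   # the transposition (1,2), written on the points 0, 1, 2
beta = (1, 2, 0)    # the 3-cycle (1,2,3): 0 -> 1 -> 2 -> 0

def generated(gens):
    """Close the identity under right multiplication by the generators."""
    seen = {identity}
    frontier = [identity]
    while frontier:
        g = frontier.pop()
        for s in gens:
            h = compose(g, s)
            if h not in seen:
                seen.add(h)
                frontier.append(h)
    return seen

if __name__ == "__main__":
    lhs = compose(compose(inverse(alpha), beta), alpha)
    print("alpha^-1 beta alpha == beta^2:", lhs == compose(beta, beta))
    print("order of the generated group:", len(generated([alpha, beta])))
```

In a finite group, closing under products of the generators already yields inverses, so the closure above is the whole subgroup generated.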


3. Groups with Constant Growth Rate. One seems to feel that if $G$ is a finitely generated
nontrivial group that

(7) $\lim_{n\to\infty} d(G^n) = \infty.$

Indeed if $G$ is a nontrivial finite group, (7) is true [12], [60]. Using the fact that every finitely
generated abelian group is a direct product of cyclic groups [44] one can easily see that (7) is true
if $G$ is a finitely generated abelian group. If $G$ and $G_1$ are finitely generated and $G_1$ is a
homomorphic image of $G$, then $d(G_1) \le d(G)$. In this case $G_1^n$ is also a homomorphic image of
$G^n$ so $d(G_1^n) \le d(G^n)$. Consequently one sees that if $G$ is a finitely generated group and $G$ has a
nontrivial finite homomorphic image, then again (7) is true. An interesting study of the behavior
of $d(G^n)$ is made in [60]. For example, the growth rate of $d(G^n)$ for $G$ a finite perfect group is
quite slow. If A, is the alternating group of five elements, 49*°4° requires only five generators 
[60]. 

At the other end of the spectrum, M. Jones has discovered an amazing nontrivial finitely
generated group $G$ such that $G^2 \cong G$ [38].* Hence Jones’ group has the property that $G^n \cong G$ and
$d(G^n) = d(G)$ for all $n$.

The construction of Jones’ group is fairly complicated. The purpose of this note is to show that
the infinite group $H$ which is given by (6) has the property that $d(H^n) = 4$ for all $n$. It seems then
that $H$ might be one of the simplest groups whose growth rate $d(H^n)$ is constant.


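4. Proof that $d(H^n) = 4$. To begin with it is useful to note that (5) implies

$$b^{-r}ab^{r} = a^{2^r}, \qquad b^{r}a^{2^j}b^{-r} = a^{2^{j-r}}, \qquad j \ge r > 0$$

and corresponding relations for the other generators. We assume that $H^n$ is the direct product of
its subgroups $H_1, H_2, \ldots, H_n$, where $H_i$ is generated by $a_i, b_i, c_i, d_i$ subject to the relations

(8) $b_i^{-1}a_ib_i = a_i^2, \quad c_i^{-1}b_ic_i = b_i^2, \quad d_i^{-1}c_id_i = c_i^2, \quad a_i^{-1}d_ia_i = d_i^2.$

Let

$$a_1a_2^{2}a_3^{2^2} \cdots a_j^{2^{j-1}} = w_1(j)$$

$$b_1b_2^{2}b_3^{2^2} \cdots b_j^{2^{j-1}} = w_2(j)$$

$$c_1c_2^{2}c_3^{2^2} \cdots c_j^{2^{j-1}} = w_3(j)$$

$$d_1d_2^{2}d_3^{2^2} \cdots d_j^{2^{j-1}} = w_4(j)$$

and let

$$w_1 = w_1(n), \quad w_2 = w_2(n), \quad w_3 = w_3(n), \quad w_4 = w_4(n).$$

We claim that $H^n$ is generated by $w_1, w_2, w_3, w_4$. To outline the idea of the proof, let $L$ be the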


*The temptation at this point to cite one of those easily understood but perhaps difficult problems is irresistible.
In the 1950’s Peter Hilton asked whether one could find a nontrivial finitely presented group with $G^2 \cong G$. The
problem is unsolved today.
subgroup of $H^n$ generated by $w_1, w_2, w_3, w_4$. Note first because of the symmetry in (8), for any fact
concerning $a_i, b_i, c_i, d_i$ there is a corresponding fact obtainable by permuting these generators.

Now note that it suffices to show

(9) $L$ contains $a_n$, $b_n$, $c_n$ and $d_n$.

For if we can show (9), then it follows that if $L_j$ is the subgroup of $L$ generated by
$w_1(j), w_2(j), w_3(j), w_4(j)$ then

(10) $L = L_{n-1} \times H_n.$

However any proof that shows (9) will at the same time yield a proof that shows

(11) $L_{n-1}$ contains $a_{n-1}$, $b_{n-1}$, $c_{n-1}$, $d_{n-1}$

so that $L_{n-1} = L_{n-2} \times H_{n-1}$. Clearly, by repeating this inductive procedure we obtain our
assertion.
Now to show (9) it suffices to show

(12) $L$ contains $a_n$

because by symmetry, any proof that shows $L$ contains $a_n$ will work to show $L$ contains $b_n$, $c_n$, and
$d_n$. In order to show (12), it suffices to show that

(13) $L$ contains an odd power of $a_n$,

and $L$ contains an element of the form

(14) $z = ya_n, \quad y \in H_1 \times H_2 \times \cdots \times H_{n-1}.$

For if we can show (13), then by symmetry, $L$ contains an odd power of $b_n$, $c_n$, and of $d_n$. Let $\alpha_1$,
$\alpha_2$, $\alpha_3$ and $\alpha_4$ be the least positive powers of $a_n$, $b_n$, $c_n$ and $d_n$ which are in $L$. By symmetry,
$\alpha_1 = \alpha_2 = \alpha_3 = \alpha_4 = \alpha$. By the Euclidean algorithm, if $a_n^{\gamma}$ is in $L$, $\alpha$ is a divisor of $\gamma$. Hence if (13)
is true, $\alpha$ is odd. But then from (14) $L$ contains $b_n^{-\alpha}ya_nb_n^{\alpha} = ya_n^{2^{\alpha}}$, so that $(ya_n^{2^{\alpha}})(ya_n)^{-1}$
is in $L$. Hence if $\beta = 2^{\alpha} - 1$, $a_n^{\beta}$ is in $L$ and $\alpha$ divides $\beta$. If $\alpha$ is greater than 1, this is impossible.
For if $\alpha > 1$, let $p$ be the least prime divisor of $\alpha$. Then $2^{\alpha} \equiv 1 \bmod p$ and $2^{p-1} \equiv 1 \bmod p$.
Hence if the order of 2 mod $p$ is $d$, $d$ divides $\alpha$ and $d$ divides $p - 1$, contradicting the minimality of
$p$. Hence $\alpha = 1$ so that (12) is true. We complete the proof by showing (13) and (14).
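
The number-theoretic step, that no integer $\alpha > 1$ divides $2^{\alpha} - 1$, is easy to test numerically. The sketch below is only an illustration of that step; the function names and the search bounds are arbitrary choices, not part of the proof.

```python
def divides_power_of_two_minus_one(alpha):
    """True when alpha divides 2**alpha - 1, tested with modular exponentiation."""
    return pow(2, alpha, alpha) == 1 % alpha

def least_prime_divisor(m):
    p = 2
    while p * p <= m:
        if m % p == 0:
            return p
        p += 1
    return m

def order_of_two(p):
    """Multiplicative order of 2 modulo an odd prime p."""
    d, x = 1, 2 % p
    while x != 1:
        x = (2 * x) % p
        d += 1
    return d

if __name__ == "__main__":
    print([a for a in range(1, 5000) if divides_power_of_two_minus_one(a)])
    for alpha in (9, 15, 21, 341):
        p = least_prime_divisor(alpha)
        print(alpha, p, order_of_two(p))
```

For each odd $\alpha > 1$ the order $d$ of 2 modulo the least prime divisor $p$ divides $p - 1$ but not $\alpha$, which is exactly the obstruction used in the argument.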
To show (13) consider all elements in $L$ of the form

(15) $v = a_k^{\theta_k}a_{k+1}^{\theta_{k+1}} \cdots a_n^{\theta_n}$

where each exponent $\theta_j$ of $a_j$ is positive. We call $v$ an $n - k + 1$ string. For example, $w_1$ is an $n$
string. If $v$ is an $n - k + 1$ string given by (15), then an easy calculation shows that if $\delta_j = 2^{2^{j-1}}$,
then

(16) $w_2^{-1}vw_2v^{-\delta_k} = a_{k+1}^{\varepsilon_{k+1}} \cdots a_n^{\varepsilon_n}$

where $\varepsilon_j = \theta_j(\delta_j - \delta_k)$. In particular $\varepsilon_j > 0$ so that (16) yields an $n - k$ string in $L$. By
induction, there is a 1 string in $L$. That is, $L$ contains an element of the form $a_n^{\theta}$, $\theta > 0$. Let
$\theta = 2^{\sigma}\theta_1$ where $\theta_1$ and 2 are relatively prime. Let $\eta = 2^{n-1}$, $a = a_n$. If $\sigma \le \eta$, then

$$(a^{\theta})^{2^{\eta-\sigma}} = a^{2^{\eta}\theta_1}$$

so that $a^{2^{\eta}\theta_1}$ is in $L$. But then

$$w_2a^{2^{\eta}\theta_1}w_2^{-1} = a^{\theta_1}$$

so $a^{\theta_1}$ is in $L$. If $\sigma > \eta$, let $\sigma = \mu\eta + \rho_0$, $0 \le \rho_0 < \eta$. Then

$$w_2^{\mu}a^{\theta}w_2^{-\mu} = a^{2^{\rho_0}\theta_1}$$

so that $a^{2^{\rho_0}\theta_1}$ is in $L$ and as above this implies that $a^{\theta_1}$ is in $L$ and (13) is shown. Finally if we
choose integers $\mu_1$ and $\mu_2$ with $\mu_1 2^{n-1} + \mu_2\theta_1 = 1$, then

$$w_1^{\mu_1}a^{\theta_1\mu_2} = w_1(n-1)^{\mu_1}a$$

so that we may take $y = w_1(n-1)^{\mu_1}$.
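
The exponent bookkeeping in this argument can be simulated without constructing the group. The sketch below is a model under stated assumptions, not a computation in $H^n$: it assumes $w_1 = a_1a_2^{2} \cdots a_n^{2^{n-1}}$, that conjugation by $w_2$ raises $a_j$ to the power $\delta_j = 2^{2^{j-1}}$, and that the $a_j$ commute with one another, so a string is represented only by its exponents. All function names are hypothetical.

```python
def delta(j):
    """delta_j = 2**(2**(j-1)): the power to which conjugation by w_2 raises a_j."""
    return 2 ** (2 ** (j - 1))

def reduce_string(theta):
    """One step of (16): theta maps j -> exponent of a_j; the leading index k is eliminated."""
    k = min(theta)
    return {j: t * (delta(j) - delta(k)) for j, t in theta.items() if j != k}

def one_string_exponent(n):
    """Start from the exponents of w_1 and reduce to a positive power of a_n."""
    theta = {j: 2 ** (j - 1) for j in range(1, n + 1)}
    while len(theta) > 1:
        theta = reduce_string(theta)
    return theta[n]

def odd_part(t):
    while t % 2 == 0:
        t //= 2
    return t

def bezout(n, theta1):
    """Integers (mu1, mu2) with mu1 * 2**(n-1) + mu2 * theta1 == 1, for odd theta1."""
    eta = 2 ** (n - 1)
    mu2 = pow(theta1, -1, eta)        # inverse of theta1 modulo eta (Python 3.8+)
    mu1 = (1 - mu2 * theta1) // eta
    return mu1, mu2

if __name__ == "__main__":
    n = 4
    theta = one_string_exponent(n)
    theta1 = odd_part(theta)
    print("theta =", theta, "odd part =", theta1, "Bezout pair =", bezout(n, theta1))
```

Every exponent stays positive at each reduction step because $\delta_j > \delta_k$ for $j > k$, which is what allows the induction down to a 1 string.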

In concluding we pose a question which we make no attempt to answer. Does there exist a
finitely generated group $H_1$, $H_1 \ne 1$, such that $H_1$ is defined by three or fewer
defining relations and such that the growth rate $d(H_1^n)$ is constant?
techniques of the calculus. To begin, a few excerpts from the history of elementary function
approximation are presented. 


2. Tabular Function Values. Prior to our era of automatic computation, most function values 
used in applied problems were read from tables. The literature of the eighteenth and nineteenth 
centuries abounds with interpolation schemes devised to extend tabular data. Even the use of a 
slide rule for three significant figure estimates of elementary function values amounts to a table 
reading exercise. 

Trigonometric tables, in the form of arc-chord data, have been traced to the time of 
Hipparchus (150 B.C.) through the work of Ptolemy (150 A.D.). Ptolemy’s table, the result of 
meticulous measurements of inscribed regular polygons and an arc addition formula, was used by 
astronomers for better than one-thousand years [7]. The arc addition formula may be considered 
as the basis for today’s trigonometric addition formulas. 

Henry Briggs (1561-1631), credited with the base-first approach to logarithms, presented the 
properties as well as applications, a method of construction, and the first table of common 
logarithms in his Arithmetica Logarithmica (1624). This work and a later table containing sines, 
tangents, log sines, and log tangents, from 0° to 90°, in increments of one-hundredth of a degree, 
and with function values to fifteen decimal places, served as the standard for all later refinements. 
Of particular interest here is Briggs’ computation scheme and an extension of that scheme 
suggested three hundred years later by Glaisher [4]. An outline of these procedures serves as a 
revealing introduction to the unified scheme used on today’s hand calculators. 

After defining the logarithms of 1 and 10 to be 0 and 1, respectively, Briggs began extracting
successive square roots of 10, reasoning that each such extraction corresponded to halving the
corresponding logarithm. Desiring 15 figure accuracy, he carried 32 decimal places and continued
until the root extracted was within $10^{-15}$ of unity. This tedious process required 54 such
extractions and yielded a skeletal table of values $\alpha_0, \alpha_1, \ldots, \alpha_{54}$, where $\alpha_0 = 10$, and

(2.1) $\alpha_{k+1} = \sqrt{\alpha_k}, \quad k = 0, 1, \ldots, 53,$

thus

$$\log \alpha_k = 2^{-k}, \quad k = 0, 1, \ldots, 54.$$

To use these skeletal values, Briggs asserted that if $\alpha = 1 + a$ and $\beta = 1 + b$, with $0 < a <
10^{-15}$ and $0 < b < 10^{-15}$, then

$$\log \beta = b\,\frac{\log \alpha}{a}$$

is accurate to 15 significant figures. Using the value of $a$ from $\alpha_{54}$ (i.e., $10^{2^{-54}} - 1$), he obtained
$(\log(1 + a))/a = .43429\ldots$ as the constant of proportionality. Now for any prime number $P$,
square roots were extracted until $\beta = 1 + b$, with $b < 10^{-15}$, was obtained, then $\log \beta$ was taken
as $(.43429\ldots)b$. Finally, $\log \beta$ was doubled the same number of times as square roots were
extracted, the end result being a 15 figure approximation of $\log P$. Composite integers $N$ were
merely factored into primes whose logarithms were summed to obtain $\log N$.
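
Briggs' procedure is short enough to replay in software. The sketch below is a modern illustration, not a reconstruction of his hand computation: it uses Python's `decimal` module with a 50-digit working precision (an assumption; Briggs carried 32 places), and the function names are hypothetical.

```python
from decimal import Decimal, getcontext

getcontext().prec = 50            # working precision; an assumption for this sketch
TOL = Decimal(10) ** -15

def roots_until_small(x):
    """Extract square roots of x until the result is within 10^-15 of 1."""
    count = 0
    while x - 1 >= TOL:
        x = x.sqrt()
        count += 1
    return x, count

def briggs_constant():
    """(log(1 + a)) / a with 1 + a = alpha_54 = 10^(2^-54), so log(1 + a) = 2^-54."""
    alpha = Decimal(10)
    for _ in range(54):
        alpha = alpha.sqrt()
    return (Decimal(1) / Decimal(2) ** 54) / (alpha - 1)

def briggs_log10(p):
    """Common logarithm of p > 1 by root extraction, proportionality, and doubling."""
    beta, count = roots_until_small(Decimal(p))
    return briggs_constant() * (beta - 1) * 2 ** count

if __name__ == "__main__":
    print(briggs_constant())
    for p in (2, 3, 7):
        print(p, briggs_log10(p))
```

The doubling step multiplies the absolute error as well as the value, so the accuracy of the final logarithm rests on the relative accuracy of the proportionality rule near 1.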

Today, using results of the calculus that were developed more than a century after Briggs’
work, one can readily establish his proportionality claim. Clearly, $(\log(1 + a))/a \to \log e$, as
$a \to 0$, and so the constant of proportionality used in these calculations was an approximation to
$\log e$. The accuracy claims can be easily substantiated as well. Using the Maclaurin series
expansion for $\ln(1 + x)$, one can show that for $0 < a < 1$ and $0 < b < 1$,

$$\Bigl|\log(1 + b) - \frac{b}{a}\log(1 + a)\Bigr| < M^2/\ln 100$$

where $M = \max(a, b)$.
In 1916 Glaisher offered a method of resolving any positive real number into a product of
certain of the Briggs skeletal values given in (2.1), and perhaps powers of 10. By summing the
corresponding logarithms of the $\alpha_k$’s used in this product, the desired logarithm is obtained
without performing the root extractions or using the constant of proportionality. To resolve a
number $x$, between 1 and 10, proceed by dividing $x$ by the largest $\alpha_k$ less than $x$, then divide the
quotient by the largest $\alpha_k$ less than itself, and so on until unity is reached (to 15 decimal places).
When this process is completed, $x$ may be written as a product of the $\alpha_k$ used as divisors, and
thus $\log x$ is the sum of the corresponding $\log \alpha_k$. Glaisher concludes by noting that the required
logarithm has base 2 representation

$$\sum_{k=0}^{54} \delta_k 2^{-k}, \quad\text{where}\quad \delta_k = \begin{cases} 1, & \text{if } \alpha_k \text{ is used in the product representation,} \\ 0, & \text{otherwise.} \end{cases}$$

The idea of decomposing a given quantity into a combination (product here) of cleverly prescribed
constants is fundamental to today’s calculator schemes.
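
Glaisher's resolution amounts to a greedy pass through the skeletal values. The sketch below is an illustration under stated assumptions: the skeletal values are recomputed with Python's `decimal` module at 50 digits rather than read from a table, a divisor is used when it is less than or equal to the current quotient (so that $x = 10$ is handled), and the function names are hypothetical.

```python
from decimal import Decimal, getcontext
from fractions import Fraction

getcontext().prec = 50            # working precision; an assumption for this sketch

def skeletal_values(count=54):
    """alpha_0 = 10 and alpha_{k+1} = sqrt(alpha_k), for k = 0, ..., count - 1."""
    alphas = [Decimal(10)]
    for _ in range(count):
        alphas.append(alphas[-1].sqrt())
    return alphas

def glaisher_log10(x):
    """Resolve 1 <= x <= 10 into a product of alpha_k; return (log x, binary digits)."""
    q = Decimal(x)
    log = Fraction(0)
    bits = []
    for k, alpha in enumerate(skeletal_values()):
        if alpha <= q:            # alpha_k is used as a divisor
            q /= alpha
            log += Fraction(1, 2 ** k)
            bits.append(1)
        else:
            bits.append(0)
    return log, bits

if __name__ == "__main__":
    log7, bits7 = glaisher_log10(7)
    print(float(log7))
    print("".join(str(b) for b in bits7))
```

Since $\alpha_{k-1} = \alpha_k^2$, each skeletal value can be used at most once, so a single pass in order of increasing $k$ reproduces the repeated "largest divisor" rule.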

With the development of the calculus, series expansion became the primary method of 
approximating elementary functions. In 1669 Newton’s De Analysis [16] contained series for the 
sine, cosine, and exponential functions. A year earlier, series for the natural logarithm and inverse 
tangent were published by Mercator and Gregory, respectively. By the end of the nineteenth 
century, through the work of such men as Fourier, Chebyshev, and Weierstrass, rigor had been 
added to function approximation theory. 

Table refinements, using series expansions, continued through the first half of the twentieth 
century. Among the more recent efforts was the 1939 Project for the Computation of Mathemati- 
cal Tables, sponsored by the National Bureau of Standards, and funded through the Works 
Progress Administration. (See, for example, [10].) After more than 300 years of fine tuning, 
economy of size was probably the primary improvement over the early function value tables. 
Domain reduction techniques, depending on function properties and notational advances, allowed 
a decrease in the number of tabular entries without decreasing the domain of their application. 
Today, domain reduction techniques are fundamental to the automatic computation of function 
values. 


3. Computer Function Evaluation. Elementary function values were first generated electroni- 
cally during the 1940’s as inputs to analogue computations. In analogue computers numerical 
quantities are represented as voltages. By applying electrical devices, the equivalent of arithmetic 
operations are performed on the voltages. Among the analogue multiplication schemes is the servo 
multiplier. With this scheme a variable voltage is effectively multiplied by a constant by applying 
the voltage across a potentiometer, adjusted to the prescribed constant value. Using a resolver, a 
specially wound potentiometer, a voltage proportional to $\sin\theta$, or $\cos\theta$, is similarly obtained from
a given voltage $\theta$. Such generation is used to convert between rectangular and polar coordinates,
and thus voltages representing inverse tangents are produced in a continuous fashion as well. 

In digital computers numerical quantities are represented digit by digit, actually bit by bit, 
rather than by the intensity of an electromagnetic force. The computation circuitry for these 
machines is designed to perform only the four arithmetic operations. Function evaluation 
schemes, therefore, are dependent upon the efficient and accurate execution of an appropriate 
sequence of the four arithmetic operations. Typically, an evaluation subroutine employs a near 
minimax polynomial, or rational function, approximation on a selected interval. Given domain 
values are reduced to the interval of approximation, then stored coefficients are applied to the 
reduced domain value to produce the approximation. The scheme used to evaluate $e^x$ in the IBM
360 Scientific Subroutine package illustrates this procedure. First the rational function

$$R(x) = 1 - \frac{2x}{0.034657359x^2 + x + 9.9545957821 - \dfrac{617.97226953}{x^2 + 87.4174972022}}$$

is used to approximate $2^{-x}$ on $[0, 1]$, with maximum relative error $2 \times 10^{-7}$. Now, to find $e^x$ for a
given $x$, take $y = -x\log_2 e$ ($\log_2 e = 1.442695041$ to 10 significant figures), $n = [y]$ ($[y]$ being
the greatest integer less than or equal to $y$), and $z = y - n$. Then, $0 \le z < 1$, and

$$e^x = 2^{-n}R(z).$$


When working in base 2, $2^{-n}$ merely represents an $n$-bit shift of the binary point. Approximations
used on various digital computers through the mid 1960’s are contained in Fike [2], Hastings [5],
and Kogbetliantz [8].
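
The reduction-then-approximation pattern described above can be sketched in a few lines. This is an illustration, not the IBM subroutine itself: the coefficient 617.97226953 is a reconstruction of a constant that is garbled in this copy (it was chosen because it makes $R(0.5)$ and $R(1)$ agree with $2^{-0.5}$ and $2^{-1}$ to about seven figures), ordinary floating-point arithmetic stands in for the machine's own, and the names `R` and `exp_approx` are hypothetical.

```python
import math

LOG2_E = 1.442695041   # log_2 e to 10 significant figures, as quoted in the text

def R(x):
    """Rational approximation to 2^(-x) on [0, 1]; 617.97226953 is a reconstructed constant."""
    return 1 - 2 * x / (0.034657359 * x * x + x + 9.9545957821
                        - 617.97226953 / (x * x + 87.4174972022))

def exp_approx(x):
    """e^x = 2^(-y) with y = -x log_2 e, split as y = n + z with n = floor(y), 0 <= z < 1."""
    y = -x * LOG2_E
    n = math.floor(y)
    z = y - n
    return 2.0 ** (-n) * R(z)

if __name__ == "__main__":
    for x in (-3.0, -1.0, 0.0, 0.5, 2.0):
        approx = exp_approx(x)
        print(x, approx, approx / math.exp(x) - 1)
```

On a binary machine the factor $2^{-n}$ costs only an exponent adjustment, so all of the arithmetic work is in the single evaluation of $R$.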


4. The CORDIC Scheme. By the early 1970’s, miniaturization of electronics circuitry had 
advanced to a level which allowed the inclusion of function evaluation capabilities on hand 
calculators. A first difference between calculator circuitry and digital computer circuitry was the 
binary-coded decimal (BCD) representation of numbers. Hardware for conversion between the 
user’s base 10 preference and the off-on switching simplicities of base 2 was not practical because 
of cost limitations and the fact that only a few arithmetic operations are used with each calculator 
input /output [12]. In BCD each digit of a base 10 numeral is represented in binary form, and the 
arithmetic operations, at least addition and subtraction, are wired accordingly. In choosing the 
function approximation schemes, calculation speed required that divisions and multiplications be 
avoided, and size dictated the use of a uniform scheme applicable to various elementary functions. 
For speed and efficiency, Hewlett-Packard selected the CORDIC (Coordinate Rotation Digital 
Computer) scheme for trigonometric function evaluation on the HP-35 [1]. Today, Hewlett- 
Packard, Texas Instruments, and many other manufacturers use some version of this scheme for 
all function evaluation. 

The CORDIC computing technique was first employed by Volder in 1959 [14] to solve the
trigonometric relationships that arise in navigation problems. Involving only a fixed sequence of
additions, or subtractions, and binary shifts, this scheme was used to quickly and systematically
approximate the value of a trigonometric function or its inverse. Following generalizations to base
10 arithmetic [13], and to other elementary functions [15], a CORDIC arithmetic processor chip is
now available to perform multiplications and divisions, to calculate square roots, and to evaluate
the sine, cosine, tangent, arctangent, sinh, cosh, tanh, arctanh, ln, and exp functions [6]. In binary
form the unified CORDIC scheme consists only of the iterative equations


\begin{align}
x_{k+1} &= x_k - m\,\delta_k\,y_k\,2^{-k}, \tag{4.1}\\
y_{k+1} &= y_k + \delta_k\,x_k\,2^{-k}, \tag{4.2}\\
z_{k+1} &= z_k - \delta_k\,\varepsilon_k, \tag{4.3}\\
\delta_k &= \pm 1, \quad \text{for } k = 0, 1, \ldots, n, \tag{4.4}
\end{align}

where m = 1, 0, or $-1$ is a mode indicator and $\{\varepsilon_k\}$ is a sequence of prestored constants
depending on m. By appropriately selecting $x_0$, $y_0$, $z_0$, and the sign of each $\delta_k$, these four
equations may be used to generate approximations for any of the elementary functions mentioned
above.
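As a rough illustration, equations (4.1) through (4.4) amount to the loop below. The function name `cordic` and the callback `choose_delta` are hypothetical; the callback stands for whatever rule is used to pick each sign, and floating-point multiplication by $2^{-k}$ stands in for the k-bit shift a calculator would perform.

```python
def cordic(x, y, z, m, eps, choose_delta):
    """Apply (4.1)-(4.3) once for each prestored constant eps[k], k = 0, 1, ..., n.

    m            -- mode indicator: 1, 0, or -1
    eps          -- list of the constants eps_k (their choice depends on m)
    choose_delta -- function (x, y, z) -> +1 or -1, i.e. the rule behind (4.4)
    """
    for k, eps_k in enumerate(eps):
        delta = choose_delta(x, y, z)
        shift = 2.0 ** -k  # in hardware this is a k-bit shift, not a multiplication
        # The three updates are simultaneous: each uses the old x, y, z.
        x, y, z = (
            x - m * delta * y * shift,  # (4.1)
            y + delta * x * shift,      # (4.2)
            z - delta * eps_k,          # (4.3)
        )
    return x, y, z


if __name__ == "__main__":
    # With m = 0 and every sign fixed at +1, y accumulates x0 * sum(2**-k).
    print(cordic(3.0, 0.0, 0.0, 0, [1.0, 0.5, 0.25, 0.125], lambda x, y, z: 1))
```

Keeping the sign rule as a parameter mirrors the text: the same three update equations serve every function, and only the starting values, the constants, and the rule for $\delta_k$ change.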

Before deriving the CORDIC scheme, consider the following decomposition theorem which
summarizes observations of Volder [14] and Walther [15], and upon which the scheme depends. 


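THEOREM. Suppose $\varepsilon_0 \ge \varepsilon_1 \ge \varepsilon_2 \ge \cdots \ge \varepsilon_n > 0$ is a finite sequence of real numbers such that

\[
\varepsilon_k \le \sum_{j=k+1}^{n} \varepsilon_j + \varepsilon_n, \quad \text{for } 0 \le k \le n, \tag{i}
\]

and suppose r is a real number such that

\[
|r| \le \sum_{j=0}^{n} \varepsilon_j + \varepsilon_n. \tag{ii}
\]

If $s_0 = 0$, and $s_{k+1} = s_k + \delta_k \varepsilon_k$ for $0 \le k \le n$, where

\[
\delta_k = \begin{cases} 1, & \text{if } r \ge s_k,\\ -1, & \text{if } r < s_k, \end{cases} \tag{iii}
\]

then

\[
|r - s_k| \le \sum_{j=k}^{n} \varepsilon_j + \varepsilon_n, \quad \text{for } 0 \le k \le n, \tag{iv}
\]

and so in particular $|r - s_{n+1}| \le \varepsilon_n$.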


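Proof. (Induction on k). For k = 0,

\[
|r - s_0| = |r| \le \sum_{j=0}^{n} \varepsilon_j + \varepsilon_n,
\]

using (ii). Assuming $|r - s_k| \le \sum_{j=k}^{n} \varepsilon_j + \varepsilon_n$, consider $|r - s_{k+1}|$. Note that from (iii), $\delta_k$ and
$r - s_k$ have the same sign and thus

\[
|r - s_{k+1}| = |r - s_k - \delta_k \varepsilon_k| = \bigl|\,|r - s_k| - \varepsilon_k\,\bigr|.
\]

Now,

\[
|r - s_k| - \varepsilon_k \le \sum_{j=k}^{n} \varepsilon_j + \varepsilon_n - \varepsilon_k = \sum_{j=k+1}^{n} \varepsilon_j + \varepsilon_n,
\]

by the inductive hypothesis. Also, from (i),

\[
-\Bigl(\sum_{j=k+1}^{n} \varepsilon_j + \varepsilon_n\Bigr) \le -\varepsilon_k,
\]

and so

\[
-\Bigl(\sum_{j=k+1}^{n} \varepsilon_j + \varepsilon_n\Bigr) \le |r - s_k| - \varepsilon_k.
\]

Combining these two results,

\[
\bigl|\,|r - s_k| - \varepsilon_k\,\bigr| \le \sum_{j=k+1}^{n} \varepsilon_j + \varepsilon_n,
\]

and the theorem is true.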

Note that each approximant $\varepsilon_j$ is used in the decomposition described above. This approach is
favored over the seemingly more natural option of allowing $\delta_k = 0$ in some cases, because of the
associated circuit simplicity. Condition (i) allows the use of all $\varepsilon_k$. Should $|r - s_k|$ be less than $\varepsilon_n$
for some k < n, then $\varepsilon_k$ can be added and the remaining $\varepsilon_j$ subtracted, returning again to
$|r - s_{n+1}| \le \varepsilon_n$.
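The decomposition can be tried numerically. The sketch below is illustrative only: `satisfies_condition_i` and `decompose` are hypothetical names, and the sample sequence $\varepsilon_k = \tan^{-1} 2^{-k}$ with n = 20 is simply one convenient choice that meets hypothesis (i).

```python
import math


def satisfies_condition_i(eps):
    """Check hypothesis (i): eps[k] <= sum(eps[k+1:]) + eps[n] for every k."""
    return all(e <= math.fsum(eps[k + 1:]) + eps[-1] for k, e in enumerate(eps))


def decompose(r, eps):
    """Choose signs by rule (iii); return (signs, s) with s = sum(signs[k] * eps[k])."""
    s = 0.0
    signs = []
    for e in eps:
        delta = 1 if r >= s else -1  # rule (iii): every eps[k] is used, never skipped
        signs.append(delta)
        s += delta * e
    return signs, s


if __name__ == "__main__":
    n = 20
    eps = [math.atan(2.0 ** -k) for k in range(n + 1)]
    signs, s = decompose(1.0, eps)
    print(satisfies_condition_i(eps), abs(1.0 - s), eps[-1])
```

For r = 1 the final gap printed should not exceed the last constant, which is what the theorem promises whenever (i) and (ii) hold.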

To obtain the CORDIC scheme, suppose $\theta$ is decomposed in the fashion described above, and
the trigonometric addition formulas are applied to the approximation $s_{k+1} = s_k + \delta_k \varepsilon_k$. Then

\begin{align}
\cos(s_{k+1}) &= \cos s_k \cos \varepsilon_k - \delta_k \sin s_k \sin \varepsilon_k \notag\\
&= \cos \varepsilon_k\,(\cos s_k - \delta_k \sin s_k \tan \varepsilon_k) \tag{4.5}
\end{align}

and

\begin{align}
\sin(s_{k+1}) &= \sin s_k \cos \varepsilon_k + \delta_k \cos s_k \sin \varepsilon_k \notag\\
&= \cos \varepsilon_k\,(\sin s_k + \delta_k \cos s_k \tan \varepsilon_k). \tag{4.6}
\end{align}

Now, if $s_0 = 0$, and $\varepsilon_k = \tan^{-1} 2^{-k}$ for $0 \le k \le n$, and if $x_{k+1}$ and $y_{k+1}$ are associated with
$\cos(s_{k+1})$ and $\sin(s_{k+1})$, respectively, then but for the $\cos \varepsilon_k$ factor, equations (4.5) and (4.6) could
be replaced by (4.1) and (4.2) with m = 1. Actually, each step of (4.1) and (4.2) omits one factor
$\cos \varepsilon_k$, so if $x_0 = 1$, $y_0 = 0$, and $K = \prod_{j=0}^{n} \cos \varepsilon_j$, then
$\cos(s_{n+1}) = K x_{n+1}$ and $\sin(s_{n+1}) = K y_{n+1}$. So by choosing $x_0 = K$ and $y_0 = 0$, we obtain
$x_{n+1} = \cos(s_{n+1})$ and $y_{n+1} = \sin(s_{n+1})$ from (4.1) and (4.2), with m = 1.

To verify that the choice $\varepsilon_k = \tan^{-1} 2^{-k}$ satisfies hypothesis (i) of the decomposition theorem,
the mean value theorem may be applied to the arctangent function to obtain

\[
\varepsilon_j - \varepsilon_{j+1} < \frac{2^{j+1}}{1 + 2^{2j+2}}
\]

and

\[
\varepsilon_j > \frac{2^{j}}{1 + 2^{2j}}, \quad \text{for } j = 0, 1, \ldots, n.
\]

Thus

\[
\varepsilon_k - \varepsilon_n = \sum_{j=k}^{n-1} (\varepsilon_j - \varepsilon_{j+1}) < \sum_{j=k+1}^{n} \frac{2^{j}}{1 + 2^{2j}} < \sum_{j=k+1}^{n} \varepsilon_j,
\]

establishing the desired result.

Clearly $\sum_{j=0}^{n} \varepsilon_j > \pi/2$, and so from (ii) of the decomposition theorem we need only require that
$|\theta| \le \pi/2$. Domain reduction to this interval may be achieved through division of a given $\theta$ by
$\pi/2$, or through repeated subtractions of $\pi$ (when $\theta > 0$).

In addition to equations (4.1) and (4.2), the complete CORDIC scheme includes the decomposition
equation (4.3), in which the approximation $s_{n+1}$ to a given $\theta$ is formed, as well as the
assignment equation (4.4). Taking $z_0 = \theta$, and

\[
\delta_k = \begin{cases} 1, & \text{if } z_k \ge 0,\\ -1, & \text{if } z_k < 0, \end{cases}
\]

the $s_k$ of the decomposition theorem are related to the $z_k$ of (4.3) by $z_k = \theta - s_k$. Here the $\delta_k$ are
selected to force $z_k$ toward zero, hence forcing $\sum_{j=0}^{n} \delta_j \varepsilon_j$ toward $\theta$. Volder termed this selection of
the $\delta_k$ the rotation mode of the CORDIC scheme. This title is descriptive, for in the rotation mode
we begin with $(x_0, y_0)$ on the x-axis and rotate $\pm\varepsilon_k$ radians in the kth step until after n steps,
$(x_{n+1}, y_{n+1})$ lies on the terminal side of an angle whose radian measure is within $\varepsilon_n$ of the given $\theta$.
(See Fig. 1.)


FIG. 1. The Rotation Mode.
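A minimal sketch of the rotation mode for m = 1 follows. The name `cordic_sin_cos` and the default n = 30 are assumptions made for the example. The start value is $x_0 = \prod \cos \varepsilon_k$, which cancels the factor $1/\cos \varepsilon_k$ that each step of (4.1) and (4.2) introduces.

```python
import math


def cordic_sin_cos(theta, n=30):
    """Rotation mode with m = 1; intended for |theta| <= pi/2. Returns (cos, sin) estimates."""
    eps = [math.atan(2.0 ** -k) for k in range(n + 1)]  # prestored constants
    K = math.prod(math.cos(e) for e in eps)             # product of the cos(eps_k)
    x, y, z = K, 0.0, theta
    for k, e in enumerate(eps):
        delta = 1 if z >= 0 else -1                      # drive z toward zero
        shift = 2.0 ** -k
        x, y, z = x - delta * y * shift, y + delta * x * shift, z - delta * e
    return x, y


if __name__ == "__main__":
    c, s = cordic_sin_cos(0.7)
    print(c, math.cos(0.7))
    print(s, math.sin(0.7))
```

Since the final angle is within $\varepsilon_n$ of $\theta$, the two outputs should agree with the library values to roughly that many radians.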
A first indication of the versatility of the CORDIC scheme is encountered in the computation
of inverse tangents. To find $\theta = \tan^{-1} w$ for a given w, we think of w as $y_0/x_0$ where $(x_0, y_0)$ is on
the terminal side of $\theta$. Then the $\delta_k$ are selected so that the terminal side of $\theta$ is rotated to the
x-axis. The negative of these accumulated rotations forms $\theta$, within $\varepsilon_n$, and is recorded as $z_{n+1}$, if
we begin with $z_0 = 0$. Volder terms this selection of $\delta_k$ to force $y_k$ toward zero, i.e.,

\[
\delta_k = \begin{cases} 1, & \text{if } y_k < 0,\\ -1, & \text{if } y_k \ge 0, \end{cases}
\]

the vectoring mode of the CORDIC scheme. (See Figure 2.) In the vectoring mode, if w is given,
$z_0 = 0$, $y_0$ is selected to satisfy (ii) of the decomposition theorem, and $x_0 = y_0/w$, then

\[
z_{n+1} = \tan^{-1} w \quad \text{and} \quad x_{n+1} = \frac{1}{K}\sqrt{x_0^{2} + y_0^{2}},
\]

where $K = \prod_{j=0}^{n} \cos \varepsilon_j$ as before.


FIG. 2. The Vectoring Mode.
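The vectoring mode can be sketched the same way. The name `cordic_arctan`, the default n = 30, and the requirement $x_0 > 0$ are assumptions of this example; the second return value undoes the stretch of the vector by multiplying by $\prod \cos \varepsilon_k$.

```python
import math


def cordic_arctan(y0, x0, n=30):
    """Vectoring mode with m = 1 and x0 > 0.

    Returns (angle, length): estimates of atan(y0/x0) and sqrt(x0**2 + y0**2).
    """
    eps = [math.atan(2.0 ** -k) for k in range(n + 1)]
    K = math.prod(math.cos(e) for e in eps)
    x, y, z = x0, y0, 0.0
    for k, e in enumerate(eps):
        delta = 1 if y < 0 else -1                       # drive y toward zero
        shift = 2.0 ** -k
        x, y, z = x - delta * y * shift, y + delta * x * shift, z - delta * e
    return z, K * x                                      # K removes the accumulated stretch


if __name__ == "__main__":
    print(cordic_arctan(3.0, 4.0), math.atan(0.75))
```

For the point (4, 3) the sketch should return an angle close to $\tan^{-1} 0.75$ and a length close to 5.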


Recall that in today's hand calculators, base 10 arithmetic is employed, so in actuality
$\varepsilon_k = \tan^{-1} 10^{-k}$, and equations (4.1) through (4.4) are evaluated 10 times for each k.

In moving to hyperbolic functions, the application of hyperbolic addition formulas to approximation
$s_{k+1}$ would leave (4.2) unchanged and require that $m = -1$ in equation (4.1), if $x_k$
and $y_k$ are associated with $\cosh s_k$ and $\sinh s_k$, respectively, and if $\varepsilon_k = \tanh^{-1} 2^{-k}$. Unfortunately,
in this case the $\varepsilon_k$ do not satisfy (i) of the decomposition theorem for all k, and so for certain k the
equations (4.1) through (4.4) must be evaluated twice [15]. With this in mind, if $K' = \prod \cosh \varepsilon_j$,
where the product contains the needed duplications, $x_0 = K'$, $y_0 = 0$, $z_0 = \theta$, and the $\delta_k$ are
selected for the rotation mode, then $x_{n+1} = \cosh \theta$, and $y_{n+1} = \sinh \theta$, so

\[
e^{\theta} = x_{n+1} + y_{n+1}.
\]


In the hyperbolic vectoring mode, if $x_0$ and $y_0$ are given and $z_0 = 0$, then

\[
z_{n+1} = \tanh^{-1}(y_0/x_0) \quad \text{and} \quad x_{n+1} = \frac{1}{K'}\sqrt{x_0^{2} - y_0^{2}}.
\]

Perhaps the most useful applications of this mode are realized when for a given w, $x_0 = w + 1$
and $y_0 = w - 1$, or $x_0 = w + .25$ and $y_0 = w - .25$. In the first case we obtain

\[
\ln w = 2 z_{n+1},
\]

while the latter case yields

\[
\sqrt{w} = K' x_{n+1}.
\]
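The hyperbolic case can be sketched as below. Several details are assumptions not stated in the text: the index starts at k = 1 (since $\tanh^{-1} 1$ is infinite), the repeated indices are taken to be 4, 13, 40, ... (the rule usually attributed to Walther), n = 30, and all function names are hypothetical. With these choices the arguments must stay within roughly $|\theta| \le 1.1$, $0.11 \le w \le 9.3$ for the logarithm, and $0.03 \le w \le 2.3$ for the square root.

```python
import math


def hyperbolic_shifts(n):
    """Exponents 1..n, with k = 4, 13, 40, ... listed twice (assumed repetition rule)."""
    shifts, repeat = [], 4
    for k in range(1, n + 1):
        shifts.append(k)
        if k == repeat:
            shifts.append(k)
            repeat = 3 * repeat + 1
    return shifts


def hyperbolic_cordic(x, y, z, shifts, vectoring):
    """Iterate (4.1)-(4.3) with m = -1 and eps_k = atanh(2**-k)."""
    for k in shifts:
        if vectoring:
            delta = 1 if y < 0 else -1   # drive y toward zero
        else:
            delta = 1 if z >= 0 else -1  # drive z toward zero
        shift = 2.0 ** -k
        x, y, z = x + delta * y * shift, y + delta * x * shift, z - delta * math.atanh(shift)
    return x, y, z


SHIFTS = hyperbolic_shifts(30)
# Product of cosh(eps_k), duplicates included.
K_PRIME = math.prod(math.cosh(math.atanh(2.0 ** -k)) for k in SHIFTS)


def cordic_exp(theta):
    x, y, _ = hyperbolic_cordic(K_PRIME, 0.0, theta, SHIFTS, vectoring=False)
    return x + y                         # cosh(theta) + sinh(theta)


def cordic_ln(w):
    _, _, z = hyperbolic_cordic(w + 1.0, w - 1.0, 0.0, SHIFTS, vectoring=True)
    return 2.0 * z                       # atanh((w - 1)/(w + 1)) = (ln w)/2


def cordic_sqrt(w):
    x, _, _ = hyperbolic_cordic(w + 0.25, w - 0.25, 0.0, SHIFTS, vectoring=True)
    return K_PRIME * x                   # (w + .25)**2 - (w - .25)**2 = w


if __name__ == "__main__":
    print(cordic_exp(0.5), cordic_ln(2.0), cordic_sqrt(2.0))
```

Outside the stated ranges a further argument reduction would be needed; that step is not shown here.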

A final surprising feature of the CORDIC scheme is its utility in performing multiplications
and divisions if equation (4.1) is altered by setting m = 0, and $\varepsilon_k = 2^{-k}$. In the rotation mode, if
$y_0 = 0$, and $x_0$ and $z_0$ are given, $\sum_{k=0}^{n} \delta_k \varepsilon_k$ approximates $z_0$, thus

\[
y_{n+1} = x_0 z_0.
\]

In the vectoring mode, if $z_0 = 0$, and $x_0$ and $y_0$ are given, then $y_0 + x_0\bigl[\sum_{k=0}^{n} \delta_k \varepsilon_k\bigr]$ is forced to
zero, hence

\[
z_{n+1} = -\sum_{k=0}^{n} \delta_k \varepsilon_k = y_0/x_0.
\]

The multiplication-division form of the CORDIC scheme is analogous to the pseudo multiplication
and division of Meggitt [11]. A comparison of the techniques of Meggitt, Volder, and others,
with error analyses, can be found in Franke [3]. Numerical examples of CORDIC computations
appear in Kropa [9].
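For m = 0 the x-register never changes, so the scheme reduces to the short loops below. The names, the default n = 40, and the restrictions $|z_0| \le 2$, $x_0 > 0$, $|y_0/x_0| \le 2$ (which come from hypothesis (ii) with $\varepsilon_k = 2^{-k}$) are assumptions of this sketch.

```python
def cordic_multiply(x0, z0, n=40):
    """Rotation mode with m = 0: returns an estimate of x0 * z0 for |z0| <= 2."""
    y, z = 0.0, z0
    for k in range(n + 1):
        delta = 1 if z >= 0 else -1  # drive z toward zero
        y += delta * x0 * 2.0 ** -k  # (4.2); x stays equal to x0 because m = 0
        z -= delta * 2.0 ** -k       # (4.3) with eps_k = 2**-k
    return y


def cordic_divide(y0, x0, n=40):
    """Vectoring mode with m = 0: returns an estimate of y0 / x0 (x0 > 0, |y0/x0| <= 2)."""
    y, z = y0, 0.0
    for k in range(n + 1):
        delta = 1 if y < 0 else -1   # drive y toward zero
        y += delta * x0 * 2.0 ** -k
        z -= delta * 2.0 ** -k
    return z


if __name__ == "__main__":
    print(cordic_multiply(3.0, 1.25), cordic_divide(1.0, 3.0))
```

The multiplier is in effect rewritten as a sum of signed powers of two, so each step is a shift and an add.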

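The various modes of the generalized CORDIC scheme are summarized in the table below.


The Generalized Binary CORDIC Scheme

\begin{align*}
x_{k+1} &= x_k - m\,\delta_k\,y_k\,2^{-k}\\
y_{k+1} &= y_k + \delta_k\,x_k\,2^{-k} \qquad \text{for } k = 0, 1, \ldots, n.\\
z_{k+1} &= z_k - \delta_k\,\varepsilon_k
\end{align*}

Choice of $\delta_k$
    Rotation ($z_k \to 0$): $\delta_k = 1$ if $z_k \ge 0$, $\delta_k = -1$ if $z_k < 0$.
    Vectoring ($y_k \to 0$): $\delta_k = 1$ if $y_k < 0$, $\delta_k = -1$ if $y_k \ge 0$.

m = 0, $\varepsilon_k = 2^{-k}$
    Rotation: $x_0$ given, $y_0 = 0$, $z_0$ given, yields $y_{n+1} = x_0 z_0$.
    Vectoring: $x_0$ given, $y_0$ given, $z_0 = 0$, yields $z_{n+1} = y_0/x_0$.

m = 1, $\varepsilon_k = \tan^{-1} 2^{-k}$, $K = \prod \cos \varepsilon_k$
    Rotation: $x_0 = K$, $y_0 = 0$, $z_0 = \theta$, yields $x_{n+1} = \cos\theta$, $y_{n+1} = \sin\theta$.
    Vectoring: $x_0$ given, $y_0$ given, $z_0 = 0$, yields $z_{n+1} = \tan^{-1}(y_0/x_0)$, and
        $x_{n+1} = \sqrt{x_0^{2} + y_0^{2}}\,/K$.

m = $-1$, $\varepsilon_k = \tanh^{-1} 2^{-k}$, $K' = \prod \cosh \varepsilon_k$
    Rotation: $x_0 = K'$, $y_0 = 0$, $z_0 = \theta$, yields $x_{n+1} = \cosh\theta$, $y_{n+1} = \sinh\theta$,
        $e^{\theta} = x_{n+1} + y_{n+1}$.
    Vectoring: $x_0$ given, $y_0$ given, $z_0 = 0$, yields $z_{n+1} = \tanh^{-1}(y_0/x_0)$, and
        $x_{n+1} = \sqrt{x_0^{2} - y_0^{2}}\,/K'$.
        $x_0 = w + 1$, $y_0 = w - 1$ yields $\ln w = 2 z_{n+1}$.
        $x_0 = w + .25$, $y_0 = w - .25$ yields $\sqrt{w} = K' x_{n+1}$.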
We will demonstrate that not only this min-max value, but indeed the geometric mean of these
$\binom{d}{s}$ cardinalities is at least $n^{s/d}$. This lower bound on the geometric mean is achieved when the
points are arranged in any hypercuboid of lattice points orthogonal to the axes (including the
trivial pattern of a row of n points along one axis). The min-max is certainly achievable if n
happens to be a perfect dth power. The complementary problem of determining an upper bound
for mean projection size is trivial as the points $\{(i, i, \ldots, i): 1 \le i \le n\}$ do not compress under
projections.

This problem arose [1] in establishing a lower bound on the number of comparisons necessary
to find a d-key record which has been stored in an array of n such records and is specified by s of
its keys. We feel the problem is of interest in its own right as a fundamental result about
projections of a finite set, and are pleased that it can be demonstrated by elementary techniques
requiring only multiple induction and a finite form of Hölder's Inequality.

We now introduce the required notation and formally state the theorem.

Let $X = \{x_1, \ldots, x_n\}$ denote an arbitrary (finite) set of n distinct points in Euclidean d-space,
$E^{d}$, and let $N_d = \{1, \ldots, d\}$ be the index set representing the coordinates. We wish to consider
projections onto a subspace $E^{s}$ with s < d effected by retaining s coordinates from each point and
deleting the other d − s coordinates. There are $\binom{d}{s}$ possible projections for us to choose from. We
identify a particular projection by selecting $S \subset N_d$ with $|S| = s$. For example, $S = N_s \subset N_d$
identifies the projection that keeps the first s coordinates and discards the last d − s. Of course
discarding some coordinates might leave some of the $x_i$'s identical, that is, different points can
project to the same image point in $E^{s}$. We let $\pi(X, S)$ denote the set of image points that result
when the set X is projected onto the subspace $E^{s}$ whose s coordinates are given by S. Let $p(X, S)$
be the order of $\pi(X, S)$; clearly $1 \le p(X, S) \le n$. We shall think of these projections as
"shadows" of the original set. This term is most appropriate when s = d − 1. For other values of s
the term may be a bit contrived, but it still provides a perspective which helps one recall the result.

The original problem is to find the minimum over all n-sets X of the maximum over all s-sets S 
of p(X, S). Our theorem is somewhat stronger in that it provides a sharp lower bound on the 
geometric mean of all the projections. 


THEOREM. For any set X of n points in $E^{d}$, the projection size is bounded by

\[
\text{Geom. Mean } \{p(X, S): S \subset N_d \text{ and } |S| = s\} \ge n^{s/d}. \tag{1}
\]

Furthermore, if n can be factored as $n = a_1 a_2 \cdots a_d$ with each $a_i$ a natural number, then the bound is
attained by choosing X to be the Cartesian product $\times_{i=1}^{d} N_{a_i}$.

Proof. We first verify that the specified Cartesian product attains the bound. Evidently,
$p(X, S) = \prod_{i \in S} a_i$. Since each i lies in $\binom{d-1}{s-1}$ different s-sets, we have

\[
\prod_{|S|=s} p(X, S) = \prod_{i=1}^{d} a_i^{\binom{d-1}{s-1}} = n^{\binom{d-1}{s-1}}. \tag{2}
\]

Taking the $\binom{d}{s}$th root of this equation yields

\[
\text{Geom. Mean } \{p(X, S): S \subset N_d \text{ and } |S| = s\} = n^{s/d}. \tag{3}
\]

Notice that a suitable factorization exists for all n and d, since $a_1 = n$ and $a_i = 1$ for $i \ge 2$
provides a trivial factorization in every case.
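The quantities involved are easy to compute for small examples. The sketch below is illustrative only: the function names are hypothetical, coordinates are indexed from 0 rather than 1, and the sample set is the product $N_2 \times N_3 \times N_4$, for which the three two-coordinate shadows have 6, 8, and 12 points.

```python
import math
from itertools import combinations, product


def p(X, S):
    """Number of distinct image points when only the coordinates in S are kept."""
    return len({tuple(point[i] for i in S) for point in X})


def geometric_mean_of_shadows(X, d, s):
    """Geometric mean of p(X, S) over all s-element subsets S of the d coordinates."""
    sizes = [p(X, S) for S in combinations(range(d), s)]
    return math.exp(sum(math.log(size) for size in sizes) / len(sizes))


if __name__ == "__main__":
    box = list(product(range(2), range(3), range(4)))  # n = 24 points, d = 3
    print(geometric_mean_of_shadows(box, 3, 2), 24 ** (2 / 3))
    diagonal = [(i, i, i) for i in range(5)]           # shadows do not shrink at all
    print(geometric_mean_of_shadows(diagonal, 3, 2), 5 ** (2 / 3))
```

The box attains the value $n^{s/d}$, while the diagonal set stays at n, well above it.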
To prove the bound, we observe that the geometric mean involves a $\binom{d}{s}$th root, so we raise
both sides to the $\binom{d}{s}$ power to get

\[
\prod_{|S|=s} p(X, S) \ge n^{\binom{d-1}{s-1}}. \tag{4}
\]

We must verify this inequality to prove the bound. First, we shall treat the case s = d − 1 with the
proof obtained by induction on d and depending on the lemma presented below. Then we
complete the proof for arbitrary s by induction on the difference d − s. First, the lemma:


LEMMA (Hölder's Inequality). For all positive reals $x_{ij}$ and positive integer p,

\[
\prod_{j=1}^{p} \sum_{i=1}^{k} x_{ij} \ge \Bigl( \sum_{i=1}^{k} \prod_{j=1}^{p} x_{ij}^{1/p} \Bigr)^{p}.
\]

Proof. This is in essence a finite variation of Hölder's Inequality reported as inequality (2.7.2)
in Hardy, Littlewood, and Pólya [2, page 22]. For our purposes, we have specialized (2.7.2) by
selecting $\alpha = \beta = \cdots = \lambda = 1/p$ and we have raised both sides to the pth power. The more
general form of (2.7.2) requires only that $\alpha, \beta, \ldots, \lambda$ be p positive variables whose sum is 1. The
reader might notice that the case p = 1 gives a trivial equality while p = 2 is just a version of the
familiar Cauchy-Schwarz inequality. The inequalities for $p \ge 3$ are less well known.
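A quick numerical check of the lemma may help fix the roles of the two indices. In this sketch the name `holder_sides` is hypothetical, and `x[i][j]` holds $x_{ij}$ with i running over the k rows and j over the p columns.

```python
import math


def holder_sides(x):
    """Return (lhs, rhs) of the lemma for a k-by-p array of positive reals x[i][j]."""
    k, p = len(x), len(x[0])
    # Left side: product over columns j of the column sums.
    lhs = math.prod(sum(x[i][j] for i in range(k)) for j in range(p))
    # Right side: p-th power of the sum over rows i of the row's geometric mean.
    rhs = sum(math.prod(x[i]) ** (1.0 / p) for i in range(k)) ** p
    return lhs, rhs


if __name__ == "__main__":
    print(holder_sides([[1.0, 4.0], [9.0, 1.0]]))   # p = 2: the Cauchy-Schwarz case
    print(holder_sides([[2.0, 2.0, 2.0], [5.0, 5.0, 5.0]]))  # equal columns: equality
```

In the first example the column sums are 10 and 5, while the row terms are $\sqrt{4} = 2$ and $\sqrt{9} = 3$, so the two sides are 50 and 25.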

We now return to the proof of the theorem for the case s = d − 1, and we proceed by
induction on d. The first case, d = 2, is essentially solved by the pigeonhole principle. For i = 1
and 2, let $S_i = N_d - \{i\}$ be the set that omits i. (This strange choice of notation is made for
later convenience.) Clearly, every point in X lies in the set $\pi(X, S_1) \times \pi(X, S_2)$. Thus
$p(X, S_1)\,p(X, S_2) \ge n$ as required by (4). Now assume the inequality has been verified for
dimension d − 1 and consider the d-dimensional problem. Denote $p(X, S_1)$ by r. Thus the n
points of X lie along r lines parallel to the 1st coordinate axis. The n points also determine k
distinct hyperplanes of dimension d − 1 which are orthogonal to this 1st axis. Since the n points
lie among the rk points found by intersecting lines with hyperplanes, we have

\[
rk \ge n. \tag{5}
\]

Let $X_i$ denote the set of points in the ith hyperplane and set $n_i = |X_i|$. Clearly $1 \le n_i \le r$ and
$\sum_{i=1}^{k} n_i = n$. In each hyperplane, there are d − 1 different projections onto (d − 2)-dimensional
subspaces. Let $T_j$ for $2 \le j \le d$ denote the index set $T_j = N_d - \{1, j\}$. Now $p(X_i, T_j)$ counts the
order of the image of the projection from the ith hyperplane onto the coordinates of $T_j$. But now
we replace the first coordinate and consider the corresponding projection onto $S_j$. Because the
hyperplanes form a parallel family, no image point can have preimages in more than one
hyperplane. Consequently, for $2 \le j \le d$,

\[
p(X, S_j) = \sum_{i=1}^{k} p(X_i, T_j). \tag{6}
\]

Recalling that $p(X, S_1) = r$, we are prepared to bound $\prod_{j=1}^{d} p(X, S_j)$. First, we substitute from
(6) to get

\[
\prod_{j=1}^{d} p(X, S_j) = r \prod_{j=2}^{d} \sum_{i=1}^{k} p(X_i, T_j). \tag{7}
\]

Next we apply the Hölder Inequality with exponent p = d − 1 to find

\[
\prod_{j=1}^{d} p(X, S_j) \ge r \Bigl( \sum_{i=1}^{k} \prod_{j=2}^{d} p(X_i, T_j)^{1/(d-1)} \Bigr)^{d-1}. \tag{8}
\]

But by the induction hypothesis,

\[
\prod_{j=2}^{d} p(X_i, T_j) \ge n_i^{\,d-2}, \tag{9}
\]

and so

\[
\prod_{j=1}^{d} p(X, S_j) \ge r \Bigl( \sum_{i=1}^{k} n_i^{(d-2)/(d-1)} \Bigr)^{d-1}. \tag{10}
\]

To simplify the lower bound, we begin by writing each term as $n_i \cdot n_i^{-1/(d-1)}$, so

\[
\prod_{j=1}^{d} p(X, S_j) \ge r \Bigl( \sum_{i=1}^{k} n_i\, n_i^{-1/(d-1)} \Bigr)^{d-1}. \tag{11}
\]

Next, we distribute the leading factor of r into the sum. Because of the power to which the sum is
raised, the factor r appears as $r^{1/(d-1)}$ in each term:

\[
\prod_{j=1}^{d} p(X, S_j) \ge \Bigl( \sum_{i=1}^{k} n_i \Bigl( \frac{r}{n_i} \Bigr)^{1/(d-1)} \Bigr)^{d-1}. \tag{12}
\]

But we have already observed that $n_i \le r$, so the quantity $(r/n_i)$ is bounded below by 1. Thus, we
may suppress this factor from each term to obtain

\[
\prod_{j=1}^{d} p(X, S_j) \ge \Bigl( \sum_{i=1}^{k} n_i \Bigr)^{d-1} = n^{d-1}. \tag{13}
\]

Thus, (4) has been verified when s = d − 1 for all d.

The final stage of the proof is to induct on the difference d − s. Each set of size s is contained
in d − s different sets $S_j$ of order d − 1. Whenever $S \subset S_j \subset N_d$, the projections $\pi(X, S)$ and
$\pi(\pi(X, S_j), S)$ are identical sets. (The projection is accomplished in two stages in the second
expression.) Consequently,

\[
\Bigl[ \prod_{|S|=s} p(X, S) \Bigr]^{d-s} = \prod_{j=1}^{d} \; \prod_{\substack{|S|=s \\ S \subset S_j}} p(\pi(X, S_j), S). \tag{14}
\]

But the inner product on the right represents a projection from $E^{d-1}$ to $E^{s}$. By the induction
hypothesis, we may insert the bound from inequality (4) to get

\[
\Bigl[ \prod_{|S|=s} p(X, S) \Bigr]^{d-s} \ge \prod_{j=1}^{d} p(X, S_j)^{\binom{d-2}{s-1}} = \Bigl[ \prod_{j=1}^{d} p(X, S_j) \Bigr]^{\binom{d-2}{s-1}}. \tag{15}
\]

Finally, the theorem has already been verified for projections from $E^{d}$ to $E^{d-1}$ so we may bound
the bracketed product on the right to conclude

\[
\Bigl[ \prod_{|S|=s} p(X, S) \Bigr]^{d-s} \ge n^{(d-1)\binom{d-2}{s-1}}. \tag{16}
\]

Taking d − s roots of both sides yields (4), since $(d-1)\binom{d-2}{s-1} = (d-s)\binom{d-1}{s-1}$, completing the proof for arbitrary d and s.


Recall that the question which inspired this theorem sought a lower bound on $\max\{p(X, S):
|S| = s \text{ and } S \subset N_d\}$. Since the maximum is an integer at least as large as the geometric mean, we
may conclude $\max\{p(X, S): |S| = s \text{ and } S \subset N_d\} \ge \lceil n^{s/d} \rceil$. For which values of n, s, and d can
we actually find a set X attaining the bound? When s = 1, set $m = \lceil n^{1/d} \rceil$. Let X be any subset of
$\times_{i=1}^{d} N_m$ with order n. Clearly the maximum projection attains the bound of m. For s > 1, the
conditions for attaining the bound become much more subtle. We illustrate with the smallest case,
namely s = 2 and d = 3. By detailed and boring combinatorial argument, we find that $\lceil n^{2/3} \rceil$ is
attained for all $n \le 12$ except n = 5, 10, and 11 when the maximum for every X is at least
$1 + \lceil n^{2/3} \rceil$ points. Certainly the best possible bound for other values of n, s, and d is a difficult
combinatorial problem which we leave open for the highly ambitious. One final and trivial
observation: Whenever $n = k^{d}$ happens to be a perfect dth power, the set $X = \times_{i=1}^{d} N_k$ achieves the
bound for all s.
Throughout this discussion, we have limited our attention to projections onto coordinate axes
only. However, it should be clear that any set of d mutually orthogonal directions could serve as
the "coordinates" used in the theorem.

In conclusion, there is a continuous version of the theorem. Presumably, it could be proved
directly from the integral form of Hölder's Inequality stated as inequality 188 in Hardy,
Littlewood, and Pólya [2, p. 140] namely

\[
\prod_{j=1}^{p} \Bigl( \int f_j(x)\,dx \Bigr)^{1/p} \ge \int \Bigl( \prod_{j=1}^{p} f_j(x) \Bigr)^{1/p} dx.
\]

We state the continuous version as a corollary to the finite version using $m_d(X)$ to denote the
d-dimensional measure of the set X.


COROLLARY. If X is a set with finite positive Riemann measure,

\[
\text{Geom. Mean } \{m_s(\pi(X, S)): |S| = s \text{ and } S \subset N_d\} \ge m_d(X)^{s/d}.
\]

Sketch of proof. We shall replace the infinite set X by a sequence of finite sets $X_k$ for each
natural number k. Partition $E^{d}$ into d-dimensional cubes with edge 1/k and measure $1/k^{d}$. Select
the central point of each cube to lie in $X_k$ if and only if the entire cell is contained in X. Now
$m_d(X) \ge |X_k|/k^{d}$ and analogously $m_s(\pi(X, S)) \ge p(X_k, S)/k^{s}$. Take the geometric mean over
all $S \subset N_d$ of order s and apply the theorem to the projections of $X_k$ to obtain

\[
\text{Geom. Mean } \{m_s(\pi(X, S)): |S| = s \text{ and } S \subset N_d\} \ge |X_k|^{s/d}/k^{s}. \tag{17}
\]

Since $\lim_{k \to \infty} |X_k|/k^{d} = m_d(X)$, the limit as $k \to \infty$ of the right-hand side is $m_d(X)^{s/d}$ as
required.
MISCELLANEA


103. Among the miscellaneous reading of my youth was a history of modern Europe, which 
concluded with a general survey and attempted forecast of progress in arts, science, and literature. 
So far as I can judge, this work was written about the time of Euler or Lagrange. On the subject of 
mathematics the writer’s conclusion was that fruitful investigation seemed at an end, and that 
there was little prospect of brilliant discoveries in the future. To us, a century later, this judgment 
might seem to illustrate the danger of prophesying, and lead us to look upon the author as one 
who must have been too prone to hasty conclusions. I am not sure that careful analysis would not 
show the author’s view to be less rash than it may now appear. 


—SIMON NEWCOMB, address to the New York Mathematical Society, 
Bull. N.Y. Math. Soc., 3(1894) 95-107. 
Its Applic., V. 13. Elsevier North-Holland, 1982, xv + 516 pp, $111.50. [ISBN: 0-444-86368-0] The
90651-7] Reflects advances since the 1967 edition (TR, August-September 1968), brings the bibliogra-
Beauzamy. Math. Stud., No. 68. Elsevier North-Holland, 1982, xi + 308 pp, $41.75 (P). [ISBN: 0-
444-86416-4] A readable, self-contained intermediate-level graduate text; assumes only elementary
properties, geometry of super-reflexive spaces. Exercises follow each chapter. PZ


Functional Analysis, P. Approximation of Hilbert Space Operators, Volume I. Domingo A. Herrero. 
Res. Notes in Math., No. 72. Pitman Pub, 1982, xiii + 255 pp, $19.95 (P). [ISBN: 0-273-08579-4] 
Approximations related to similarity invariant operators; norm closure characterizations; estimates 
Analysis, T, S(16-18), P, L. Asymptotic Methods in Analysis. N.G. de Bruijn. Dover Pub, 1981, xii
+ 200 pp, $4.25 (P). [ISBN: 0-486-64221-6] An unabridged and corrected republication of the Third 
Edition (1970) of the work originally published in 1958. Attention is focused mainly on methods 
(especially the saddle point method and iteration methods). Examples, chosen mainly for purposes of 
instruction, are explained in detail and can be easily followed by anyone with a certain maturity 
with respect to analysis and some general knowledge of complex function theory. LCL 


Analysis, P. Lecture Notes in Mathematics-881: Nonstandard Analysis. Robert Lutz, Michel Goze.
Springer-Verlag, 1981, xiv + 261 pp, $16.80 (P). [ISBN: 0-387-10879-3] This work presents a number
of examples illustrating the use of nonstandard analysis in the treatment of perturbation problems
in algebra and differential equations. It also includes a tutorial introduction to nonstandard
analysis and examples of its use in topology and differential calculus. AO


Algebraic Geometry, P. Lectures on p-adic Differential Equations. Bernard Dwork. Grund. der math. 
Wissenschaften, B. 253. Springer-Verlag, 1982, viii + 310 pp, $46. [ISBN: 0-387-90714-9] A study of 
p-adic properties of solution to the p-adic hypergeometric differential equation by constructing the 
so-called associated Frobenius structure. SG 


Geometry, T(15-16: 1, 2), L. Geometry: A Metric Approach with Models. Richard S. Millman, George 
D. Parker. Undergrad. Texts in Math. Springer-Verlag, 1981, x + 355 pp, $29.80. [ISBN: 0-387-
90610-X] Intended for a first rigorous course in geometry, this text is based on Birkhoff's metric
approach. Six chapters on absolute geometry are followed by chapters on the theory of parallels,
hyperbolic geometry, classical Euclidean geometry, area, isometries. Models are used to illustrate 
axioms, definitions and theorems. Numerous problems. JNC 
the Mathematical Field. John Pottage. Addison-Wesley, 1983, xxi + 480 pp, $35.95. [ISBN: 0-201-
05733-6] Scholarly work on the art of mathematical discovery via a sequence of problems in elementary
geometry. Cast in the form of a Galilean dialogue. For the non-specialist as well as the specialist.
Meticulously-executed drawings. Extensive "Annotationes." Well-graded complementary "Pro-
Algebraic Topology, P. Lecture Notes in Mathematics-941: Homotopie des Espaces de Sections. André
Legrand. Springer-Verlag, 1982, vii + 132 pp, $8.80 (P). [ISBN: 0-387-11575-7] 


Optimization, P, L. Lecture Notes in Economics and Mathematical Systems-197: Integer Programming
and Related Areas: A Classified Bibliography, 1978-1981. Ed: R. von Randow. Springer-Verlag, 1982,
xiv + 338 pp, $27 (P). [ISBN: 0-387-11203-0] The third in a series of bibliographies on integer
programming, this volume covers the period from early 1978 to mid 1981. Publications are listed by
author and by subject. AO


Probability, T(14-15: 1, 2), S. Probability and Errors for the Physical Sciences. S.K. Muthu. 
Orient Longman (US Distr: Apt Books), 1982, ix + 568 pp, $35. [ISBN: 0-86131-137-X] A rather 
expanded treatment (i.e., many examples and extended explanations) of elementary probability and 
statistics (binomial, Poisson, normal distributions) with a large chapter on curve fitting (the
principle of least squares). The language (calculus) and heuristics (motivations and applications)
will be comfortable to students in the physical sciences. LCL


Statistics, S(17), P*, L. Residuals and Influence in Regression. R. Dennis Cook, Sanford Weisberg. 
Chapman & Hall, 1982, x + 230 pp, $25. [ISBN: 0-412-24280-X] Well-written, comprehensive monograph.
First half deals with diagnostic methods using residuals, last half with a variety of methods for
the study of "influence" (e.g., the effect on the analysis of deleting a case). Methods are amply
Computer Literacy, T(13-16: 1), S. Computers in Society. Nancy and Robert A. Stern. Prentice-
Hall, 1983, xxi + 518 pp, $18.95. [ISBN: 0-13-165282-6] Well organized liberal arts survey. First
half gives overview of history, hardware (including chapter on micros and minis) and software
(including chapter on Basic), stressing the user's point of view. Remainder surveys applications
and problems in business, education, health, etc., stressing the pros and cons. Coverage is
(predictably) superficial in many areas and the bibliography seems unduly limited, but the text is
still well worth considering. GHM 


Computer Programming, T(13-16: 1). Essentials of Cobol Programming: A Structured Approach. Gerald 
N. Pitts, Barry L. Bateman. Computer Sci Pr, 1982, ix + 145 pp, $14.95 (P). [ISBN: 0-914894-34-x] A 
well-organized introductory programming text. It presents the syntax without ever muddying the 
waters with general ideas from computer science, e.g., data structures or sorting algorithms. The 
structured style of programming is emphasized. JAS 


Computer Programming, T(13: 1), S, P, L. Pascal, An Introduction to Methodical Programming, Second 
Edition. William Findlay, David A. Watt. Computer Sci Pr, 1981, x + 404 pp, $13.95 (P). [ISBN: 0- 
914894-73-0] A well-written and easy to follow introduction to structured programming and Pascal. 
This edition contains more examples and exercises and is now compatible with the proposed ISO stan- 
dard for Pascal. (First Edition, TR, June-July 1979.) CEC 


Computer Programming, S. An Introduction to Visicalc Matrixing for Apple and IBM. Harry Anbarlian. 
McGraw-Hill, 1982, xiii + 252 pp, $22.95 (P). [ISBN: 0-07-001605-4] This book is primarily for 
Apple and IBM Personal Computer users. Visicalc matrixing is presented primarily through examples. 
CEC 


Computer Programming, S, P. Basic Programs for Scientists and Engineers. Alan R. Miller. SYBEX, 
1981, xvii + 318 pp, $14.95 (P). [ISBN: 0-89588-073-3] The second in the SYBEX Programs for Scien- 
tists and Engineers series, this volume includes basic programs for the mean and standard deviation, 
random numbers, vector and matrix operations, simultaneous solution of linear equations, curve fit- 
ting, sorting, Newton’s method and numerical integration. JNC 


Computer Programming, L. Osborne CP/M User Guide, Second Edition. Thom Hogan. Osborne/McGraw-Hill, 
1982, xii + 286 pp, $15.95 (P). [ISBN: 0-931988-82-9] This excellent new edition adds a complete 
discussion of CP/M-86 for the newer 8088 and 8086 based microcomputers. MP/M II and CP/NET are also 
discussed. The coverage is very broad--including languages and major applications packages. The 
coverage is also conservative--avoiding the CP/M users group and other hobbyist support. (First
Edition, TR, April 1982.) JAS 


Computer Programming, T(15-17), S, P, L. The Science of Programming. David Gries. Texts & Mono. 
in Comp. Sci. Springer-Verlag, 1981, xiii + 366 pp, $19.80. [ISBN: 0-387-90641-X] This pioneering 
text shows many signs of hasty composition (e.g., reference to nonexistent exercises, material of 
questionable relevance, etc.) but brings us closer to the goal of making the composition of clear 
and correct programs a science rather than an art. Shows how to use predicate calculus to express 
correctness assertions at the design stage and use them to guide development so that one can 
rigorously prove correctness of the final program. Many interesting exercises, historical notes. 
GHM 


Software Systems, P, L*. Relational Database Systems: Analysis and Comparison. Ed: Joachim W. 
Schmidt, Michael L. Brodie. Springer-Verlag, 1983, xv + 618 pp, $19.80. [ISBN: 0-387-12032-7] This 
volume is a collection of working documents prepared for a group investigating the justifiability of 
an ANSI standard for relational database management systems. It includes a feature catalogue and 
detailed analyses and comparisons of fourteen database management systems supporting the relational 
data model. AQ 


Software Systems, S, P, L. UNIX Time-Sharing System: UNIX Programmer’s Manual, Revised and Expanded 
Version. Holt, Rinehart & Winston, 1983, xiv + 425 pp, (P). [ISBN: 0-03-061742-1] The complete 
reference manual for Bell Labs' UNIX operating system: commands, system calls, subroutines, and file
information. This is not an introduction to UNIX, but a concise, often cryptic set of reference 
pages for each command. Local adaptations of UNIX may not conform exactly to these reference pages, 
but the major features should be available on any UNIX system. LAS 


Computer Science, S(15-17), L*. Programming Languages: A Grand Tour. Ellis Horowitz. Computer Sci 
Pr, 1983, ix + 664 pp, $37.95. [ISBN: 0-914894-67-6] A collection of 24 reprints of important
articles on programming languages, some general articles, many on features of particular languages.
Sectioned into history and design, the Algol family, applicative languages, data abstraction,
concurrency, and languages for the '80s. RWN


Computer Science, T(14-17: 1, 2), S, L. Laboratory Minicomputing. John R. Bourne. Academic Pr, 
1981, x + 297 pp, $27. [ISBN: 0-12-119080-3] Presents background material for successful use of min- 
icomputers for activities such as acquiring analog data, displaying data, controlling real time 
processes, etc. Describes PDP-11 and LSI-11 for hardware and UNIX and C for software. As well,
there are chapters on laboratory I/O, interrupts, real time programming. Three appendices. Chapter 
exercises. References. Index. RJA 


Computer Science, S(13-8). Operating Systems: Concepts and Principles. John Zarrella. Microcom- 
puter Applic, 1979, 144 pp, $8.95 (P). [ISBN: 0-935230-00-9] Written for persons who know how to 
program but are not experts nor intend to become operating systems specialists. Very readable; key 
new terms on each page in boldface type. Many simple but useful diagrams. Ideal place for an 
average microcomputer user to begin to understand how operating systems are organized. Appendices.
References. Index. RJA 
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Throughout this discussion, we have limited our attention to projections onto coordinate axes 
only. However, it should be clear that any set of d mutually orthogonal directions could serve as 
the “coordinates” used in the theorem. 

In conclusion, there is a continuous version of the theorem. Presumably, it could be proved
directly from the integral form of Hölder’s Inequality stated as inequality 188 in Hardy,
Littlewood, and Pólya [2, p. 140], namely

$$\prod_{i=1}^{p}\Bigl(\int f_i^{\,p}\,dx\Bigr)^{1/p} \ge \int \prod_{i=1}^{p} f_i\,dx.$$


We state the continuous version as a corollary to the finite version using $m_d(X)$ to denote the
$d$-dimensional measure of the set $X$.


COROLLARY. If $X$ is a set with finite positive Riemann measure,

$$\text{Geom. Mean}\,\{m_s(\pi(X,S)) : |S| = s \text{ and } S \subset N_d\} \ge m_d(X)^{s/d}.$$


Sketch of proof. We shall replace the infinite set $X$ by a sequence of finite sets $X_k$ for each
natural number $k$. Partition $E^d$ into $d$-dimensional cubes with edge $1/k$ and measure $1/k^d$. Select
the central point of each cube to lie in $X_k$ if and only if the entire cell is contained in $X$. Now
$m_d(X) \ge |X_k|/k^d$ and analogously $m_s(\pi(X,S)) \ge p(X_k,S)/k^s$. Take the geometric mean over
all $S \subset N_d$ of order $s$ and apply the theorem to the projections of $X_k$ to obtain

(17) $\text{Geom. Mean}\,\{m_s(\pi(X,S)) : |S| = s \text{ and } S \subset N_d\} \ge |X_k|^{s/d}/k^s.$

Since $\lim_{k\to\infty} |X_k|/k^d = m_d(X)$, the limit as $k \to \infty$ of the right-hand side is $m_d(X)^{s/d}$ as
required.
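
The finite inequality used in the sketch can be tried out numerically. The following is a minimal sketch, assuming the finite theorem reads as inequality (17) suggests: for a finite set $X$ of lattice points in $d$ dimensions, the geometric mean of the sizes of its projections onto all $s$-element sets of coordinates is at least $|X|^{s/d}$. The function names and the random test set are illustrative only and do not come from the article.

```python
import itertools
import math
import random

def projection(points, coords):
    """Project each point onto the coordinates listed in coords."""
    return {tuple(p[i] for i in coords) for p in points}

def geometric_mean_of_projections(points, d, s):
    """Geometric mean of |projection| over all s-subsets of the d coordinates."""
    sizes = [len(projection(points, S))
             for S in itertools.combinations(range(d), s)]
    return math.exp(sum(math.log(v) for v in sizes) / len(sizes))

random.seed(1)
grid = list(itertools.product(range(5), repeat=3))
X = random.sample(grid, 40)          # 40 lattice points in a 5 x 5 x 5 box
for s in (1, 2, 3):
    print(s, geometric_mean_of_projections(X, 3, s), len(X) ** (s / 3))
```

For $s = d$ the two printed numbers agree up to rounding, since the only projection is $X$ itself.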

MISCELLANEA


103. Among the miscellaneous reading of my youth was a history of modern Europe, which 
concluded with a general survey and attempted forecast of progress in arts, science, and literature. 
So far as I can judge, this work was written about the time of Euler or Lagrange. On the subject of 
mathematics the writer’s conclusion was that fruitful investigation seemed at an end, and that 
there was little prospect of brilliant discoveries in the future. To us, a century later, this judgment
might seem to illustrate the danger of prophesying, and lead us to look upon the author as one 
who must have been too prone to hasty conclusions. I am not sure that careful analysis would not 
show the author’s view to be less rash than it may now appear. 


—SIMON NEWCOMB, address to the New York Mathematical Society, 
Bull. N.Y. Math. Soc., 3(1894) 95-107. 

Who are these four heads? See page 340.

$$\sup_{0 \le u \le 2\pi} g(u) = \operatorname*{ess\,sup}_{0 \le u \le 2\pi} g(u) = \sup_{|z| < 1} |F(z)|.$$


Solution by the proposer. Let $C_r$ denote the circle with radius $r$ and center 0. By Cauchy’s
integral formula, for $n$ a positive integer and $|z| < r < 1$,

$$|F(z)|^n = \Bigl|(2\pi i)^{-1}\int_{C_r} F(v)^n (v - z)^{-1}\,dv\Bigr| \le (2\pi)^{-1}(r - |z|)^{-1}\int_0^{2\pi} |F(re^{iu})|^n\,du.$$

So

$$|F(z)|^n \le (2\pi)^{-1}(1 - |z|)^{-1}\liminf_{r \to 1-}\int_0^{2\pi} |F(re^{iu})|^n\,du.$$

Now $|F(re^{iu})|^n \le h(u)^n$ for all $r < 1$, and it follows from the Lebesgue dominated convergence
theorem that

$$\liminf_{r \to 1-}\int_0^{2\pi} |F(re^{iu})|^n\,du \le \int_0^{2\pi} g(u)^n\,du.$$

Thus

$$|F(z)|^n \le (2\pi)^{-1}(1 - |z|)^{-1}\int_0^{2\pi} g(u)^n\,du \le (1 - |z|)^{-1}\Bigl(\operatorname*{ess\,sup}_{0 \le u \le 2\pi} g(u)\Bigr)^n$$

and $|F(z)| \le (1 - |z|)^{-1/n}\,\operatorname*{ess\,sup}_{0 \le u \le 2\pi} g(u)$. Let $n \to \infty$ to obtain
$|F(z)| \le \operatorname*{ess\,sup}_{0 \le u \le 2\pi} g(u)$. The rest is clear.


Tangents to a Circle and a Parabola 


6364 [1981, 711]. Proposed by the late K. B. Leisenring, University of Michigan 


A circle with center at the vertex and radius equal to the latus rectum meets a parabola at P, Q.
The circle and parabola have common tangents meeting the parabola at X, Y. Prove that XP, YQ
are tangent to the circle.


Solution by Anders Bager, Akelejevej 5, DK-9800 Hjorring, Denmark. The parabola $y^2 = x$ and
the circle $x^2 + y^2 = 1$ meet at $P = (a, \sqrt{a})$ and $Q = (a, -\sqrt{a})$ where $a = (\sqrt{5} - 1)/2$. The
tangent at the point $(t^2, t)$ on the parabola has equation $2ty = x + t^2$, and hence its distance
from the origin $O$ is $t^2(4t^2 + 1)^{-1/2}$. Setting this distance equal to 1 produces $X = (b, -\sqrt{b})$ and
$Y = (b, \sqrt{b})$ where $b = 2 + \sqrt{5}$. Hence $OP \cdot OX = 1$ and so

$$OP \cdot PX = OP \cdot (OX - OP) = 1 - 1 = 0.$$

Thus PX is tangent to the circle and, similarly, so also is YQ.


Also solved by 21 other readers. The problem turned out to be rather elementary. 
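
The computation in the solution is easy to confirm in floating point. The sketch below takes the coordinates exactly as given in the solution ($a = (\sqrt5 - 1)/2$, $b = 2 + \sqrt5$) and checks that the tangent distance equals 1 and that the dot products vanish; the helper names are ours and purely illustrative.

```python
import math

a = (math.sqrt(5) - 1) / 2
b = 2 + math.sqrt(5)
P, Q = (a, math.sqrt(a)), (a, -math.sqrt(a))
X, Y = (b, -math.sqrt(b)), (b, math.sqrt(b))

def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]

def sub(u, v):
    return (u[0] - v[0], u[1] - v[1])

def tangent_distance(t):
    """Distance from the origin to the tangent 2ty = x + t^2 at (t^2, t)."""
    return t * t / math.sqrt(4 * t * t + 1)

print(dot(P, P))                      # P lies on the unit circle
print(tangent_distance(math.sqrt(b))) # tangent at Y touches the circle
print(dot(P, sub(X, P)))              # OP is perpendicular to PX
print(dot(Q, sub(Y, Q)))              # OQ is perpendicular to QY
```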


ANSWER TO PHOTO ON PAGE 330 


It isn’t often that four heads of the same department (at different times) can stand next to each 
other and smile. From left to right: Heini Halberstam, Paul Bateman, Stewart Cairns, and Mahlon 
Day. The department is at the University of Illinois; the order of seniority is Cairns, Day, 
Bateman, and Halberstam (the present head). The picture was taken by Robert J. McEliece. 


NOTES 
EDITED BY SABRA S. ANDERSON, SHELDON AXLER, AND J. ARTHUR SEEBACH, JR. 


Material for this department should be sent to Professor J. Arthur Seebach, Jr., Department of Mathematics, St.
Olaf College, Northfield, MN 55057.


A UNIVERSAL ENTIRE FUNCTION 


CHARLES BLAIR 
Department of Business Administration, University of Illinois, Urbana, IL 61801 


LEE A. RUBEL 
Department of Mathematics, University of Illinois, Urbana, IL 61801 


THEOREM. There exists an entire function $f$ such that the set $\{f^{(n)} : n = 0, 1, 2, \ldots\}$ of all
derivatives of $f$ is dense in the space of all entire functions in the topology of uniform convergence on
compact subsets of the complex plane $\mathbf{C}$.


This means that for every compact set $K \subset \mathbf{C}$, every $\varepsilon > 0$, and every entire function $g$, there is
a derivative $f^{(n)}$ of $f$ so that $|f^{(n)}(z) - g(z)| < \varepsilon$ for all $z \in K$. We call such an $f$ “universal.” A
different kind of universality, namely, density of the set of integer translates, was treated in [BIR]
and [SEW]. For a treatment of density of linear combinations of certain derivatives, see [RUT].


Proof of Theorem. Let $P_1, P_2, P_3, \ldots$ be an enumeration of all the polynomials with rational
coefficients. Let $I$ be the integral operator $I(h)(z) = \int_0^z h(w)\,dw$. Represent repeated applications
of $I$ by $I^k$. Our function $f$ will have the form

$$f = \sum_{n=1}^{\infty} I^{K_n}(P_n),$$

where the $K_n$ are large positive integers chosen to satisfy the following. First we need $K_n > K_j + \deg P_j$
for $j = 1, 2, \ldots, n - 1$. Also, let $H_n = I^{K_n}(P_n)$. Then we require that

(*) $|H_n^{(j)}(z)| \le 2^{-n}$ for $j = 0, 1, 2, \ldots, K_{n-1}$ and $|z| \le n$.

If this can be done, then the series defining $f$ converges uniformly on compact sets and can be
differentiated term by term. Further,

$$f^{(K_n)}(z) = P_n(z) + E_n(z)$$

where $|E_n(z)| \le 2^{-n}$ for $|z| \le n$.

The result follows easily since any entire function $g$ is the uniform limit on compact sets of a
sequence of polynomials (the partial sums of its Taylor series) and since on any compact set, any
polynomial is uniformly approximated by a polynomial with rational coefficients.

To complete the proof we must show that $K_n$ can be chosen so that (*) is satisfied.

Now $I(z^r) = z^{r+1}/(r + 1)$ so that

$$|I^k(z^r)| = \left|\frac{z^{r+k}}{(r+1)\cdots(r+k)}\right| \le \frac{|z^r|\,|z^k|}{k!}.$$

On any fixed disk $\{|z| \le R\}$, $|z^r| \le R^r$ and $|z^k|/k!$ tends uniformly to zero on this disk as $k \to \infty$.

Therefore (*) is easy to achieve by taking $K_n$ sufficiently large, because $P_n$ is just a finite linear
combination of terms $z^r$. The proof is done.
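
The two facts that drive the proof, namely that differentiating $I^k(P)$ exactly $k$ times recovers $P$ and that $|I^k(z^r)|$ is at most $R^r R^k/k!$ on the disk $|z| \le R$, can be illustrated on coefficient lists. This is only a sketch in exact rational arithmetic; the representation of a polynomial as a list of coefficients (index equals power) and all function names are our own choices, not the authors'.

```python
from fractions import Fraction
from math import factorial

def integrate(coeffs):
    """I(h)(z) = integral of h from 0 to z, acting on a coefficient list."""
    return [Fraction(0)] + [c / (i + 1) for i, c in enumerate(coeffs)]

def differentiate(coeffs):
    """Ordinary derivative of a polynomial given by its coefficient list."""
    return [c * i for i, c in enumerate(coeffs)][1:] or [Fraction(0)]

def iterate(op, coeffs, k):
    """Apply op to coeffs k times."""
    for _ in range(k):
        coeffs = op(coeffs)
    return coeffs

def sup_bound(coeffs, R):
    """Upper bound sum |c_i| R^i for |p(z)| on the disk |z| <= R."""
    return sum(abs(c) * R**i for i, c in enumerate(coeffs))

P = [Fraction(1), Fraction(-2), Fraction(3)]      # 1 - 2z + 3z^2
H = iterate(integrate, P, 5)                      # I^5(P)
print(iterate(differentiate, H, 5) == P)          # the fifth derivative gives P back

monomial = [Fraction(0)] * 3 + [Fraction(1)]      # z^3
R = Fraction(2)
for k in (10, 20, 30):
    print(k, float(sup_bound(iterate(integrate, monomial, k), R)))
```

The printed bounds shrink rapidly with $k$, which is what allows $K_n$ to be chosen so that (*) holds.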


FROM LOSS OF MEMORY TO POISSON


BRUCE R. JOHNSON 
Department of Mathematics, University of Victoria, Victoria, B.C., Canada 


The purpose of this note is to share my favorite way of presenting the Poisson process and 
deriving the Poisson distribution for upper division courses in probability or mathematical 
statistics. Although the ingredients of this approach are well known, they are blended in ways 
different from standard textbook presentations. The main feature of our approach lies in the 
formulation of Poisson postulates that have immediate intuitive appeal, avoiding the usual 
analytic expressions 


$$P_1(h) = \lambda h + o(h) \quad\text{and}\quad \sum_{n=2}^{\infty} P_n(h) = o(h).$$

While the principal goal is to derive the Poisson distribution from a foundation of intuitively 
attractive postulates, our derivation also obtains the exponential and gamma waiting time 
distributions as important by-products. 

Our approach requires the “lack of memory” characterization of the exponential distribution,
and the probability distribution of the sum of independent and identically distributed (i.i.d.)
exponential random variables. The “lack of memory” characterization is an intriguing topic for lecture, but,
with appropriate hints, it could be assigned as a challenging homework problem. Finding the
distribution of the sum of i.i.d. exponential random variables is an ideal homework problem,
either to illustrate the power of generating function theory or to reinforce convolution properties
and the tool of mathematical induction.

The presentation begins with an intuitive definition of a Poisson process: a process that 
continues over time (or space) where certain rare “happenings” occur at random in time (or 
space) at some fixed average rate. Then several examples are given to develop understanding and 
to display wide applicability for the Poisson model. 

To formalize the definition of a Poisson process, we specify three postulates that are consistent 
with the intuitive definition. 


Poisson Postulates 


Stationarity: The probability distribution of the number of happenings in a specified time 
interval depends only on the length of the interval and not on its position. 

Independence: Events involving the number of happenings in disjoint time intervals are 
independent. 

Rareness: Beginning at any starting time there is certain to be a positive (but finite) wait
until the next happening, and multiple happenings cannot occur simultaneously.


Now, a Poisson process is defined formally to be any process that continues over time (or space) 
and satisfies the Poisson postulates. 


For a given Poisson process we reset the time clock to zero, let

$$T = \text{waiting time until the next happening},$$

and investigate the probability distribution of $T$. By the rareness postulate, $T$ is seen to be a
positive valued random variable. The tail probabilities $P(T > t)$ are positive for every $t > 0$. This
follows from the fact that

$$P(T > s) \uparrow P(T > 0) = 1 \quad\text{as } s \downarrow 0,$$

so we can find $s_0 > 0$ such that $P(T > s_0) > 0$, from which we use independence and stationarity
to conclude

$$P(T > ns_0) = P(\text{no happenings in } (0, ns_0]) = \bigl(P(\text{no happenings in } (0, s_0])\bigr)^n > 0$$

for all positive integers $n$. Also, $T$ has the “lack of memory” property because, for $s > 0$ and
$t > 0$, we obtain

$$P(T > s + t \mid T > t) = \frac{P(\text{no happenings in } (0, s + t])}{P(T > t)}
= \frac{P(\text{no happenings in } (0, t])\,P(\text{no happenings in } (0, s])}{P(T > t)} = P(T > s),$$

where the middle equality holds by independence and stationarity.
Therefore, $T$ must have the exponential distribution with parameter $\lambda = -\ln P(T > 1)$.


Letting $S_n = T_1 + T_2 + \cdots + T_n$ where $T_1 = T$ and $T_j$ denotes the waiting time between the
$(j - 1)$th and the $j$th happening, for $2 \le j \le n$, we use stationarity and independence to conclude
that $S_n$ is the sum of $n$ independent random variables each having the exponential($\lambda$) distribution.
Hence, the distribution of $S_n$ is given by the gamma density

$$f_n(x) = \frac{\lambda^n}{\Gamma(n)}\,x^{n-1}e^{-\lambda x}, \qquad x > 0.$$


If, for $t > 0$, we let

$$N(t) = \text{the number of happenings in time interval } (0, t],$$

then

$$P(N(t) = 0) = P(T > t) = e^{-\lambda t},$$

and, for $n = 1, 2, 3, \ldots$,

$$P(N(t) = n) = P(S_n \le t,\ S_{n+1} > t) = P(S_n \le t) - P(S_{n+1} \le t)$$

$$= \int_0^t \frac{\lambda^n}{\Gamma(n)}\,x^{n-1}e^{-\lambda x}\,dx - \int_0^t \frac{\lambda^{n+1}}{\Gamma(n+1)}\,x^{n}e^{-\lambda x}\,dx
= \frac{(\lambda t)^n e^{-\lambda t}}{n!},$$

from integration by parts applied to the first integral. Thus, whenever the Poisson postulates are
satisfied, $N(t)$ has the Poisson distribution with mean $\mu = \lambda t$. Since $E(N(1)) = \lambda$, we see that $\lambda$ is
just the average number of happenings per unit time (or space).
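
The last step, that the difference of two gamma distribution functions is the Poisson probability, can be checked numerically without integrating by parts. The sketch below integrates the gamma density with Simpson's rule (our choice of quadrature, with an arbitrary step count) and compares the result with $(\lambda t)^n e^{-\lambda t}/n!$; the parameter values are illustrative.

```python
import math

def gamma_density(x, n, lam):
    """Density of S_n, the sum of n i.i.d. exponential(lam) waiting times."""
    return lam**n / math.gamma(n) * x**(n - 1) * math.exp(-lam * x)

def simpson(f, a, b, steps=2000):
    """Composite Simpson rule on [a, b]; steps must be even."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def prob_count(n, t, lam):
    """P(N(t) = n) computed as P(S_n <= t) - P(S_{n+1} <= t), for n >= 1."""
    cdf_n = simpson(lambda x: gamma_density(x, n, lam), 0.0, t)
    cdf_next = simpson(lambda x: gamma_density(x, n + 1, lam), 0.0, t)
    return cdf_n - cdf_next

def poisson_pmf(n, t, lam):
    return (lam * t)**n * math.exp(-lam * t) / math.factorial(n)

for n in range(1, 6):
    print(n, prob_count(n, 2.0, 1.5), poisson_pmf(n, 2.0, 1.5))
```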


E 2995. Proposed by Miroslav D. Ašić, London School of Economics.


A square matrix of order n, n > 2, is said to be “good” if it is symmetric, invertible and all its 
entries are positive. What is the largest possible number of zero entries in the inverse of a “good” 
matrix? 


E 2996. Proposed by Phil Novinger and Daniel Oberlin, Florida State University. 


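Let $\sum_{n=1}^{\infty} a_n$ be a positive term convergent series and $r_n = \sum_{m=n}^{\infty} a_m$, $n = 1, 2, \ldots$.

(i) Show that if $0 < p < 1$, then there is an absolute constant $C_p$ such that

$$\sum_{n=1}^{\infty} \frac{a_n}{r_n^{\,p}} < C_p \Bigl(\sum_{n=1}^{\infty} a_n\Bigr)^{1-p}$$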
for all such series.
(ii) Find the best possible $C_p$.

E 2997. Proposed by Irving Adler, North Bennington, Vermont. 


Let $p_n$ be the perimeter of an inscribed regular $n$-gon in a unit circle, and let $d_k$ be the distance
from the center of the circle to the side of the inscribed regular $(2^k \cdot n)$-gon. Prove that

$$\frac{p_n}{2}\prod_{k=1}^{\infty}\frac{1}{d_k} = \pi.$$
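
A quick numerical experiment makes the identity in E 2997 plausible before one attempts a proof. The sketch below assumes the standard expressions $p_n = 2n\sin(\pi/n)$ and $d_k = \cos(\pi/(2^k n))$ for the unit circle (these are our assumptions, not part of the statement) and evaluates a partial product; the function name is illustrative.

```python
import math

def partial_product(n, K):
    """(p_n / 2) * prod_{k=1}^{K} 1/d_k for the unit circle."""
    p_n = 2 * n * math.sin(math.pi / n)      # perimeter of the regular n-gon
    value = p_n / 2
    for k in range(1, K + 1):
        value /= math.cos(math.pi / (2**k * n))   # apothem of the (2^k n)-gon
    return value

for n in (3, 4, 6, 10):
    print(n, partial_product(n, 30), math.pi)
```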


E 2998. Proposed by Clark Kimberling, University of Evansville. 
Suppose $x$ and $y$ are complex numbers such that $(x^n - y^n)/(x - y)$ is an integer for some four
consecutive positive integers $n$. Prove it is an integer for all positive integers $n$.

E 2999. Proposed by J. D. Shallit, University of California, Berkeley, and Karel Zikan, San
Jose State University.

Let $S$ be the set of nontrivial integer $k$th powers, i.e.,

$$S = \{n^k \mid n \ge 2,\ k \ge 2\} = \{4, 8, 9, 16, 25, 27, 32, 36, \ldots\}.$$

Evaluate $\sum (s - 1)^{-1}$, the sum being extended over all members $s$ of $S$.
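
Readers who want to guess the value in E 2999 before proving anything can tabulate partial sums. The following sketch lists each member of $S$ once (so that 16 and 64, for instance, are not counted twice) and adds $1/(s - 1)$ over all members up to a cutoff; the cutoff values and function names are arbitrary illustrative choices.

```python
def perfect_powers(limit):
    """All n**k with n >= 2, k >= 2 and n**k <= limit, each listed once."""
    found = set()
    n = 2
    while n * n <= limit:
        p = n * n
        while p <= limit:
            found.add(p)
            p *= n
        n += 1
    return sorted(found)

def partial_sum(limit):
    """Sum of 1/(s - 1) over the members s of S not exceeding limit."""
    return sum(1.0 / (s - 1) for s in perfect_powers(limit))

print(perfect_powers(36))
for limit in (10**2, 10**4, 10**6):
    print(limit, partial_sum(limit))
```

The partial sums increase slowly and stay a little below 1.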


E 3000. Proposed by P. Erdős, Hungarian Academy of Sciences.


Let $0 < u_1 < u_2 < \cdots$ be an infinite sequence; suppose $\sum 1/u_i$ converges. Denote by $f(x)$ the
number of pairs $(i, j)$ for which the partial sum $\sum_{k=i}^{j} u_k$ is $< x$. Prove that $\lim f(x)/x = 0$
$(x \to \infty)$.


SOLUTIONS OF PROBLEMS DEDICATED TO EMORY P. STARKE 
Permutations with f Fixed Points 


S 14* [1979, 503]. Proposed by C. L. Mallows, Bell Laboratories, Murray Hill, NJ. 


Let $n(m, f, r)$ be the number of arrangements $(a_1, a_2, \ldots, a_m)$ of $(1, 2, \ldots, m)$ that have $f$
fixed points ($a_i = i$) and $r$ rises ($a_i < a_{i+1}$). Prove (a) $n(m, 0, r) = n(m, 1, r)$ for $1 \le r \le m - 1$,
and (b) $n(m, f, r) =$


SO eyes samp iff +7 -1 m+ 1 _ym-f O41-¢ (m 
Bev Gey we nr i) pls] 


for $0 \le f \le r + 1 \le m$, $0 \le r$, where $\delta_k = 1$, $= 0$ if $k = 0$, $\ne 0$, respectively.


This problem has been solved by Ira Gessel, Massachusetts Institute of Technology. Part (a) can
be generalized as follows: Let $A_f$ be the set of arrangements of $(1, 2, \ldots, m)$ with $f$ fixed points.
For any subset $R$ of $\{1, 2, \ldots, m - 1\}$, let $B_R$ be the set of arrangements with $\{i \mid a_i < a_{i+1}\} = R$.
Then if $R$ is nonempty $|A_0 \cap B_R| = |A_1 \cap B_R|$. Part (b) can be generalized to the formula


a eee ee gee re) 


where E, = Phen p47), forO<r<m-—1. 


The solution will appear elsewhere. 
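
Part (a) is easy to test by brute force for small $m$. The sketch below simply enumerates all arrangements of $(1, 2, \ldots, m)$ and tabulates them by number of fixed points and number of rises; it is an exhaustive check for small cases, not Gessel's method, and the function name is ours.

```python
from collections import Counter
from itertools import permutations

def count_table(m):
    """Counter mapping (f, r) to n(m, f, r): f fixed points and r rises."""
    table = Counter()
    for a in permutations(range(1, m + 1)):
        f = sum(1 for i, v in enumerate(a, start=1) if v == i)
        r = sum(1 for i in range(m - 1) if a[i] < a[i + 1])
        table[(f, r)] += 1
    return table

for m in range(2, 7):
    t = count_table(m)
    print(m, [(t[(0, r)], t[(1, r)]) for r in range(1, m)])
```

Each printed pair has equal entries, in agreement with (a); for $r = 0$ the equality fails, since the decreasing arrangement has no rises and at most one fixed point.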


SOLUTIONS OF ELEMENTARY PROBLEMS
Series $\sum a_n$ and $\sum n a_n$


E 2902 [1981, 619]. Proposed by Lynn Sennott, Illinois State University.


Let $\sum a_n = a$ be a convergent series of nonnegative terms. Let $s_n$ be the $n$th partial sum. Show
that $\sum n a_n$ converges if and only if $\sum (a - s_n)$ converges.


Solution. Many solvers pointed out that the proposal is a version of Fubini’s theorem. Some
gave generalizations to Lebesgue-Stieltjes integrals; some replaced $\sum n a_n$ by $\sum n^{\alpha} a_n$, with
appropriate changes in partial sums. One of the more extensive generalizations was submitted by
J. R. Nurcombe, Hall Green Technical College, Birmingham, England.


THEOREM. Let $\sum a_n$ be a convergent series of real numbers. (See remarks at end.) Set $r_n = \sum_{k=n}^{\infty} a_k$.

Suppose one of (i), (ii), (iii) holds.

(i) $\{Q_n\}$ is a bounded sequence of real numbers.

(ii) (iia) $\sum_{k=1}^{\infty} |Q_{n+k}^{-1} - Q_{n+k-1}^{-1}| = O(|Q_n^{-1}|)$, and

(iib) $\sum_{k=1}^{n} |r_{k+1}^{-1} - r_k^{-1}| = O(|r_{n+1}^{-1}|)$, as $n \to \infty$.

(iii) $\{Q_{n-1} a_n\}$ are all positive; $\{|Q_n|\}$ is nondecreasing; and (iib) holds.

Define $q_n$ by the relation $Q_n = \sum_{k=0}^{n} q_k$.

Conclusion: $\sum_{n=1}^{\infty} Q_{n-1} a_n$ and $\sum_{n=0}^{\infty} q_n r_{n+1}$ converge to the same limit.


We use a well-known lemma (K. Knopp, Theory and application of infinite series, Hafner, New
York, 1971, p. 130). If $\{p_n\}$ is a sequence of real numbers with $|p_n| \to \infty$ $(n \to \infty)$, and if
$\sum_{k=1}^{n} |p_k - p_{k-1}| = O(|p_n|)$, and if $\sum a_n$ converges, then $\sum_{k=0}^{n} p_k a_k = o(p_n)$. We also need a lemma
that seems to be new.


LEMMA. If $\{m_n\}$ is a real sequence, if $M_n = \sum_{k=0}^{n} m_k$, and if $\sum_{k=1}^{\infty} |m_{n+k}| = O(|M_n|)$ $(n \to \infty)$, and
if $\sum a_n$ converges, then $\sum_{k=n}^{\infty} M_k a_k = o(M_n)$.

Proof of the lemma. We have

$$\sum_{k=n}^{\infty} M_k a_k = \sum_{k=n}^{\infty} M_k (r_k - r_{k+1}) = M_n r_n + \sum_{k=1}^{\infty} r_{n+k}(M_{n+k} - M_{n+k-1}) = M_n r_n + \sum_{k=1}^{\infty} m_{n+k} r_{n+k}.$$

Hence

$$\sum_{k=n}^{\infty} M_k a_k / M_n = r_n + \sum_{k=1}^{\infty} m_{n+k} r_{n+k} / M_n.$$

Since $\sum a_n$ converges, we know that $r_n \to 0$ $(n \to \infty)$. Also,

$$\Bigl|\sum_{k=1}^{\infty} m_{n+k} r_{n+k} / M_n\Bigr| \le \sum_{k=1}^{\infty} |m_{n+k} r_{n+k} / M_n| \le \Bigl(\max_{k \ge 1} |r_{n+k}|\Bigr)\Bigl(\sum_{k=1}^{\infty} |m_{n+k}| / |M_n|\Bigr).$$

The second factor here is bounded by hypothesis, and the first factor $\to 0$ for every $k$ (as $n \to \infty$).
The lemma is proved.


Proof of the theorem. For all three parts (i), (ii), (iii), we need the identity

(*) $\sum_{k=1}^{n} Q_{k-1} a_k = \sum_{k=0}^{n} q_k r_{k+1} - Q_n r_{n+1}.$


(i) Note $r_n \to 0$. Since $\{Q_n\}$ is bounded, $Q_n r_{n+1} \to 0$. Thus (*) shows that if $\sum_{n=1}^{\infty} Q_{n-1} a_n$
converges, then so does $\sum_{n=0}^{\infty} q_n r_{n+1}$ (and conversely), and to the same sum.

(ii) Suppose $\sum q_n r_{n+1}$ converges to $L$. In the “well-known” lemma above, set $p_n = 1/r_{n+1}$;
then

$$\sum_{k=0}^{n} q_k r_{k+1} \cdot \frac{1}{r_{k+1}} = o(1/r_{n+1}).$$

This is tantamount to $Q_n r_{n+1} \to 0$. From (*), $\sum_{n=1}^{\infty} Q_{n-1} a_n$ converges to $L$. For the converse, if
$\sum Q_{n-1} a_n$ converges, take $M_n = 1/Q_{n-1}$ in the second lemma. Then

$$\sum_{k=n+1}^{\infty} Q_{k-1} a_k / Q_{k-1} = o(1/Q_n),$$

so that $Q_n r_{n+1} \to 0$. The conclusion follows from (*).

(iii) If $\sum q_n r_{n+1}$ converges, we repeat the pertinent (first) half of (ii). For the converse, if
$\sum Q_{n-1} a_n$ converges, the inequalities

$$|Q_n r_{n+1}| = |Q_n|\,\Bigl|\sum_{k=n+1}^{\infty} a_k\Bigr| \le |Q_n| \sum_{k=n+1}^{\infty} |a_k| \le \sum_{k=n+1}^{\infty} |Q_{k-1}|\,|a_k| = \sum_{k=n+1}^{\infty} Q_{k-1} a_k$$

hold, because $\{|Q_n|\}$ is nondecreasing and $\{Q_{n-1} a_n\}$ is positive. Thus $Q_n r_{n+1} \to 0$. The identity
(*) completes the proof.


Remarks. The result of proposal E 2902 follows from (ii) or (iii) if $q_n = 1$.

The condition $\sum_{k=1}^{n} |r_{k+1}^{-1} - r_k^{-1}| = O(|r_{n+1}^{-1}|)$ allows the sequence $\{a_n\}$ to be of mixed sign.
For example, any convergent geometric series with negative ratio does satisfy the condition.

The conditions of the two lemmas are necessary as well as sufficient for the respective 
conclusions. 

With minor changes, the theorem continues to hold for a sequence of complex numbers. 


Also solved by 117 readers. 
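
The summation-by-parts identity (*), in the form $\sum_{k=1}^{n} Q_{k-1}a_k = \sum_{k=0}^{n} q_k r_{k+1} - Q_n r_{n+1}$ with $r_n = \sum_{k \ge n} a_k$ and $Q_n = \sum_{k=0}^{n} q_k$, can be checked exactly on a finitely supported sequence. The sketch below uses rational arithmetic; the sample sequences and function names are arbitrary and serve only as an illustration. With $q_k = 1$ the left side is $\sum k a_k$, which is the setting of the original proposal.

```python
from fractions import Fraction

def tails(a):
    """r[k] = a[k] + a[k+1] + ... for a sequence a[1..N] (a[0] is unused)."""
    N = len(a) - 1
    r = [Fraction(0)] * (N + 2)
    for k in range(N, 0, -1):
        r[k] = r[k + 1] + a[k]
    return r

def identity_sides(a, q, n):
    """Both sides of (*) for a[1..N], q[0..N] and 1 <= n <= N."""
    r = tails(a)
    Q, running = [], Fraction(0)
    for qk in q:
        running += qk
        Q.append(running)
    left = sum(Q[k - 1] * a[k] for k in range(1, n + 1))
    right = sum(q[k] * r[k + 1] for k in range(0, n + 1)) - Q[n] * r[n + 1]
    return left, right

a = [Fraction(0), Fraction(1, 2), Fraction(-1, 3), Fraction(1, 5), Fraction(2, 7)]
q = [Fraction(1)] * 5                       # q_k = 1, so Q_{k-1} a_k = k a_k
for n in range(1, 5):
    print(n, identity_sides(a, q, n))
```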


Recoverable Primes 


E 2904 [1981, 619]. Proposed by D. Hensley, Texas A&M University. 


Let $p > 2$ be prime. Call $x$ recoverable if $x^{2^v} \equiv x \bmod p$ for some $v \ge 1$. Show that the set of
recoverable $x$ is permuted by squaring.


Solution by D. M. Bloom, Brooklyn College. The case $x = 0$ is easy: $0^2 = 0$. For $x \ne 0$, “$x$
recoverable” means that the residue class of $x$ (mod $p$) has odd order in the group $Z_p^*$ of nonzero
residue classes: $x^{2^v - 1} = 1$.


LEMMA. The elements of odd order in any group are permuted by squaring ($f: x \to x^2$).


Proof. If $x$ has order $2m + 1$, then (a) $x^2$ has order $2m + 1$, (b) $x^{m+1}$ has order $2m + 1$ and
its square is $x$, and (c) if $y^2 = x$ where $y$ has odd order, then necessarily $y^{2m+1} = 1$ and hence
$y = y^{2(m+1)} = x^{m+1}$. Statements (a), (b) and (c) establish the assertion.


Brandler noted that if $G$ is a group and if $(m, |G|) = 1$, then $f: G \to G$ ($f(g) = g^m$) is bijective.
Cater and Erlebach noted that the assertion is true in any field.


Also solved by R. Beigel (student), J. Brandler, J. Browkin (Poland), K. A. Brown, P. S. Bruckman, P. Butzmann 
(Germany), F. S. Cater, Chico Problem Group, G. Dedaj (student), J. Deutsch, C. W. Dodge, H. M. Edgar, L. 
Erlebach, P. Flusser, L. L. Foster, C. V. Heuer, J. Hook (student), Y. Ikeda, W. Janous (Austria), M. Josephy (Costa 
Rica), O. P. Lossers (Netherlands), J. M. Masley, D. E. Orr, J. P. Robertson, J. Shallit, D. Singmaster (U.K.), L. 
Somer, K. Spindler (Germany), A. Stenger, G. Stutzin (student), M. B. Suryanarayana, R. Teitler (U.K.), E. Trost 
(Switzerland), D. G. Weinman, M. Woltermann, K. Zikan, and the proposer. 
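
For small primes the statement of E 2904 can be confirmed by direct search. The sketch below computes the recoverable residues by repeated squaring and checks that squaring maps this set onto itself; the search bound of $p$ squarings suffices because a residue that returns to itself under squaring does so within $p$ steps. The function name is ours.

```python
def recoverable(p):
    """Residues x mod p with x**(2**v) == x (mod p) for some v >= 1."""
    result = set()
    for x in range(p):
        y = x
        for _ in range(p):            # v = 1, 2, ..., p
            y = y * y % p
            if y == x:
                result.add(x)
                break
    return result

for p in (3, 5, 7, 11, 13, 17, 31):
    R = recoverable(p)
    print(p, sorted(R), {x * x % p for x in R} == R)
```

For $p = 17$ only 0 and 1 are recoverable, since the group of nonzero residues has order 16 and therefore no element of odd order other than the identity.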


A Sequence of Angles in a Triangle Converging to $\pi/3$


E 2906 [1981, 620]. Proposed by Jack Garfunkel, Flushing, NY.


Let $I$ be the incenter of triangle $ABC$. $A'$, $B'$, $C'$ are the intersections of $AI$, $BI$, $CI$ with the
incircle of $ABC$. Continue the process by defining $I'$ (incenter of $A'B'C'$), then $A''B''C''$, etc.
Prove that the angles of triangle $A^{(n)}B^{(n)}C^{(n)}$ approach $\pi/3$.


Solution by Clayton W. Dodge, University of Maine at Orono. From triangle $IAC$ we have that

$$\angle AIC = \pi - A/2 - C/2 = (\pi + B)/2,$$

so that $B' = \angle A'B'C' = \tfrac12 \angle AIC = (\pi + B)/4$. Similar relations hold for $A'$ and $C'$. Assuming
$A \le B \le C$, then

$$A' = \tfrac14(\pi + A) \le B' = \tfrac14(\pi + B) \le C' = \tfrac14(\pi + C), \quad\text{and}\quad C' - A' = (C - A)/4,$$

so triangle $A'B'C'$ is “4 times closer” to equilateral than triangle $ABC$ is. The theorem follows.


Also solved by A. Bager (Denmark), N. M. Bejlegaard (Denmark), P. S. Bruckman, G. Dedaj (student), J. Dou 
(Spain), J. Heuver (Canada), W. Janous (Austria), P. Kumar (India), O. P. Lossers (Netherlands), E. Nadler, W. A. 
Newcomb, R. E. Spaulding, J. M. Stark, D. Tsakmakis (Greece), G. H. Wenzel (Germany), and the proposer. 
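
The contraction in the solution, each angle $A$ being replaced by $(\pi + A)/4$, is simple to iterate. The sketch below starts from an arbitrary triangle (the starting angles are an illustrative choice) and shows the angle sum staying at $\pi$ while the spread shrinks by a factor of 4 at each step.

```python
import math

def next_angles(angles):
    """One step of the construction: each angle A becomes (pi + A)/4."""
    return tuple((math.pi + x) / 4 for x in angles)

def iterate_angles(angles, steps):
    for _ in range(steps):
        angles = next_angles(angles)
    return angles

start = (0.3, 1.0, math.pi - 1.3)
angles = start
for step in range(6):
    print(step, angles, max(angles) - min(angles))
    angles = next_angles(angles)
```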


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor G. L. Alexanderson, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by September 30, 1983. The solver’s full 
post-office address should be on each sheet. 


6427. Proposed by Fred Galvin, University of Kansas. 


Suppose that S is a semigroup, and there are elements a and b in S such that (1) for each x in S 
there is a unique y in S such that xy = a, and (2) for each x in S there is a unique y in S such that 
yx = b. Show that S is a group. 

6428. Proposed by M. Steele, University of Chicago. 


Let $e$ be a separation of $\{1, 2, \ldots, 2n\}$ into two disjoint classes $a_1 < a_2 < \cdots < a_n$ and
$b_1 < b_2 < \cdots < b_n$. Let $M_k$ denote the number of solutions of $a_i - b_j = k$, $-2n < k < 2n$, and
put $M(n) = \min_e \max_k M_k$. Selfridge, Motzkin, and Ralston showed $\liminf n^{-1}M(n) < .4$,
and Moser showed $\limsup n^{-1}M(n) > .357$. Prove that the limit $\lim n^{-1}M(n)$ actually exists.


6429. Proposed by B. Powell, Kirkland, WA. 


Prove that for any positive integer $M$ there exists an even positive integer $n_0 = n_0(M)$ such
that the number of pairs of consecutive primes $(P_i, P_{i+1})$ that differ by exactly $n_0$ is greater
than $M$.


SOLUTIONS OF ADVANCED PROBLEMS 
A Trace Inequality 


6360 [1981, 623]. Proposed by Emeric Deutsch, Polytechnic Institute of New York. 
Let $A$ be a nonnegative irreducible $n \times n$ matrix with Perron root $r$. Show that

$$\operatorname{tr}\bigl[(rI - A)^T \operatorname{adj}(rI - A)\bigr] \le 0,$$

where $I$ is the $n \times n$ identity matrix, adj $X$ is the classical adjoint of $X$, $X^T$ is the transpose of $X$,
and tr $X$ is the trace of $X$. Find necessary and sufficient conditions for equality to hold.
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Solution by the proposer. Let R” denote the vector space of real n-tuples. For y € R", we denote 
by y, the ith component of y and by y > 0 we mean that y, > 0 fori = 1,2,..., 7. 

Friedland and Karlin have proved [Duke Math. J., 42 (1975) 459-490, Theorem 3.1] that if $A$ is
a nonnegative irreducible $n \times n$ matrix with Perron root $r$ and if
$$Au = ru, \quad A^{T}v = rv, \quad u, v \in \mathbb{R}^{n}, \quad u > 0, \quad v > 0, \quad u^{T}v = 1,$$
then
$$r \le \prod_{i=1}^{n} \left( \frac{(Ax)_i}{x_i} \right)^{u_i v_i} \tag{1}$$
whenever $x \in \mathbb{R}^{n}$, $x > 0$. Equality holds in (1) if and only if $x$ and $u$ are linearly dependent. By
the arithmetic-geometric mean inequality, we have
$$\prod_{i=1}^{n} \left( \frac{(Ax)_i}{x_i} \right)^{u_i v_i} \le \sum_{i=1}^{n} u_i v_i \frac{(Ax)_i}{x_i}$$
with equality if and only if all the $(Ax)_i / x_i$ are equal, i.e., if and only if $x$ is a positive eigenvector
of $A$, i.e., if and only if $x$ and $u$ are linearly dependent. Consequently, a corollary of (1) is
$$r \le \sum_{i=1}^{n} u_i v_i \frac{(Ax)_i}{x_i} \quad \text{for all } x > 0 \tag{2}$$
with equality if and only if $x$ and $u$ are linearly dependent. Taking $x = v$ in (2), we obtain
$$r \le \sum_{i=1}^{n} u_i (Av)_i = u^{T}Av \tag{3}$$
with equality if and only if $u$ and $v$ are linearly dependent.
It is known that
$$B := \operatorname{adj}(rI - A) = \beta u v^{T} \quad \text{for some } \beta > 0.$$
Then, by (3), we have
$$\begin{aligned}
\operatorname{tr}\bigl[(rI - A)^{T} B\bigr] &= \beta \operatorname{tr}\bigl[(rI - A)^{T} u v^{T}\bigr] \\
&= \beta \bigl((rI - A)^{T} u\bigr)^{T} v = \beta u^{T}(rI - A)v = \beta (r - u^{T}Av) \le 0.
\end{aligned}$$
Here we have made use of the fact that the rank one matrix $ab^{T}$ has trace equal to $a^{T}b$.

Equality holds if and only if $u$ and $v$ are linearly dependent, i.e., if and only if $A$ and $A^{T}$ have a
common eigenvector corresponding to $r$. This in turn holds if and only if $B := \operatorname{adj}(rI - A)$ is
symmetric. It is easy to see that this condition is sufficient, since when it holds
$$(rI - A)^{T} B = [B(rI - A)]^{T} = 0.$$
In particular, equality holds whenever $A$ is normal or generalized doubly stochastic.
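
As a numerical sanity check of inequality (3), and not as part of the proposer's argument, the following sketch estimates the Perron root $r$ and the vectors $u$, $v$ by power iteration and evaluates the gap $u^{T}Av - r$. The function names and the two sample matrices are illustrative assumptions, and power iteration is relied on only because both examples are entrywise positive.

```python
def mat_vec(a, x):
    """Product of a square matrix (list of rows) with a vector."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in a]


def transpose(a):
    return [list(col) for col in zip(*a)]


def perron(a, iters=500):
    """Power iteration for an entrywise positive matrix.

    Returns (r, x): an estimate of the Perron root and a positive
    eigenvector whose components sum to 1.
    """
    x = [1.0 / len(a)] * len(a)
    for _ in range(iters):
        y = mat_vec(a, x)
        s = sum(y)
        x = [y_i / s for y_i in y]
    r = sum(mat_vec(a, x))  # equals r * sum(x) = r at convergence
    return r, x


def perron_gap(a):
    """Return u^T A v - r with A u = r u, A^T v = r v and u^T v = 1."""
    r, u = perron(a)
    _, v = perron(transpose(a))
    scale = sum(u_i * v_i for u_i, v_i in zip(u, v))
    v = [v_i / scale for v_i in v]  # enforce the normalization u^T v = 1
    av = mat_vec(a, v)
    return sum(u_i * av_i for u_i, av_i in zip(u, av)) - r


if __name__ == "__main__":
    print(perron_gap([[1.0, 2.0], [3.0, 4.0]]))  # nonsymmetric: positive gap
    print(perron_gap([[2.0, 1.0], [1.0, 3.0]]))  # symmetric: gap near zero
```

For the symmetric example $u$ and $v$ are proportional, so the gap vanishes up to rounding, which matches the equality condition stated above.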


Bounds of Analytic Functions 


6362 [1981, 710]. Proposed by F. S. Cater, Portland State University. 


Let $F(z)$ be an analytic function for $|z| < 1$. For each number $u$, $0 \le u \le 2\pi$, let
$$h(u) = \sup_{0 \le t < 1} |F(te^{iu})| \quad \text{and} \quad g(u) = \limsup_{t \to 1^-} |F(te^{iu})|.$$
Suppose that for each positive integer $n$, $\int_{0}^{2\pi} h(u)^{n}\,du < \infty$ in the Lebesgue sense. (This would
happen, for example, if $F$ were bounded.) Prove that
$$\sup_{0 \le u \le 2\pi} g(u) = \operatorname*{ess\,sup}_{0 \le u \le 2\pi} g(u) = \sup_{|z| < 1} |F(z)|.$$


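Solution by the proposer. Let $C_r$ denote the circle with radius $r$ and center 0. By Cauchy’s
integral formula, for $n$ a positive integer and $|z| < r < 1$,
$$|F(z)|^{n} = \left| (2\pi i)^{-1} \int_{C_r} F(w)^{n} (w - z)^{-1}\,dw \right| \le (2\pi)^{-1} (r - |z|)^{-1} \int_{0}^{2\pi} |F(re^{iu})|^{n}\,du.$$
So
$$|F(z)|^{n} \le (2\pi)^{-1} (1 - |z|)^{-1} \liminf_{r \to 1^-} \int_{0}^{2\pi} |F(re^{iu})|^{n}\,du.$$
Now $|F(re^{iu})|^{n} \le h(u)^{n}$ for all $r < 1$, and it follows from the Lebesgue dominated convergence
theorem that
$$\liminf_{r \to 1^-} \int_{0}^{2\pi} |F(re^{iu})|^{n}\,du \le \int_{0}^{2\pi} g(u)^{n}\,du.$$
Thus
$$|F(z)|^{n} \le (2\pi)^{-1} (1 - |z|)^{-1} \int_{0}^{2\pi} g(u)^{n}\,du \le (1 - |z|)^{-1} \Bigl( \operatorname*{ess\,sup}_{0 \le u \le 2\pi} g(u) \Bigr)^{n}$$
and $|F(z)| \le (1 - |z|)^{-1/n} \operatorname{ess\,sup}_{0 \le u \le 2\pi} g(u)$. Let $n \to \infty$ to obtain $|F(z)| \le \operatorname{ess\,sup}_{0 \le u \le 2\pi} g(u)$.
The rest is clear.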


Tangents to a Circle and a Parabola 


6364 [1981, 711]. Proposed by the late K. B. Leisenring, University of Michigan 


A circle with center at the vertex and radius equal to the latus rectum meets a parabola at P, Q.
The circle and parabola have common tangents meeting the parabola at X, Y. Prove that XP, YQ
are tangent to the circle.


Solution by Anders Bager, Akelejevej 5, DK-9800 Hjorring, Denmark. The parabola $y^{2} = x$ and
the circle $x^{2} + y^{2} = 1$ meet at $P = (a, \sqrt{a})$ and $Q = (a, -\sqrt{a})$ where $a = (\sqrt{5} - 1)/2$. The
tangent at the point $(t^{2}, t)$ on the parabola has equation $2ty = x + t^{2}$, and hence its distance
from the origin $O$ is $t^{2}(4t^{2} + 1)^{-1/2}$. Setting this distance equal to 1 produces $X = (b, -\sqrt{b})$ and
$Y = (b, \sqrt{b})$ where $b = 2 + \sqrt{5}$. Hence $OP \cdot OX = 1$ and so
$$OP \cdot PX = OP \cdot (OX - OP) = 1 - 1 = 0.$$
Thus PX is tangent to the circle and, similarly, so also is YQ.


Also solved by 21 other readers. The problem turned out to be rather elementary. 
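
The coordinates in the solution above can be checked numerically. The sketch below is only an illustration: the helper names are assumptions, and it merely confirms in floating point that $P$ lies on the circle, that the tangent to the parabola at $(b, \sqrt{b})$ is at distance 1 from the origin, and that $OP \perp PX$ and $OQ \perp QY$.

```python
import math


def dot(p, q):
    return p[0] * q[0] + p[1] * q[1]


def sub(p, q):
    return (p[0] - q[0], p[1] - q[1])


a = (math.sqrt(5) - 1) / 2
b = 2 + math.sqrt(5)
P, Q = (a, math.sqrt(a)), (a, -math.sqrt(a))
X, Y = (b, -math.sqrt(b)), (b, math.sqrt(b))


def tangency_residuals():
    """Absolute errors of the four facts used in the solution."""
    on_circle = abs(dot(P, P) - 1)            # P lies on x^2 + y^2 = 1
    t = math.sqrt(b)                          # parameter of the tangent at Y
    distance = abs(t * t / math.sqrt(4 * t * t + 1) - 1)
    px = abs(dot(P, sub(X, P)))               # OP . PX = 0
    qy = abs(dot(Q, sub(Y, Q)))               # OQ . QY = 0
    return on_circle, distance, px, qy


if __name__ == "__main__":
    print(tangency_residuals())
```

Since a line through a point of a circle is tangent exactly when it is perpendicular to the radius at that point, the two vanishing dot products are the tangency statements for PX and YQ.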


ANSWER TO PHOTO ON PAGE 330 


It isn’t often that four heads of the same department (at different times) can stand next to each 
other and smile. From left to right: Heini Halberstam, Paul Bateman, Stewart Cairns, and Mahlon
Day. The department is at the University of Illinois; the order of seniority is Cairns, Day, 
Bateman, and Halberstam (the present head). The picture was taken by Robert J. McEliece. 


Exponential Series


6365 [1981, 711]. Proposed by Alan Wilde, University of Michigan. 


For integers $n > h \ge 0$, define $\exp_h(z) = \sum z^{k}/k!$, where the sum is over all nonnegative
integers $k \equiv h \pmod{n}$.

(i) Assuming Fermat’s Last Theorem, show that if $n > 2$, there is no complex number $z$ such
that $\exp_0(z)$ and $\exp_1(z)$ are nonzero rational numbers while $\exp_2(z) = \cdots = \exp_{n-1}(z) = 0$.

(ii)* Is Fermat’s Last Theorem needed? 


Solution by R. G. E. Pinch, Mathematical Institute, University of Oxford. The following two 
theorems appear in A. Baker, Transcendental Number Theory, Cambridge University Press, 1975. 


THEOREM 12.1. If both $\xi_1, \xi_2$ and $\eta_1, \eta_2$ are linearly independent over $\mathbb{Q}$, then two at least of
$\xi_i$, $\eta_j$, $\exp(\xi_i \eta_j)$ $(i, j = 1, 2)$ are transcendental.


THEOREM 1.4. If $\alpha_1, \ldots, \alpha_k$ are distinct algebraic numbers and $\beta_1, \ldots, \beta_k$ are algebraic numbers,
not all zero, then
$$\sum_{i=1}^{k} \beta_i \exp(\alpha_i) \ne 0.$$
The problem is now solved by the following theorem, the proof of which does not require
Fermat’s Last Theorem to be assumed.

THEOREM. If $n > 2$ and $z \ne 0$, then $\exp_0(z), \ldots, \exp_{n-1}(z)$ cannot all be algebraic.


Proof. Suppose they are. Then the numbers
$$\exp(\omega^{i} z) = \sum_{h=0}^{n-1} \omega^{ih} \exp_h(z) \qquad (i = 0, 1, 2, \ldots),$$
where $\omega$ is a primitive $n$th root of unity, are all algebraic as well.

Firstly, $z$ cannot be a rational multiple of $\omega$, for if so, $z$ and $\exp(z)$ are both algebraic, which
contradicts Theorem 1.4.

But now, $\xi_1 = 1$, $\xi_2 = \omega$ and $\eta_1 = \omega$, $\eta_2 = z$ are linearly independent over $\mathbb{Q}$ and so by
Theorem 12.1 at least two of $z$, $\omega$, $\exp(\omega z)$ are transcendental, a contradiction.


Also solved by Robert Breusch, E. G. Straus, William C. Waterhouse, and the proposer. 
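
The identity $\exp(\omega^{i} z) = \sum_{h} \omega^{ih} \exp_h(z)$ used in the proof can be checked numerically. The following is a minimal sketch under stated assumptions: the function names and the truncation at 80 terms are illustrative choices, suitable only for moderate $|z|$.

```python
import cmath


def exp_h(z, h, n, terms=80):
    """Partial sum of z**k / k! over k < terms with k congruent to h mod n."""
    total = 0j
    term = 1 + 0j                      # z**0 / 0!
    for k in range(terms):
        if k % n == h:
            total += term
        term = term * z / (k + 1)      # advance to z**(k+1) / (k+1)!
    return total


def max_identity_error(z, n):
    """Largest |exp(w**i z) - sum_h w**(i h) exp_h(z)| over i = 0..n-1."""
    omega = cmath.exp(2j * cmath.pi / n)   # a primitive nth root of unity
    parts = [exp_h(z, h, n) for h in range(n)]
    worst = 0.0
    for i in range(n):
        lhs = cmath.exp(omega ** i * z)
        rhs = sum(omega ** (i * h) * parts[h] for h in range(n))
        worst = max(worst, abs(lhs - rhs))
    return worst


if __name__ == "__main__":
    print(max_identity_error(0.7 + 0.3j, 5))
```

The case $i = 0$ is the statement that the $n$ partial series add up to the ordinary exponential.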


REVIEWS 
EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER, CARLETON COLLEGE 
13 Lectures on Fermat’s Last Theorem. By Paulo Ribenboim. Springer-Verlag, New York, 1979.
xvi + 302 pp.


DAVID R. HAYES 
Department of Mathematics and Statistics, University of Massachusetts, Amherst, MA 01002 


While browsing in an antique store one afternoon, I came across a collection of old comic 
books. Among them I found one entitled “Donald Duck in Mathmagic Land.” As I recall, the
story began with Donald in a sweat over the disarray of his personal finances. Most troublesome
was a large loan from his Uncle Scrooge which must be repaid soon. Suddenly a caped figure 
appeared, the Spirit of Mathematics, and whisked Donald away to a magic land where the flora 
and fauna were twisted into geometric and numerical shapes. Although I don’t now remember 
seeing it, somewhere in that surrealistic landscape there must have been an irregular prime 
chirping from its nest in a Bernoulli number. 

You can discover your own wide path to Mathmagic Land in these lectures of Ribenboim. 
From the moment he states Fermat’s Last “Theorem,” 


The equation $x^{n} + y^{n} = z^{n}$, where $n$ is a natural number greater than 2, has no solution in
integers all different from 0,


in the first lecture, Ribenboim provides the reader with a varied, amusing and enlightening flow of 
information. Did you know that Fermat’s copy of the complete works of Diophantus with the 
most famous margin in scientific history is lost and that we must depend upon a secondary source 
for the quote, “... I have discovered a truly remarkable proof which this margin is too small to 
contain?” Did you know that if a counterexample (x, y,z) to FLT exists, then x must be 
magnitudes larger than the number of atomic nuclei which could be packed into the Universe? 
These two bits of information are sampled from the historical survey in Lecture I and the 
statement of the latest results on FLT in Lecture II. Both these initial lectures are accessible to 
anyone who remembers his high school mathematics. 

In Lecture III, entitled B.K.= Before Kummer, Ribenboim reproduces the proof given by 
Fermat himself for the case n = 4. After n = 4, one need only consider n = p, where p is an odd 
prime, and Ribenboim more or less restricts himself to that case in the subsequent lectures. He 
concludes Lecture III by settling FLT for the exponents n = p = 3, 5 and 7. 

The next few lectures form the heart of the book. Beginning with The Naive Approach (Lecture 
IV), Ribenboim proceeds to survey the fruits of three centuries of intense effort. Here it all is: The 
astounding array of necessary conditions which must be satisfied if there is a solution in which p 
does not divide xyz, the so-called “first case” of FLT; the surprising connections with Bernoulli 
numbers which were spaded out of the problem by the Herculean labors of Kummer; and the 
extensive numerical calculations carried out first by hand and then, in modern times, electronically,
culminating in the recent results of Wagstaff which verify the truth of FLT for all odd prime
exponents less than 125,000. The Fermat Problem has clearly absorbed an enormous amount of 
intellectual capital! 

A naive reader, floating along on this flood of results, may well begin to feel lost in a 
mathmagical fog. Ribenboim does not directly address questions like, “Why is FLT important?”
or, “Where is all this leading?” Indeed, somewhere in the middle of Lecture V, the naive reader 
may conclude that people are interested in FLT mainly as a sport. Ribenboim’s style tends to 
reinforce this idea. For example, in Lecture VIII, as though describing the appearance of a new 
slugger in a history of baseball, Ribenboim writes: “Then came Wieferich ... Few people were 
able to understand how Wieferich succeeded, like a magician, in unravelling from very complicated
formulas, so simple and beautiful a criterion as:


If the first case fails for the exponent $p$, then $p$ must satisfy the congruence
$2^{p-1} \equiv 1 \pmod{p^{2}}$.”


By the way, there are many other names in the FLT Hall of Fame, all conveniently listed with 
their papers in extensive bibliographies at the end of each lecture. 
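
To give a feeling for how restrictive Wieferich's congruence $2^{p-1} \equiv 1 \pmod{p^{2}}$ is, here is a small illustrative search. It is a sketch, not taken from the book under review: the function names and the search bound are assumptions, and trial division is used only because the range is tiny.

```python
def is_prime(n):
    """Trial-division primality test, adequate for small n."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True


def wieferich_primes(limit):
    """Primes p < limit with 2**(p-1) congruent to 1 modulo p**2."""
    return [p for p in range(2, limit)
            if is_prime(p) and pow(2, p - 1, p * p) == 1]


if __name__ == "__main__":
    print(wieferich_primes(4000))  # [1093, 3511]
```

Only two primes below 4000 pass the test, so for every other odd prime exponent in that range the first case of FLT holds by Wieferich's criterion.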

In his A Mathematician’s Apology, G. H. Hardy attempts to define a deep sense in which a 
theorem or problem is “serious.” In Lectures V-VII, Ribenboim describes Kummer’s Monument, 
the work which in the middle 1800’s penetrated deeply into the problem and in the process 
founded the theory of algebraic numbers. The committed reader, naive or not, cannot complete
these three lectures without feeling that he has passed through some profound depths. Like a
maturing love affair, the Fermat Problem suddenly becomes “serious.” The level of exposition in 
these lectures is very high. The author skillfully sketches the necessary background in the theory of 
cyclotomic fields, providing an understanding of Kummer’s ideas and techniques which is 
accessible to a wide audience. Ribenboim does not give complete proofs of the major theorems, 
but he does give a detailed outline of how the proofs go and of the difficulties which Kummer 
faced and overcame. 

In Lecture IX, Ribenboim indicates how class field theory can be used to improve both the 
results he has presented in previous lectures and one’s understanding of why they are true. Even 
the reader without background in class field theory can gain insight from this lecture. Ribenboim 
sketches modern developments related to FLT in Lecture X. In particular, he mentions the 
Iwasawa theory and the connections to points of finite order on elliptic curves. In Lecture XII and 
the amusing Lecture XIII, Variations and Fugue on a Theme, Ribenboim reports on what happens 
when mathematicians invoke their license to modify a problem when they cannot solve it. For 
example, “Variation I (In the Tone of Polynomial Functions)” considers solutions to the Fermat
equation in the ring of polynomials over a base field K. In work too recent to be included in these 
lectures, David Goss has shown that when K is finite, there is a much more profound analogue of 
the Fermat Problem than the obvious one. Goss’ analogue has most of the features we have come 
to associate with FLT itself. 

On his return from Mathmagic Land, Donald Duck offered his Uncle Scrooge a deal. He 
would work for Scrooge for thirty days. The first day, his wage would be one cent, and every day 
thereafter his wage would be double that of the previous day. Uncle Scrooge bought it. It’s clear 
that Donald profited from his trip to Mathmagic Land! You also will profit from reading this 
book of Ribenboim, but the coinage you receive will not be stamped out by the U.S. Mint.

As an undergraduate I had the misfortune to study thermodynamics after studying advanced
calculus. My course in thermodynamics treated the state of a given body of gas as described by its
pressure $p$, temperature $\theta$, and density $\rho$ when these variables do not vary with position and when
they change so slowly (“quasistatically”) with time that their time derivatives are negligible. The 
differential as an infinitesimal, justly banished from analysis, was the main tool used for 
describing such processes: The differential was the time derivative of a function independent of 
time. (There were in fact several species of such differentials, one denoted by a “d” bearing a 
cross.) The fundamental laws of thermodynamics told how the gas would behave in a heat engine 
having the grace to preserve the spatial uniformity of the gas while coaxing it along quasistatically. 
The existence of such an ideal engine was never questioned, although a version of the Second Law 
of Thermodynamics did prohibit another such ideal engine, the perpetual motion machine. The 
quasistatic evolution of the gas was expressed in terms of differentials of $p$, $\theta$, $\rho$, differentials of
other quantities such as internal energy, entropy, enthalpy, and free energy (which were functionally
related to each other and to $p$, $\theta$, $\rho$), and differentials of quantities such as the heat, which
doesn’t have a differential. Each function relating one set of thermodynamical variables to another
having the same cardinality was assumed smooth and invertible (according to what Truesdell calls
the “Principle of Thoughtless Invertibility” (p. 283)). We students were required to compute all
possible partial derivatives of these functions, which was easy because we could invoke the related
principle that “No Jacobian ever vanishes,” apparently the true First Law of Thermodynamics. 
But the pleasure of studying a science exempt from the rigors of analysis was scant compensation 
for the absence of principles as clean as “f = ma” and the absence of problems as substantial as 
those of classical mechanics. 

It was consequently with relief and satisfaction that I read Truesdell’s telling critiques [1], [2] of 
the incomprehensibility of expositions of thermodynamics. His prescription for the surfeit of 
differentials was simple: Variables such as $\theta$ are to be considered as functions of position and
time. Then expressions containing the differential $d\theta$ can be replaced by precise and illuminating
expressions containing the partial derivatives of $\theta$. For example, a typical contemporary text
might state that the heat $\delta Q$ added to a spatially uniform gas in a quasistatic process taking $(\theta, p)$
to $(\theta + d\theta, p + dp)$ is


$$\delta Q = A(\theta, p)\,d\theta + B(\theta, p)\,dp \tag{1}$$


where the functions $A$ and $B$ characterize the thermomechanical properties of the gas. (The $\delta$
before $Q$ in (1) could be replaced with a cross-bearing $d$.) The vague expression (1) can be
replaced by a statement that the local rate per unit volume at which heat is added to the material
at $x$ at time $t$ when $\theta$ and $p$ are specified functions of $x$ and $t$ is simply


$$a(\theta(x,t), p(x,t))\,\theta_t(x,t) + b(\theta(x,t), p(x,t))\,p_t(x,t) \tag{2}$$


where $a$ and $b$ are proportional to $A$ and $B$ and where the subscripts denote partial derivatives.
The total heat added to a body of gas in a process lasting from $t_1$ to $t_2$ is just the integral of (2)
over $(t_1, t_2)$ and over the set of $x$’s forming the material points of the gas. The use of (2) not only
conforms exactly to the notion of rate, but also makes it clear that thermodynamics need not be
chained to quasistatic processes for bodies described by spatially constant fields. In general, (2) is
not the partial time derivative of any function with values $g(\theta(x, t), p(x, t))$, i.e., the differential
1-form on the right side of (1) fails to be exact. Consequently the appearance of $Q$ on the left side
of (1) is misleading. (The use of such notations is a source of the difficulties many students of 
science experience with the concept of virtual work. The notion of exactness of forms plays a 
central role in the traditional theory of entropy.) 

Truesdell’s prescription for the murkiness of the statements of the fundamental laws was 
likewise straightforward: The laws should be expressed as mathematical relations among functions 
describing physical entities and not as rules for the operation of hypothetical steam engines. In 
this way their validity could be determined by scientific test rather than by speculation almost 
theological in nature. In these writings Truesdell promoted the adoption of a language for civilized 
discourse in a thermodynamics rich enough to encompass fields varying in space and time. This 
language is just that of clean mathematics. (The determination of the correct versions of well 
formulated laws of continuum thermomechanics and of their consequences is an open problem 
which is being actively pursued.) 

How did it happen that thermodynamics seems to be so conceptually inferior to continuum 
mechanics and electromagnetic theory, the two other classical field theories of physics? The book 
under review, its title suggesting both entertainment and narrowly focused historical analysis, 
offers convincing answers to this question. 

The subject of Truesdell’s history is the conceptual founding of thermodynamics primarily by 
Carnot, Clausius, and Reech, and to a lesser extent by Rankine, Kelvin, and others. (Fourier’s 
work, duly analyzed, contributed little to this process: His Analytic Theory of Heat did not treat 
heat.) Truesdell blames the scientific inferiority of early thermodynamics to early mechanics on 
the mathematical inferiority of these founders of thermodynamics (except Kelvin) relative to the 
founders of mechanics, namely Newton, the Bernoullis, Euler, and Cauchy: “The vagueness and 
vacillation of Carnot’s concepts and assumptions... are typical of a theory not subjected to the 
discipline of mathematical statement, and perhaps unavoidable in such a theory” (p. 101).
“Among physicists of the first rank, Carnot is the first who was not in at least equal measure a
mathematician. Thermodynamics is the first mathematical science to have been invented without 
the control afforded by patient, merciless, mathematical criticism. It has suffered from this 
congenital defect from 1824 until now” (p. 136). “Obscure logic, the painfully awkward calculus 
that dogs standard thermodynamics to this day, and vague expression have joined Clausius’ 
splendid achievements to form his legacy” (p. 207). Truesdell attributes the continued inferiority 
of thermodynamics to mechanics to the slavish adherence of the successors of Carnot and 
Clausius to the unfortunate expository precedent they established. Even Kelvin and Poincaré 
could not shake the dead hand of thermodynamics’ sorry mathematical tradition (cf. pp. 143, 168). 

Truesdell’s book contains three kinds of exposition: (i) straightforward accounts of the primary
literature in a unified notation, (ii) a lively set of strongly opinionated (and clearly identified)
critiques, and (iii) retrospective sketches of what could have been achieved in the early history of
thermodynamics with the tools then available. The “ahistorical moralizing” (p. 5) of the critiques
makes the book consistently entertaining: “... Carnot does not follow the tradition of eighteenth-century
rational mechanics he has just praised for its generality and extent. Instead, the sardonic
muse directs him to write in a medium that anybody can understand. An obvious necessary 
condition is that no mathematics be used in the main text. This condition did not turn out to be 
sufficient... . He is reported to have insisted that his brother, untrained in the subject, read and 
criticize the work; according to the legend, his brother understood it perfectly. Later students, 
unable to seek help from that brother, have puzzled, are puzzling, and forever will puzzle over it” 
(p. 80). The same pungency can be found in the amusingly apt quotations from the Divine Comedy 
that introduce each chapter; these are worth translating. Who could resist a book with a section 
entitled: “The Disastrous Effects of Experiment on the Development of Thermodynamics, 
1812-1853”?

One can argue that it is not fair to compare nineteenth-century thermodynamics invidiously 
with eighteenth-century mechanics: The pioneers of thermodynamics should rather be judged 
against the pioneers of mechanics who preceded Newton. Such a view ignores the rich arsenal of 
mathematical technique available to the early thermodynamicists but not to Newton and his 
predecessors. Moreover, the pioneers of thermodynamics had before them the exemplar of a 
mature and coherent mathematical science, mechanics, whereas the pioneers of mechanics had to 
overcome a two thousand year old tradition of disputatious Aristotelianism. But the telling 
objection to this claim of unfairness is that the founders of electromagnetic theory faced the same 
obstacles as the founders of thermodynamics at the same time, yet succeeded in creating a 
coherent and beautiful mathematical science. 

One need only examine the progression from Newtonian to Lagrangean to Hamiltonian 
mechanics as presented in most contemporary texts on discrete mechanics to realize that the 
logical standards of traditional expositions of other fields of classical physics cannot withstand 
careful scrutiny. But the most general and most accessible formulation of classical mechanics, the 
Newtonian, admits a perfectly clear presentation, while the logical deficiencies in the introduction 
of the other formulations can be removed by the same methods that remove them in thermodynamics.
Newtonian mechanics also encompasses innumerable specific, tractable, and illuminating
problems. Nineteenth-century thermodynamics had nothing comparable to it. (I surmise that
thermodynamics got off to such a bad start because Carnot, influenced by the Lagrangean 
formalism, modeled thermodynamics on the statics of discrete systems described by the Principle 
of Virtual Work expressed in terms of the ambiguous differentials, rather than on Euler’s more 
appropriate field theory of perfect fluids.) 

Can a reader untrained in classical thermodynamics follow the scientific development of the 
book without serious difficulty? I think so. By keeping one eye on the critiques following the 
straight historical analysis, such a reader can avoid being upset by failing to follow an argument 
later criticized as faulty. Alternatively, this history could serve as a vademecum to anyone wishing 
to assault a traditional exposition.

Of what value is this specialized historical tract to a mathematician, one, say, who is not
consumed with an overwhelming desire to follow the involutions of the genesis of a physical 
theory that has had little impact on mathematics? There are several answers: The book gives a 
vivid impression of the struggles necessary to establish a mathematical science describing physical 
reality. Thereby it indirectly gives an appreciation of the genius of such as Euler, Cauchy, and 
Maxwell, who triumphed in their struggles to establish continuum mechanics and electromagnetism.
It contains the evaluations by one who is not only a historian of science, but is also an
active researcher who has made significant contributions to thermodynamics. It makes plain that 
the problems of cleaning up the exposition of thermodynamics and distinguishing principle from 
approximation require not the full array of modern mathematical technique, but only precisely 
stated hypotheses and the careful language of advanced calculus. To mathematicians as purveyors 
of the language of science it offers the disturbing suggestion that there is some law of inertia by 
which authors of contemporary books in the mathematical sciences adhere to clumsy mathematical
formulations because their predecessors, from whom they uncritically copied, used those very
formulations. 

Why don’t scientists approach their disciplines with a critical mind? A few do. (Indeed, 
Truesdell protects his flank from attack by physicists with quotes by the Nobel laureate P. W. 
Bridgman on the deficiencies of thermodynamics.) But most of the scientists and engineers we 
educate in this country, who typically take little more than two years of college mathematics and 
who, never having seen a logically coherent presentation of a mathematical science, regard science 
as a bunch of memorized procedures, are simply unequipped to detect mathematical or logical 
weaknesses. Most mathematicians attribute their failure to comprehend a typical presentation of a 
physical subject to their lack of physical intuition rather than to the inadequacy of the presentation.
What we need is an educational system creating scientists so secure in their knowledge that
they can (justly) say, “If I can’t understand it, then it’s incomprehensible.”


MISCELLANEA


104. Sir James Jeans went so far as to assert that the Almighty must be a mathematician 
Himself, or otherwise natural phenomena could not be described in such beautiful mathematical 
terms. This argument takes us rather out of our depth, but we hope we shall not be accused of 
impiety if we remark that it is clear that no such feeling for beauty inspires the Board of Inland 
Revenue (in the U.S.A., the Internal Revenue Service— Editor). 


—B. V. BOWDEN, Faster Than Thought, London, 1953, p. 259 


105. The mathematicians are well acquainted with the difference between pure science, which 
has to do only with ideas, and the applications of its laws to the use of life, in which they are 
constrained to submit to the imperfection of matter and the influence of accidents. 


—SAMUEL JOHNSON, The Rambler, no. 14, May 5, 1750


ON THE GEOMETRY OF THE KEPLER PROBLEM


JOHN MILNOR 
Institute for Advanced Study, Princeton NJ 08540 


Abstract. It will be convenient to use the term Kepler orbit for any curve x = x(t) in 3-space 
which arises as a solution to the Newtonian two body problem. Hamilton showed that the velocity 
vector $v = dx/dt$, associated with any nondegenerate Kepler orbit, moves along a circle. Following
Györgyi, Moser, Osipov and Belbruno, this velocity circle can be interpreted as follows. If we
fix the total energy $E$, then the manifold $M_E$ consisting of all vectors $v$ with $v \cdot v > 2E$ possesses a
natural Riemannian metric of constant curvature $-2E$, whose geodesics are precisely the circles
associated in this way with Kepler orbits. In other words, $M_E$ can be identified with one of the
three classical geometries, that is with spherical, Euclidean or Lobachevsky space, so that each 
“straight line” in this geometry corresponds to a unique Kepler orbit. 


1. The Velocity Circle. In Kepler’s first attempts to understand the orbits of the planets, he 
tried the hypothesis that each orbit is a circle which lies in some plane containing the sun, but is 
not centered at the sun. This is of course wrong. Yet it does describe the correct answer to a
slightly transformed problem. 

Consider a particle which is attracted to the origin by a force inversely proportional to the 
square of the distance, so that its position vector x = x(t) satisfies the Newton differential equation 


(1)  $d^2x/dt^2 = -kx/|x|^3.$


Here k is some fixed positive constant, and |x| denotes the Euclidean length $\sqrt{x \cdot x}$ of the vector x.
(This equation is related to the Newtonian two body problem as follows. If two spherical objects
with position coordinates $x_1$ and $x_2$ move about their common center of gravity in accordance
with Newton’s laws, then the difference vector $x = x_1 - x_2$ will satisfy (1).) Solutions of this
equation, whether elliptical or otherwise, will be called Kepler orbits.

Throughout §1, we will assume that the position x and velocity v = dx/dt are linearly independent
vectors at time $t = t_0$.


THEOREM 1 (Hamilton, 1846). As t varies, the velocity vector v = dx/dt moves along a circle C,
which lies in some plane P containing the origin, but is not in general centered at the origin. Any such
circle can occur, and this “velocity circle” C, together with its orientation, determines the orbit
x = x(t) uniquely.


The proof will show that the corresponding orbit is either an ellipse, hyperbola, or parabola 
according as the origin lies inside, outside, or exactly on the velocity circle C. In the elliptic case, 
the velocity vector moves around the entire circle, but in the other two cases only that portion of 
the circle which is convex towards the origin is actually traversed. (Compare Fig. 1.) 

To begin the proof, recall that the cross product vector $\mathbf{h} = x \times v$ is an invariant of the motion.
That is, the derivative $d\mathbf{h}/dt$ is identically zero, as one verifies by an easy calculation. Thinking of
x(t) as the orbit of a particle of unit mass, we will call the length $h = |\mathbf{h}|$ the angular momentum of
this orbit. Note that h > 0 by our linear independence hypothesis.
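
The "easy calculation" can be written out as follows. This is only a sketch of the standard argument, using Equation (1) and the fact that the cross product of a vector with itself vanishes; bold letters denote vectors.

```latex
\frac{d\mathbf{h}}{dt}
  = \frac{d\mathbf{x}}{dt}\times\mathbf{v} + \mathbf{x}\times\frac{d\mathbf{v}}{dt}
  = \mathbf{v}\times\mathbf{v} - \frac{k}{|\mathbf{x}|^{3}}\,\mathbf{x}\times\mathbf{x}
  = 0 .
```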

It will be convenient to introduce a new system of cartesian coordinates $\mathbf{x} = (x, y, z)$ so that the
vector $\mathbf{h}$ points along the positive z-axis. The equation $x(t) \times v(t) = \mathbf{h} = (0, 0, h)$ then implies that
the vector x(t) always lies in the (x, y)-plane. Setting $x(t) = (r\cos\theta, r\sin\theta, 0)$, a straightforward
computation shows that

(2)  $h = r^2\, d\theta/dt.$

The constancy of this expression $r^2\, d\theta/dt$ is just Kepler’s Second Law, which asserts that a line
segment from the origin to x(t) traverses equal areas in equal times.


John Milnor received his Ph.D. at Princeton University in 1954, under the direction of Ralph Fox. As a student,
he was also strongly influenced by Norman Steenrod, and by a year in Zurich under Heinz Hopf. After teaching for
some years at Princeton University, interrupted by stays in Oxford, Berkeley, U.C.L.A., and M.I.T., he moved across
town to the Institute for Advanced Study. His mathematical work was centered around the topology of manifolds
and related areas of algebra.

[It is interesting to note that Kepler’s interpretation of this law was very different from our 
modern interpretation, since he had no concept of inertia. He believed that the sun exerts a 
sideways force, inversely proportional to distance, which pushes the planets around their orbits.] 

Let us write the Newton equation as $dv/dt = -k(\cos\theta, \sin\theta, 0)/r^2$. Dividing by $d\theta/dt = h/r^2$,
and setting R = k/h, we obtain

$dv/d\theta = -R(\cos\theta, \sin\theta, 0).$

Integrating, we obtain

$v = -R\int(\cos\theta, \sin\theta, 0)\,d\theta = R(-\sin\theta, \cos\theta, 0) + c,$

where $c = (c_1, c_2, 0)$ is a constant of integration. This proves that the velocity vector v moves along
a circle centered at c, with radius R = k/h inversely proportional to angular momentum, and lying in
the plane through the origin which is orthogonal to the vector $\mathbf{h}$.

It is interesting to note that the difference vector $v - c = R(-\sin\theta, \cos\theta, 0)$ is always
orthogonal to the position vector $x = r(\cos\theta, \sin\theta, 0)$. Hence the angle between two velocity
vectors, as measured around this circle C, is equal to the angle between corresponding position vectors
as seen from the origin.
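
The velocity circle can also be observed numerically. The following is a minimal sketch, not part of the proof: it integrates Equation (1) in the orbital plane with a classical Runge-Kutta step (an assumed choice of integrator), reads off the center c from the initial data via $c = v - R(-\sin\theta, \cos\theta)$, and reports how far |v - c| drifts from R = k/h. The function names, step size and initial state are illustrative assumptions, and the orbit is assumed to be counterclockwise (h > 0).

```python
import math

def kepler_rhs(state, k):
    """Right-hand side of d^2x/dt^2 = -k x/|x|^3 for a planar orbit."""
    x, y, vx, vy = state
    r3 = (x * x + y * y) ** 1.5
    return (vx, vy, -k * x / r3, -k * y / r3)

def rk4_step(state, dt, k):
    """One classical fourth-order Runge-Kutta step."""
    def shifted(s, d, f):
        return tuple(si + f * di for si, di in zip(s, d))
    k1 = kepler_rhs(state, k)
    k2 = kepler_rhs(shifted(state, k1, dt / 2), k)
    k3 = kepler_rhs(shifted(state, k2, dt / 2), k)
    k4 = kepler_rhs(shifted(state, k3, dt), k)
    return tuple(s + dt * (a + 2 * b + 2 * c + d) / 6
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def velocity_circle_deviation(state, k, dt=1e-3, steps=8000):
    """Largest value of | |v - c| - R | along the integrated orbit."""
    x, y, vx, vy = state
    h = x * vy - y * vx            # z-component of x × v, assumed positive
    R = k / h                      # radius of the velocity circle
    theta = math.atan2(y, x)
    # Center from v = R(-sin θ, cos θ) + c, evaluated at the initial point.
    cx = vx + R * math.sin(theta)
    cy = vy - R * math.cos(theta)
    worst = 0.0
    for _ in range(steps):
        state = rk4_step(state, dt, k)
        _, _, vx, vy = state
        worst = max(worst, abs(math.hypot(vx - cx, vy - cy) - R))
    return worst
```

For example, `velocity_circle_deviation((1.0, 0.0, 0.2, 1.1), k=1.0)` follows an elliptic orbit for most of one period; the returned deviation reflects only the integration error.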

The ratio e = |c|/R, that is the distance of the center from the origin divided by the radius, is
called the eccentricity of the circle C with respect to the origin. It will be convenient to choose our
coordinates x, y for the plane so that the center c of the circle C lies on the positive y-axis. With this
convention, we can write


$v = R(-\sin\theta, e + \cos\theta, 0).$


Substituting this expression in the equation $\mathbf{h} = x \times v$ we obtain the formula $h = rR(1 + e\cos\theta)$
for angular momentum; and solving for r we obtain


(3)  $r = A/(1 + e\cos\theta),$


where $A = h^2/k$. This is precisely the equation of a conic section of eccentricity e, with focus at the
origin, in polar coordinates. (See Appendix 1.) This conic section is either a circle, ellipse, parabola,
or hyperbola according as e = 0, 0 < e < 1, e = 1, or e > 1. In the latter two cases, note that
the possible values of the angular coordinate $\theta$ are constrained by the inequality $1 + e\cos\theta > 0$.
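
For completeness, the substitution leading to (3) can be spelled out. This is a sketch of the omitted step, using only $x = r(\cos\theta, \sin\theta, 0)$, $v = R(-\sin\theta, e + \cos\theta, 0)$ and R = k/h; the third component of the cross product gives h.

```latex
h = r\cos\theta \cdot R(e + \cos\theta) - r\sin\theta \cdot \bigl(-R\sin\theta\bigr)
  = rR\,(1 + e\cos\theta),
\qquad\text{hence}\qquad
r = \frac{h}{R\,(1 + e\cos\theta)} = \frac{h^{2}/k}{1 + e\cos\theta}.
```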

Our coordinate system has been chosen so that the point of closest approach, where $r = r_{\min}$,
lies on the positive x-axis, with $\theta = 0$. This corresponds to the classical convention that the
angular coordinate $\theta$, known as the anomaly, should be measured from this point of closest
approach. The geometrical constant A, proportional to the square of angular momentum, is
known classically as the semi-latus-rectum. It can be described as the distance from origin to orbit
in a direction at right angles to the direction of closest approach.

To complete the proof of Theorem 1, we must show that every such conic section (3) really
yields a solution to the Newton equation. But Equation (3) shows that r can be expressed as a
function of $\theta$, and Equation (2) implies that $t = \int r(\theta)^2\,d\theta/h$ can also be expressed as a function
of $\theta$. It follows from the inverse function theorem that $\theta$ and hence x can be expressed as
functions of t; and it is not difficult to check that the functions constructed in this way do indeed
satisfy Equation (1). Details will be left to the reader. ∎

Another classical invariant associated with a solution x = x(t) of Equation (1) is the energy
$E = v \cdot v/2 - k/r$. A straightforward calculation shows that the derivative dE/dt is zero, so that
E is indeed constant along any orbit. Note that the inequality $v \cdot v > 2E$ must always be satisfied.
In terms of the velocity circle, with radius R and center c, energy is given by

(4)  $2E = c \cdot c - R^2 = -(1 - e^2)k^2/h^2,$

as one can check by evaluating E at any point of the orbit. This computation can be expressed
more geometrically by the following statement, which is essentially due to Euclid. (Compare
Coxeter [4] pp. 8, 81.)


LEMMA 1. Let $v_1$ and $v_2$ be two points of the circle C which lie on a common line through the
origin. Then $v_1 \cdot v_2 = c \cdot c - R^2$; hence $v_1 \cdot v_2$ is equal to 2E.


Proof. Let w be the point of the circle which is diametrically opposite to $v_1$. Setting $v_1 = c + \mathbf{e}$
and $w = c - \mathbf{e}$, we see that


$v_1 \cdot w = c \cdot c - \mathbf{e} \cdot \mathbf{e} = c \cdot c - R^2 = 2E.$


Since $v_1, v_2, w$ is a right triangle, it follows that $v_1$ is orthogonal to the difference vector $v_2 - w$.
Hence $v_1 \cdot v_2$ is equal to $v_1 \cdot w = 2E$, as required. ∎
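
Lemma 1 is easy to test numerically in the plane of the circle. The sketch below (the helper names `chord_points` and `dot` are illustrative, not from the text) intersects a line through the origin with the circle |v - c| = R by solving a quadratic in the line parameter, and then compares $v_1 \cdot v_2$ with $c \cdot c - R^2$.

```python
import math

def dot(a, b):
    """Euclidean scalar product of two plane vectors."""
    return a[0] * b[0] + a[1] * b[1]

def chord_points(c, R, direction):
    """Points where the line {t * direction} through the origin meets the
    circle |v - c| = R; returns None if the line misses the circle."""
    n = math.hypot(direction[0], direction[1])
    d = (direction[0] / n, direction[1] / n)
    # |t d - c|^2 = R^2  is  t^2 - 2 t (d·c) + (c·c - R^2) = 0.
    b = dot(d, c)
    disc = b * b - (dot(c, c) - R * R)
    if disc < 0:
        return None
    t1, t2 = b - math.sqrt(disc), b + math.sqrt(disc)
    return (t1 * d[0], t1 * d[1]), (t2 * d[0], t2 * d[1])
```

With c = (0, 0.5) and R = 1, every line through the origin yields a pair with scalar product c·c - R² = -0.75, independent of the direction chosen.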

In the case of an elliptical orbit, the precise shape and period of the orbit are related to the
energy E as follows. First note that the major axis 2a of the ellipse is given by the computation


$2a = r_{\min} + r_{\max} = A/(1 + e) + A/(1 - e) = 2A/(1 - e^2).$

Since $A = h^2/k$, it follows from (4) that

(5)  $2E = -k/a.$


Thus the energy E of an elliptic orbit is negative, and |E| is inversely proportional to the major axis of
the ellipse.
The minor axis 2b of the ellipse, computed by setting $dy/dt = R(e + \cos\theta) = 0$, is given by


$b = y_{\max} = A/\sqrt{1 - e^2}.$


The area of the ellipse is then given by

$\pi ab = \pi A^2/(1 - e^2)^{3/2}.$


Applying Kepler’s Second Law in the form d(area)/dt = h/2, we see that the period T of such an
elliptical orbit satisfies $\pi ab/T = h/2$. Together with (4), (5), and the definition of A, this implies
that


(6)  $T = 2\pi k/(-2E)^{3/2} = 2\pi (a^3/k)^{1/2},$


which is a modern statement of Kepler’s Third Law. Thus the period is proportional to $a^{3/2}$ and is
inversely proportional to $|E|^{3/2}$. For further details of these calculations see, for example, Arnold
[1] or Synge and Griffith [21].
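
Formulas (5) and (6) translate directly into a few lines of code. The following is a small sketch; the function names are illustrative, and units are whatever units k, E and a are given in.

```python
import math

def semi_major_axis(E, k):
    """Major semi-axis a from Formula (5), 2E = -k/a (requires E < 0)."""
    if E >= 0:
        raise ValueError("elliptic orbits require negative energy")
    return -k / (2 * E)

def period_from_energy(E, k):
    """Period from the first expression in (6): T = 2πk/(-2E)^(3/2)."""
    if E >= 0:
        raise ValueError("elliptic orbits require negative energy")
    return 2 * math.pi * k / (-2 * E) ** 1.5

def period_from_axis(a, k):
    """Period from the second expression in (6): T = 2π (a^3/k)^(1/2)."""
    return 2 * math.pi * math.sqrt(a ** 3 / k)
```

The two period functions agree whenever a and E are related by (5); for E = -1/2 the period reduces to 2πk.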


Remark. A familiar but incorrect version of this Third Law asserts that the period is
proportional to the 3/2 power of the “mean distance” of x(t) from the origin. In fact the mean
distance, that is the time average $\int r\,dt/\int dt$, is not equal to the major semi-axis a but is rather
equal to $(1 + e^2/2)a$. However it is interesting to note that the time average of $r^{-1}$ is precisely
equal to $a^{-1}$. Proofs based on Formula (9) of Appendix 1 are easily supplied.
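
The two time averages in this Remark can be checked numerically from Formula (9). The sketch below assumes the parametrization $r = a(1 - e\cos u)$ with $u = \alpha s$, so that dt is proportional to r du, and applies the midpoint rule over one full period; the function name and the number of sample points are illustrative.

```python
import math

def time_averages(a, e, n=20000):
    """Time averages of r and of 1/r over one period of an elliptic orbit.

    Uses r = a(1 - e cos u) and dt proportional to r du (Formula (9)),
    with the midpoint rule in the eccentric anomaly u on [0, 2π].
    """
    du = 2 * math.pi / n
    total_t = total_r = total_inv = 0.0
    for i in range(n):
        u = (i + 0.5) * du
        r = a * (1 - e * math.cos(u))
        total_t += r * du          # element of time, up to a constant factor
        total_r += r * r * du      # r weighted by the time element
        total_inv += du            # (1/r) weighted by the time element
    return total_r / total_t, total_inv / total_t
```

For a = 2 and e = 0.6 this returns values close to (1 + e²/2)a = 2.36 and 1/a = 0.5.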

It is noteworthy that the period of such a periodic orbit depends only on the energy. This is true 
for periodic solutions of Lagrangian or Hamiltonian differential equations under quite general
conditions. Compare Herglotz [9], Wintner [22, §100], and Gordon [6]. 


2. The Levi-Civita Metric. Moser [16] has given a very pretty picture of the space of all elliptic
orbits of fixed energy, or fixed period, in terms of stereographic projection. (See also Györgyi [7].)
To carry out Moser’s construction, we must introduce a 3-dimensional sphere, consisting of all
unit vectors $u = (u_1, u_2, u_3, u_4)$ in Euclidean 4-space, and project it stereographically from its
“north pole” (0, 0, 0, 1) onto the 3-dimensional Euclidean space consisting of vectors of the form
$v = (v_1, v_2, v_3, 0)$. Thus each unit vector u maps to the unique vector $v = (u_1, u_2, u_3, 0)/(1 - u_4)$
which has last coordinate zero, and lies on the same straight line through the north pole. Then
every circle on the 3-sphere projects onto a circle or line in 3-space. (Compare Appendix 2.)

A brief computation shows that the antipodal map $u \mapsto -u$ from the 3-sphere to itself
corresponds to the negative inversion map $v \mapsto -v/|v|^2$ from Euclidean 3-space to itself. In other
words, diametrically opposite points of the 3-sphere correspond to points $v_1$ and $v_2$ in Euclidean
3-space which lie on a common line through the origin, and satisfy $v_1 \cdot v_2 = -1$. Evidently the
images of great circles on the 3-sphere are just those circles, or lines through the origin, which map
into themselves under this negative inversion operation. Using Lemma 1, we see that a circle in
3-space has this invariance property if and only if it is one of our “velocity circles” with energy
$E = (c \cdot c - R^2)/2$ equal to $-1/2$, or equivalently with period T equal to $2\pi k$. Thus stereographic
projection carries great circles on the 3-sphere precisely onto the velocity circles associated
with elliptic orbits of energy $-1/2$.
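
The "brief computation" relating the antipodal map to negative inversion can be confirmed on a sample point. This is a minimal sketch; the function names are illustrative, and the last (zero) coordinate of v is dropped for brevity.

```python
def stereographic(u):
    """Project a unit vector u = (u1, u2, u3, u4), u4 != 1, from the north
    pole (0, 0, 0, 1) to the point (u1, u2, u3)/(1 - u4) of 3-space."""
    u1, u2, u3, u4 = u
    s = 1.0 - u4
    return (u1 / s, u2 / s, u3 / s)

def negative_inversion(v):
    """The map v -> -v/|v|^2 on nonzero vectors of 3-space."""
    n2 = sum(x * x for x in v)
    return tuple(-x / n2 for x in v)
```

For the unit vector u = (0.2, 0.4, 0.4, 0.8), the projection of -u coincides with the negative inversion of the projection of u, and the two projected points have scalar product -1.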

There is one apparent defect in this picture. Namely those great circles which pass through the 
poles correspond to straight lines through the origin in velocity space. These do not seem to fit 
into our picture. However this apparent defect is actually a virtue in disguise. For it enables us to 
give a precise description of the limiting behavior of elliptic orbits of fixed period as the angular 
momentum h tends to zero. If $h \to 0$, keeping E fixed and negative, it is easy to see that the minor
axis of the orbit ellipse tends to zero, so that the ellipse flattens out and tends towards a straight
line segment of length $-k/E$ with one end at the origin. In order for the orbit to vary
continuously as $h \to 0$, we must adopt the following.


REFLECTION CONVENTION. If an orbit has angular momentum h = 0, and if dr/dt < 0 for some
values of t so that x(t) falls freely along a straight line towards the origin, then x(t) is reflected back 
along this same line when it hits the origin. 


(Compare Devaney [5], McGehee [14].) In fact it is natural to adopt this same Reflection 
Convention whether the energy is positive, negative or zero. 

By rotating our coordinates, we can put such a singular orbit in the form $t \mapsto (r(t), 0, 0)$, where
$r(t) \ge 0$ satisfies the differential equation $(dr/dt)^2/2 = E + k/r$. As an example, if E = 0, it
follows that r is proportional to $(t - t_0)^{2/3}$. If E < 0, then the solution curve r = r(t) is a cycloid,
periodic in t, and smooth except for a cusp wherever r = 0. (See Appendix 1.) Note in particular
that the derivative dr/dt tends to minus infinity as r decreases to zero, and then jumps to plus
infinity. If x(t) bounces off the origin, then the velocity vector v(t), moving along a straight line,
sweeps through the “point at infinity” in 3-space. This velocity line corresponds, under stereographic
projection, to a great circle which sweeps through the north pole of the unit 3-sphere.
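
The zero-energy case mentioned above can be integrated in one line. As a sketch, take the outgoing branch dr/dt > 0 after a bounce at time $t_0$ (the incoming branch is its mirror image in time):

```latex
\frac{1}{2}\Bigl(\frac{dr}{dt}\Bigr)^{2} = \frac{k}{r}
\;\Longrightarrow\;
\sqrt{r}\,dr = \sqrt{2k}\,dt
\;\Longrightarrow\;
\tfrac{2}{3}\,r^{3/2} = \sqrt{2k}\,(t - t_0)
\;\Longrightarrow\;
r = \Bigl(\tfrac{9k}{2}\Bigr)^{1/3}(t - t_0)^{2/3}.
```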

Thus, if we adopt this reflection convention, then there is a precise one-to-one correspondence 
between orbits of energy —1/2 and great circles on the unit 3-sphere. The situation for other 
negative values of E is the same, except for a scale change in 3-space. 

According to Osipov [17], [18], and Belbruno [2], [3], there is an analogous description for 
parabolic or hyperbolic orbits. First consider a parabolic orbit, with energy E = 0. Recall that the 
velocity vector v(t) associated with such an orbit moves along a circle C which passes through the 
origin. Let us transform this problem by applying the inversion operation $v \mapsto w = v/|v|^2$. Then a
circle C through the origin corresponds to a straight line in the space of vectors w. (See Appendix 
2.) In this way we obtain a precise one-to-one correspondence between orbits of energy zero and 
straight lines in the Euclidean space consisting of all inverted velocity vectors w. In this picture, the 
orbits which bounce off the origin correspond to straight lines through the origin in inverted 
velocity space. 

Finally let us look at the positive energy case, say $E = +1/2$. Since $v \cdot v > 2E = 1$, it follows
that the inverted velocity vector $w = v/|v|^2$ varies over the unit ball $|w| < 1$. Using Lemma 1, we
see that the corresponding velocity circles C are invariant under inversion, and hence intersect the
boundary of the unit ball orthogonally. Thus each orbit of energy $+1/2$ corresponds to a circle arc
$t \mapsto w(t)$ which spans the unit ball $|w| < 1$, and intersects its boundary 2-sphere orthogonally. Orbits
which bounce off the origin correspond to diameters spanning this unit 3-ball.

It is natural to compare this picture with the conformal unit ball model for the “hyperbolic” 
non-Euclidean geometry of Lobachevsky. (See Appendix 2.) In this model, discovered by Beltrami 
and later by Poincaré, points of hyperbolic space correspond to points in the Euclidean unit ball, 
and hyperbolic straight lines correspond to circle arcs or diameters which span the unit ball, 
meeting its boundary orthogonally. Thus the orbits of energy + 1/2 correspond precisely to “straight 
lines” in this model for hyperbolic 3-space. There is an analogous picture for any positive value 
of E. 

In terms of Riemannian geometry, we can put these three different constructions into one 
common framework as follows. Levi-Civita pointed out that it is possible to simplify solutions to 
Equation (1) by introducing a fictitious time parameter $s = \int dt/r$ along any orbit, where r = |x|.
(Compare Appendix 1.) Using this parameter, we will prove the following. 


THEOREM 2 (Osipov and Belbruno). Fixing some constant energy E, consider the space $M_E$
consisting of all velocity vectors v for which $v \cdot v > 2E$, together with a single improper point $v = \infty$.
This space possesses one and only one Riemannian metric $ds^2$ so that the arc-length parameter
$\int ds$ along any velocity circle $t \mapsto v(t)$ is precisely equal to the Levi-Civita parameter $\int dt/|x(t)|$. This
metric is smooth and complete, with constant curvature $-2E$, and its geodesics are precisely the
circles or lines $t \mapsto v(t)$ associated with Kepler orbits.


In other words, there is a unique way of defining the “length” of a curve $\Gamma$ in this space $M_E$ of
all compatible velocity vectors so as to coincide with Levi-Civita’s integral $\int dt/r$. If $K = -2E$ is
positive, then $M_E$, with this definition of length, is isometric to a 3-sphere of radius $1/\sqrt{K}$ in
Euclidean 4-space; and the great circles on this sphere correspond to velocity circles. If K = 0,
then $M_E$ is isometric to Euclidean space; and if K < 0, it is isometric to hyperbolic space, with the
unit of distance chosen appropriately. In particular, in all three cases, $M_E$ can be made into a
smooth manifold even in a neighborhood of the special point $v = \infty$. Geodesics which pass
through this special point correspond to Kepler orbits which bounce off the origin.


Proof. The Newton equation $dv/dt = -kx/r^3$ implies that $|dv/dt| = k/r^2$. Dividing this
equation by the definition ds = dt/r, and recalling the definition of E, we obtain
$|dv/ds| = k/r = v \cdot v/2 - E$. Therefore $|ds| = 2|dv|/(v \cdot v - 2E)$, or in other words


(7)  $ds^2 = 4\,dv \cdot dv/(v \cdot v - 2E)^2.$


Thus there is one and only one Riemannian metric on $M_E$ which satisfies our condition, and it is given
by this formula (7). To describe what happens in a neighborhood of infinity, we work with the
inverted velocity coordinate $w = v/|v|^2$. Note that $2E\,w \cdot w < 1$. Computation shows that
$dw \cdot dw = dv \cdot dv/(v \cdot v)^2$. (See Formula (10) of Appendix 2.) Hence $ds^2 = 4\,dw \cdot dw/(1 - 2E\,w \cdot w)^2$,
where the denominator is always strictly positive. If we set $K = -2E$, this becomes


(7')  $ds^2 = 4\,dw \cdot dw/(1 + K\,w \cdot w)^2.$


Except for a scale change, this is just the form given by Riemann, in his inaugural dissertation, for a 
metric of constant curvature K. In the case K = 0, this metric is obviously flat, with straight lines as 
geodesics. In the positive curvature case, we can compare it with the standard metric on a 3-sphere 
of radius $1/\sqrt{K}$. It is not difficult to check that these two metrics correspond isometrically, via
stereographic projection from the 3-sphere to a 3-plane passing through its center; and that great 
circles correspond precisely to our (inverted) velocity circles. Further details of the proof, 
particularly in the case K < 0, may be found in Appendix 2. 
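
The passage from the velocity coordinate v to the inverted coordinate w in the proof above can be written out as follows. This is a sketch of the substitution only, using $w = v/|v|^2$ (so that $v \cdot v = 1/(w \cdot w)$) and Formula (10) of Appendix 2 with r = 1, applied in both directions.

```latex
dv \cdot dv = \frac{dw \cdot dw}{(w \cdot w)^{2}},
\qquad
v \cdot v - 2E = \frac{1 - 2E\, w \cdot w}{w \cdot w},
\qquad\text{so}\qquad
ds^{2} = \frac{4\, dv \cdot dv}{(v \cdot v - 2E)^{2}}
       = \frac{4\, dw \cdot dw}{(1 - 2E\, w \cdot w)^{2}} .
```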


COROLLARY. The functions x(t) and v(t) depend smoothly on the time t and the initial conditions 
x(0) and v(0), even in the neighborhood of an orbit which bounces off the origin, so long as the vectors 
x(0) and x(t) themselves are not zero.

Here is an explicitly worked out example, to illustrate the behavior of these functions. Consider
the family of parallel straight lines $s \mapsto w = (-s, \alpha, \beta)/2$ in inverted velocity space, depending
on two real parameters $\alpha$ and $\beta$. These correspond to the family of parabolic orbits

$x = k(\alpha^2 + \beta^2 - s^2, 2\alpha s, 2\beta s)/2.$

Not only this orbital position x, but also the distance

$r = |x| = k(\alpha^2 + \beta^2 + s^2)/2$

and the time

$t = \int_0^s r\,ds = ks(\alpha^2 + \beta^2 + s^2/3)/2$

can be expressed as smooth functions of the three parameters $\alpha, \beta, s$. Using the implicit function
theorem, we can solve for s, and hence x, as functions of $\alpha, \beta, t$. These functions are smooth
except precisely at those points where dt/ds = |x| is zero.
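
The formulas of this example are simple enough to verify mechanically. The sketch below (function names are illustrative) evaluates x, r and t for given $\alpha, \beta, s$, forms the velocity v = (dx/ds)/(dt/ds) = (dx/ds)/r, and can then be used to check that |x| = r, that the energy $v \cdot v/2 - k/r$ vanishes, and that $v/|v|^2$ is the point $(-s, \alpha, \beta)/2$ of the straight line in inverted velocity space.

```python
import math

def norm(a):
    """Euclidean length of a vector given as a tuple."""
    return math.sqrt(sum(x * x for x in a))

def parabolic_orbit(alpha, beta, s, k=1.0):
    """Position x, distance r and time t on the parabolic orbit with
    parameters alpha, beta at Levi-Civita parameter s."""
    q = alpha ** 2 + beta ** 2
    x = (k * (q - s ** 2) / 2, k * alpha * s, k * beta * s)
    r = k * (q + s ** 2) / 2
    t = k * s * (q + s ** 2 / 3) / 2
    return x, r, t

def velocity(alpha, beta, s, k=1.0):
    """Velocity v = dx/dt = (dx/ds)/r, with dx/ds = k(-s, alpha, beta)."""
    _, r, _ = parabolic_orbit(alpha, beta, s, k)
    return (-k * s / r, k * alpha / r, k * beta / r)

def inverted(v):
    """The inverted velocity w = v/|v|^2."""
    n2 = sum(c * c for c in v)
    return tuple(c / n2 for c in v)
```

Note that r vanishes only when $\alpha = \beta = s = 0$, which is the bounce point of the singular orbit in this family; `velocity` should not be called there.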
Similarly, we can look at the six parameter family of orbits depending on the initial position
vector $x(0) \ne 0$ and the initial velocity vector v(0). Using the identity

$r = 2k/(v \cdot v - 2E) = 2k\,w \cdot w/(1 - 2E\,w \cdot w),$

where $E = v(0) \cdot v(0)/2 - k/|x(0)|$, we see that r and $t = \int r\,ds$ can be expressed as smooth functions
of the parameters $x(0) \ne 0$, v(0) and s. In fact it is not difficult to verify the identity

$x = 4k(2(w \cdot w')w - (w \cdot w)w')/(1 - 2E\,w \cdot w)^2,$


where $w' = dw/ds$, which shows that x can also be expressed as a smooth function of these
parameters. Again, if we use t in place of s as parameter, then we lose the smoothness of the
resulting functions only at those points where x is precisely equal to the zero vector.


Appendix 1. Conic sections. 


This will be a brief review of some classical constructions. An ellipse, with foci $f_1$ and $f_2$, can be
defined as the set of all vectors x in the Euclidean plane which satisfy the equation


$|x - f_1| + |x - f_2| = 2a.$


Here $2a > |f_1 - f_2|$ is a constant equal to the major axis of the ellipse. The ratio $e = |f_1 - f_2|/2a$
of the distance between foci to the major axis is called the eccentricity of the ellipse. Note that
$0 \le e < 1$, where e = 0 only for a circle. If we choose cartesian coordinates x, y so that $f_1$ is the
origin and $f_2$ is the point $(-2ea, 0)$, then the distances r and 2a − r of (x, y) from $f_1$ and $f_2$ are
given by the equations


$x^2 + y^2 = r^2, \quad (x + 2ea)^2 + y^2 = (2a - r)^2.$

(Fig. 2.) Subtracting one equation from the other and dividing by 4a, we obtain

(8)  $r = \sqrt{x^2 + y^2} = A - ex,$


where A denotes the constant $(1 - e^2)a$, known as the “semi-latus-rectum.”

More generally, if we fix any constants $e \ge 0$ and A > 0, then the locus described by Equation
(8) is called a conic section with focus at the origin, and with eccentricity e. Squaring both sides of
(8), we obtain


$(1 - e^2)x^2 + y^2 = A^2 - 2Aex.$


Evidently this equation describes a parabola if e = 1, or a hyperbola if e > 1. Substituting
$x = r\cos\theta$, $y = r\sin\theta$ in (8), we obtain the equation


(3)  $r = A/(1 + e\cos\theta)$


which defines such a conic section, with focus at the origin, in polar coordinates. In the elliptic
case e < 1, note that the quadratic equation relating x and y can be put in the form
$(x + ea)^2/a^2 + y^2/b^2 = 1$, where $b = a\sqrt{1 - e^2}$. Thus this ellipse is centered at the point $(-ea, 0)$, and has
principal axes $2a \ge 2b$.

Given the constant k > 0, we can introduce the time t into this geometrical picture by means of
Kepler’s Second Law, $d\theta/dt = h/r^2$, or in other words $t = \int r^2\,d\theta/h$, where $h = \sqrt{kA}$. Similarly,
Levi-Civita’s fictitious time can be introduced as the integral $s = \int dt/r = \int r\,d\theta/h$ along an orbit.
In practice, it turns out to be easiest to express both time and position as functions of s.
Substituting (3) in the equations $x = r\cos\theta$, $y = r\sin\theta$ and differentiating, we find that


$dx/ds = -Ry, \quad dy/ds = he + (1 - e^2)Rx,$


and therefore $d^2y/ds^2 = 2Ey$, where R = h/A, and $2E = -(1 - e^2)R^2$. From this, it is not
difficult to compute y, x, and $t = \int (A - ex)\,ds$ as functions of s. As an example, in the negative
energy case $2E = -\alpha^2 < 0$, if we normalize so that s = 0 at the point of closest approach, we
find that

$x = -ea + a\cos\alpha s, \quad y = b\sin\alpha s,$

with

(9)  $r = a(1 - e\cos\alpha s), \quad t = a(s - e\alpha^{-1}\sin\alpha s).$


(Compare KEPLER.) This form of the equations again makes the elliptical shape of the orbit quite
clear. In Kepler’s terminology, the angle $\alpha s$ is known as the “eccentric anomaly,” and the angle
$\alpha t/a = 2\pi t/T$ is called the “mean anomaly.” If the eccentricity e tends to 1, keeping a and
$\alpha^2 = k/a$ fixed, so that b and A tend to zero, note that these expressions tend to well behaved
limits. The resulting curve in the t, x plane is known as a cycloid. In the positive energy case, there
are completely analogous formulas involving the hyperbolic sine and cosine functions. In the zero
energy case, evidently y is a linear function of s, hence x is a quadratic function, and t is a cubic
function of s.
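
Formula (9) gives the time as a function of s, so finding the position at a given time requires inverting the relation $\alpha t/a = u - e\sin u$, where $u = \alpha s$ is the eccentric anomaly. The sketch below uses Newton's method for this inversion; that choice of method, the starting guess u = M, and the function names are assumptions made for illustration, and the iteration is only intended for moderate eccentricities.

```python
import math

def eccentric_anomaly(M, e, tol=1e-14, max_iter=50):
    """Solve M = u - e sin u for u by Newton's method (0 <= e < 1)."""
    u = M                                   # simple starting guess
    for _ in range(max_iter):
        step = (u - e * math.sin(u) - M) / (1 - e * math.cos(u))
        u -= step
        if abs(step) < tol:
            break
    return u

def position_at_time(t, a, e, k):
    """Position (x, y) on the elliptic orbit at time t after closest approach,
    using x = -ea + a cos u, y = b sin u with u = αs from Formula (9)."""
    alpha = math.sqrt(k / a)                # from 2E = -k/a = -α^2
    b = a * math.sqrt(1 - e * e)
    M = alpha * t / a                       # mean anomaly
    u = eccentric_anomaly(M, e)
    return (-e * a + a * math.cos(u), b * math.sin(u))
```

At t = 0 this returns the point of closest approach (a(1 − e), 0), and after half a period, t = πa/α, it returns the most distant point (−a(1 + e), 0).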


Appendix 2. Inversive geometry, stereographic projection, and hyperbolic geometry. 


This will be a quick review of well-known material. Further details may be found for example
in Coxeter [4] or in Hilbert and Cohn-Vossen [10].

Let p be a base point in the n-dimensional Euclidean space E. The operation of inversion, in
the unit sphere centered at p, is a smooth mapping from E − p to itself, defined as follows. Each
point $x \ne p$ of Euclidean space maps to the unique point y which lies on the ray (= half-line) which
starts at p and passes through x, such that the Euclidean distance $|y - p|$ is equal to the reciprocal
$1/|x - p|$. Thus each point of the unit sphere $|x - p| = 1$ is fixed by this inversion map, but the
inside and outside of the sphere are interchanged. More generally, given any constant r > 0, we
can invert in the sphere of radius r centered at p. The definition is the same, except that we set
$|y - p|$ equal to $r^2/|x - p|$.
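
In coordinates, this definition amounts to $y = p + r^2 (x - p)/|x - p|^2$. A minimal sketch follows; the function name `invert` is illustrative, and the point at infinity is not modeled, so the base point itself is rejected.

```python
def invert(x, p, r=1.0):
    """Inversion of the point x in the sphere of radius r centered at p.

    The image y lies on the ray from p through x, with |y - p| = r^2/|x - p|.
    """
    d = [xi - pi for xi, pi in zip(x, p)]
    d2 = sum(di * di for di in d)
    if d2 == 0:
        raise ValueError("the base point p has no image in Euclidean space")
    return tuple(pi + r * r * di / d2 for pi, di in zip(p, d))
```

For instance, with p the origin and r = 2, the point (3, 4, 0) at distance 5 maps to (0.48, 0.64, 0) at distance 4/5, and applying `invert` twice returns the original point.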

It is often convenient to extend this construction by adjoining a formal point at infinity to 
Euclidean space. Then inversion maps $E \cup \infty$ to itself, with the understanding that the base point
p maps to the point $\infty$ and that $\infty$ maps to p.

Here is an example to illustrate the inversion map. Consider an (n − 1)-dimensional sphere S in
Euclidean space, with two diametrically opposite points p and q, and let P be an (n − 1)-dimensional
hyperplane, situated as in Figure 3, so that the half-line from p through q meets P orthogonally at a
point $z \ne p$. Then for any x of S and y of P which lie on a common line through p, the right
triangle p, x, q is similar to the right triangle p, z, y. Therefore the ratio $|x - p|/|q - p|$ is equal to
the ratio $|z - p|/|y - p|$. In other words, the distance from x to p is inversely proportional to the
distance from y to p. This proves that the sphere S maps onto the hyperplane $P \cup \infty$ under inversion
from p, provided that the constant r is chosen correctly; and conversely it proves that $P \cup \infty$ maps
onto S.


[FIG. 3. Labels recoverable from the figure: half-space H; boundary $\partial H$; (n − 1)-sphere fixed by inversion.]


Definition. The one-to-one correspondence between the sphere S and the hyperplane $P \cup \infty$
which is constructed in this way is called stereographic projection from p. Geometrically, stereographic
projection maps each point $x \ne p$ of S to the unique point y of P such that x, y, and p all
lie on a common straight line.

If $P^k$ is a plane of dimension k in Euclidean n-space, where k can be any number between 1
and n − 1, then a completely analogous argument shows that the image of $P^k \cup \infty$ under
inversion is either a sphere $S^k$ (if $P^k$ does not contain the base point p), or is equal to $P^k \cup \infty$
itself if $P^k$ does contain p. More generally, we will prove the following.

LEMMA 2 (Steiner, 1824). The operation of inversion carries every straight line or circle in $E \cup \infty$
to a straight line or circle. Similarly it carries every k-dimensional sphere or plane to a k-dimensional
sphere or plane. Furthermore it preserves angles. That is, if two lines or circles intersect at angle $\theta$ (at
a point other than p), then their images under inversion also intersect at angle $\theta$.

To make sense of this statement, we must adopt the convention that the point at infinity 
belongs to every straight line or hyperplane, but that it does not belong to any circle. 

As an immediate corollary of this lemma: Stereographic projection maps every circle in the 
sphere S to a circle or straight line in the hyperplane $P \cup \infty$. Furthermore, stereographic projection
also preserves angles. 


Proof of Lemma 2. First consider a sphere of dimension n − 1. By the remarks above, we need
only consider the case where this sphere $S^{n-1}$ does not contain the base point p. It will be
convenient to choose cartesian coordinates so that p is the origin. If x and x′ are two points of
$S^{n-1}$ which lie on a common line through the origin p, then, as in the proof of Lemma 1, the
scalar product $x \cdot x'$ is equal to a nonzero constant, which we can write as $r^2/\alpha$. It follows that the
image of x′ under inversion is equal to $\alpha x$. Therefore, the image of $S^{n-1}$ under inversion is precisely
equal to the sphere consisting of all points $\alpha x$ with x in $S^{n-1}$.

The proof for a sphere $S^k$ of arbitrary dimension now follows easily. For $S^k$ can be described
as the intersection of the sphere $S^{n-1}$ which has the same center and the same radius with a plane
$P^{k+1}$. Since inversion maps both $S^{n-1}$ and $P^{k+1}$ to a sphere or plane of the same dimension, it
follows that it maps their intersection to a sphere or plane.

Next consider two smooth curves $\Gamma_1$ and $\Gamma_2$ in Euclidean space which intersect at a point
$x \ne p$. Then their images $\Gamma_1^*$ and $\Gamma_2^*$ under inversion intersect at a corresponding point $x^*$. If $\theta$ is
the angle between the tangent vectors of the two curves at x, and $\theta^*$ is the angle between the
corresponding tangent vectors at $x^*$, we must prove that $\theta = \theta^*$. Let $L_i$ be the straight line which
is tangent to $\Gamma_i$ at x, let $L_i'$ be the parallel straight line through p, and let $C_i$ be the image of $L_i$
under inversion. Then $L_i'$ is tangent to $C_i$ at p, so the angle between $C_1$ and $C_2$ at p is equal to $\theta$.
Evidently the angle between these two circles at their two points of intersection must be equal.
Since $C_i$ is tangent to $\Gamma_i^*$ at $x^*$, this completes the proof. ∎

Now let us describe a very different kind of geometry. Non-Euclidean geometry, also called 
hyperbolic geometry, was first introduced by Lobachevsky and Bolyai by means of a collection of 
geometric axioms. These were identical to the axioms of Euclidean geometry, except for the 
hypothesis that, within a plane, it is possible to construct more than one parallel to a given line 
through a given point. There was no proof that these axioms were consistent for many years, until 
Beltrami showed that hyperbolic geometry could be modeled within Euclidean geometry. (See 
Milnor [15] for references.) One of Beltrami’s Euclidean realizations of hyperbolic geometry was 
the upper half-space model, later utilized by Poincaré. It can be described as follows.

Let H be an open half-space, with boundary $\partial H$, in the n-dimensional Euclidean space E. It
will be convenient to make use of Cartesian coordinates $x = (x_1, \ldots, x_n)$, chosen so that the
half-space H is defined by the inequality $x_n > 0$. Thus $\partial H$ is the hyperplane $x_n = 0$. If $\Gamma$ is a
smooth curve, described parametrically by the equation x = x(t), then we will use the notation
$\int_\Gamma |dx|$ for the Euclidean length $\int |dx/dt|\,dt$.


Definition. By the hyperbolic length of a smooth curve $\Gamma$ in the half-space H will be meant the
integral $\int_\Gamma |dx|/x_n$, along $\Gamma$, of the Euclidean length element |dx| divided by the Euclidean
distance from $\partial H$.

A curve $\Gamma$ in H will be called a hyperbolic line if it provides the hyperbolically-shortest possible
path between any two of its points. In other words, there must be no curve in H which joins two
points of $\Gamma$ and has hyperbolic length strictly less than the hyperbolic length of the segment $\Gamma_0$ of
$\Gamma$ between these points. (In Riemannian geometry, such a curve $\Gamma$ is called a minimal geodesic.)
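
The definition of hyperbolic length lends itself to a direct numerical approximation. The following sketch (the function name and the polygonal approximation scheme are illustrative assumptions) replaces the curve by short chords and divides each chord length by the mean height of its endpoints above $\partial H$.

```python
import math

def hyperbolic_length(curve, t0, t1, n=20000):
    """Approximate the integral of |dx|/x_n along t -> curve(t), t0 <= t <= t1.

    `curve` returns a tuple whose last coordinate x_n must stay positive.
    """
    total = 0.0
    dt = (t1 - t0) / n
    prev = curve(t0)
    for i in range(1, n + 1):
        cur = curve(t0 + i * dt)
        chord = math.sqrt(sum((a - b) ** 2 for a, b in zip(cur, prev)))
        height = 0.5 * (prev[-1] + cur[-1])   # mean distance from the boundary
        total += chord / height
        prev = cur
    return total
```

In the half-plane, the vertical segment from height 1 to height e has hyperbolic length 1. The arc of the unit semicircle between the angles π/4 and 3π/4 has hyperbolic length about 1.763, which is shorter than the horizontal Euclidean segment joining the same endpoints, whose hyperbolic length is 2.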


LEMMA 3 (Beltrami, 1868). A curve in the half-space H is a hyperbolic line if and only if it is
either a Euclidean half-line which meets the boundary of H orthogonally, or a Euclidean semi-circle
which meets $\partial H$ orthogonally.


Proof. First suppose that $\Gamma$ is a half-line


$x_1 = \text{constant}, \ldots, x_{n-1} = \text{constant}, \quad x_n > 0.$


If $\Lambda$ is any other curve segment joining two points of $\Gamma$, then it is easy to check that

$\int_\Lambda |dx|/x_n \ge \int_\Lambda |dx_n|/x_n \ge \int_{\Gamma_0} |dx_n|/x_n,$

where $\Gamma_0$ denotes the segment of $\Gamma$ between these two points.
In fact there is a strict inequality unless $\Lambda$ is also a vertical line segment. This proves that $\Gamma$ is a
hyperbolic line; and furthermore that it provides the unique hyperbolically-shortest path between any
two of its points.

Now consider an inversion mapping \(x \mapsto y\), using any base point \(p\) on \(\partial H\). To simplify the
computation, let us first suppose that \(p = 0\). Differentiating the equation \(y = r^2 x/(x \cdot x)\), we
obtain

\[ dy = r^2\bigl((x \cdot x)\,dx - 2(x \cdot dx)\,x\bigr)/(x \cdot x)^2. \]

If we take the dot product of this equation with itself, the terms involving \(x \cdot dx\) cancel, so that we
obtain

\[ dy \cdot dy = r^4\, dx \cdot dx/(x \cdot x)^2. \tag{10} \]

More generally, for any choice of base point \(p\), the appropriate formula is

\[ dy \cdot dy = dx \cdot dx\,\bigl(r/|x - p|\bigr)^4. \]

It follows easily that

\[ |dy|/|y - p| = |dx|/|x - p|. \]

Since the vector \(y - p\) is a positive multiple of \(x - p\), this implies that

\[ |dy|/y_n = |dx|/x_n \]

whenever the base point \(p\) belongs to the boundary of \(H\). Thus the inversion mapping \(x \mapsto y\), with
any choice of base point in \(\partial H\), maps the half-space \(H\) into itself so as to preserve hyperbolic length.
Such a length preserving mapping from H to itself is called a hyperbolic isometry. 
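The invariance just derived can be checked numerically. The following is a minimal sketch, not part of the original argument: the test curve, the base point and the radius are arbitrary choices made for illustration, and the hyperbolic length is only approximated by a polyline sum.

```python
import math

def invert(x, p, r):
    """Inversion in the sphere of radius r about p: y = p + r^2 (x - p) / |x - p|^2."""
    d = [xi - pi for xi, pi in zip(x, p)]
    s = sum(di * di for di in d)
    return [pi + r * r * di / s for pi, di in zip(p, d)]

def hyperbolic_length(points):
    """Approximate the integral of |dx| / x_n along a polyline (chord length over mean height)."""
    total = 0.0
    for a, b in zip(points, points[1:]):
        total += math.dist(a, b) / (0.5 * (a[-1] + b[-1]))
    return total

def sample_curve(steps=20000):
    """An arbitrary smooth arc in the half-space x_3 > 0 (hypothetical test curve)."""
    return [[math.cos(2 * t), math.sin(3 * t), 1.0 + 0.5 * t * t]
            for t in (k / steps for k in range(steps + 1))]

curve = sample_curve()
p, r = [0.3, -0.7, 0.0], 1.5          # assumed base point on the boundary x_3 = 0, assumed radius
image = [invert(x, p, r) for x in curve]
length_before = hyperbolic_length(curve)
length_after = hyperbolic_length(image)
print(length_before, length_after)    # the two values should agree up to discretization error
```

The image curve stays in the half-space because the last coordinate of y is a positive multiple of the last coordinate of x when the base point lies on the boundary.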

It follows from Lemma 2 that inversion carries any half-line meeting \(\partial H\) orthogonally to a
semicircle meeting \(\partial H\) orthogonally. Since we have shown that these Euclidean half-lines are
hyperbolic lines, it follows that these semi-circles must also be hyperbolic lines.

To see that there are no other hyperbolic lines, suppose that we start with two arbitrary points
\(x\) and \(y\) of the half-space \(H\). Then it is not difficult to check that \(x\) and \(y\) lie on either a Euclidean
half-line or a Euclidean semi-circle which meets \(\partial H\) orthogonally. Thus we have constructed
enough hyperbolic lines to join any two points of \(H\); and it follows that we have constructed all
hyperbolic lines.

More generally, a k-dimensional subset of H is called a hyperbolic k-plane if it contains the 
hyperbolic line joining any two of its points. A similar argument shows that a subset of H is a 
hyperbolic k-plane if and only if it is either a Euclidean half-plane or a Euclidean hemisphere, 
meeting the boundary of H orthogonally, and having dimension k. 

Note that the group consisting of all hyperbolic isometries from \(H\) to itself is quite large. The
proof above shows that any inversion map, with base point in \(\partial H\), belongs to this group of
isometries. As a further example, it is not difficult to check that the mapping

\[ (x_1, \ldots, x_n) \mapsto c_n\,(x_1, \ldots, x_n) + (c_1, \ldots, c_{n-1}, 0) \]

is a hyperbolic isometry which carries the point \(u = (0, \ldots, 0, 1)\) to a completely arbitrary point
\(c = (c_1, \ldots, c_n)\) of \(H\).

One characteristic property of this geometry is the following. In any hyperbolic plane, consider a
triangle \(\Delta\) of hyperbolic area \(A\), bounded by three hyperbolic line segments. The sum of the interior
angles of \(\Delta\) is equal to \(\pi - A\). Here we use the usual Euclidean definition of angle in the upper
half-space model, but \(A\) must be defined as the integral \(\int_\Delta (dA)_{\text{Euclidean}}/(x_n)^2\) so as to be invariant
under hyperbolic isometries; where \((dA)_{\text{Euclidean}}\) stands for the usual Euclidean area element.


Proof. We may assume that \(\Delta\) lies in the plane \(x_2 = \cdots = x_{n-1} = 0\). Setting \(x = x_1\), \(y = x_n\),
and integrating \(A = \iint_\Delta dx\,dy/y^2\) with respect to the \(y\) variable, we obtain \(A = \int_{\partial\Delta} dx/y\), to be
integrated around the boundary of \(\Delta\). Each edge of \(\Delta\) is either a vertical line segment, on which
\(dx/y\) is zero, or a circle arc

\[ x = x_0 + a\cos\theta, \quad y = a\sin\theta, \]

on which \(dx/y = -d\theta\). Thus \(A = -\int_{\partial\Delta} d\theta\). An elementary argument then shows that this
expression is equal to \(\pi\) minus the sum of the angles. ∎
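As a sanity check on the angle formula, consider the limiting case of an ideal triangle in the plane of the proof, with vertices at \(x = -1\), \(x = 1\) and \(\infty\). This example is not in the original text, and its vertices lie on the boundary rather than in \(H\), so it should be read only as an illustration. Its edges are the vertical lines \(x = \pm 1\) and the unit semicircle, and all three interior angles are zero:

```latex
\[
A \;=\; \int_{-1}^{1} \int_{\sqrt{1 - x^2}}^{\infty} \frac{dy}{y^2}\, dx
  \;=\; \int_{-1}^{1} \frac{dx}{\sqrt{1 - x^2}}
  \;=\; \pi ,
\qquad\text{so}\qquad
\pi - A \;=\; 0 \;=\; \text{(sum of the interior angles)} .
\]
```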

By way of contrast, for any triangle on the unit 2-sphere the sum of the interior angles is equal
to \(\pi\) plus the area. (Compare Coxeter [4], pp. 95, 297.) On a sphere of radius \(r\), the corresponding
formula would be

\[ \sum (\text{interior angles}) = \pi + KA, \tag{11} \]

where \(K = 1/r^2\). Let us take this last formula as a definition of the curvature of a space of
constant curvature \(K\). Then evidently Euclidean space has curvature \(K = 0\), and hyperbolic space
has curvature \(K = -1\). More generally, if we change the definition of “length,” multiplying all
hyperbolic lengths by \(r\) and all areas by \(r^2\), then we obtain a space of constant curvature
\(K = -1/r^2\).

Section 2 makes use of the conformal unit ball model for hyperbolic space. This is a slightly
different Euclidean model which can be constructed as follows. Let us invert the upper half-space
\(H\), consisting of all points \(x\) with \(x_n > 0\), with respect to a sphere of radius \(r = \sqrt{2}\) centered at the
point \(p = (0, \ldots, 0, -1)\). Then it is not difficult to check that the image of \(H\) under this inversion
is precisely the open unit ball \(B\), consisting of all vectors \(y\) with \(|y| < 1\). Let \(u\) be the upward unit
vector \(-p = (0, \ldots, 0, 1)\). A brief computation, based on the formulas \(x + u = 2(y + u)/|y + u|^2\)
and \(|dx| = 2|dy|/|y + u|^2\), shows that the hyperbolic length element \(|dx|/x_n = |dx|/(x \cdot u)\) in the
half-space \(H\) corresponds to the length element

\[ 2|dy|/(1 - y \cdot y) \tag{12} \]

in the ball \(B\). This last expression is the Riemann-Beltrami formula for the hyperbolic length element
in the unit ball \(y \cdot y < 1\).
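The brief computation can be spelled out as follows. This is a sketch that uses only the two formulas quoted above, together with \(u \cdot u = 1\) and \(y \cdot u = y_n\):

```latex
\begin{align*}
x_n = x \cdot u &= \frac{2\,(y + u) \cdot u}{|y + u|^2} - 1
   = \frac{2(y_n + 1) - (y \cdot y + 2 y_n + 1)}{|y + u|^2}
   = \frac{1 - y \cdot y}{|y + u|^2}, \\
\frac{|dx|}{x_n} &= \frac{2\,|dy|}{|y + u|^2} \cdot \frac{|y + u|^2}{1 - y \cdot y}
   = \frac{2\,|dy|}{1 - y \cdot y}.
\end{align*}
```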

We can take this ball B, with the length element (12), as an alternative model for hyperbolic 
space. It follows from Lemmas 2 and 3 that the hyperbolic lines in this model are just the circle 
arcs or diameters which span the ball B, meeting its boundary sphere orthogonally. 

This model gives us a further understanding of the richness of the group of isometries of 
hyperbolic space, for the subgroup consisting of all hyperbolic isometries of B which fix the origin 
can evidently be identified with the orthogonal group, consisting of all isometries of Euclidean 
space fixing the origin. The existence of such a large group of isometries would provide a key step 
in a proof that hyperbolic geometry, constructed in this way, satisfies all of the axioms of 
Euclidean geometry with the exception of the parallel axiom. 

SHORT THEOREMS WITH LONG PROOFS

Long proofs are an anathema to mathematicians. Part of mathematics’ uniqueness is the
verifiability of an argument. This is being called into question with the existence of proofs of 
inordinate length. A folk theorem, surely known in the 1930's, is particularly germane today. A 
recent note in which F. H. Norwood [1] appears unaware of this result provided the motivation for 
these remarks. We give three variants, with increasing formality. 


(i) There exist short theorems with long proofs. 
(ii) There exist theorems T whose shortest proofs have length at least 10'°2" where n is the 
length of T. 
(iii) For all recursive functions F there exist theorems T whose shortest proofs have length at 
least F(n) where n is the length of T. 

There remains a hidden variable: the formal system in which theorems are proven. The folk
theorem applies to any formal system for which there is no decision procedure for theoremhood. 
This includes Peano Arithmetic, Zermelo-Fraenkel set theory, or any formalization of “Generally 
Accepted Mathematics.” It does not apply to limited systems such as the first order theory of 
Finite Fields. 

Suppose (iii) were false for a specific F and every theorem of length n had a proof of length at
most F(n). This would give a simple decision procedure for statements S. Let n be the length of S
and examine all texts of length at most F(n). The statement S is a theorem if and only if one of
these texts is a proof of S. As no such decision procedure exists, (iii) is proved.
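The decision procedure in this argument is a bounded exhaustive search, which can be sketched in a few lines. Everything concrete below is a hypothetical toy: the "formal system" (a statement is a decimal numeral read as "this number is composite", and a proof is a text `p*q`), the alphabet, and the bound `proof_length_bound` are assumptions made for illustration. The toy system is of course decidable; the point is only the shape of the search.

```python
from itertools import product

ALPHABET = "0123456789*"   # symbols allowed in a toy "proof text" (assumption)

def is_proof(text, statement):
    """Toy proof checker: 'p*q' proves 'statement is composite' when p*q equals it and p, q > 1."""
    parts = text.split("*")
    if len(parts) != 2 or not all(part.isdigit() for part in parts):
        return False
    p, q = int(parts[0]), int(parts[1])
    return p > 1 and q > 1 and p * q == int(statement)

def proof_length_bound(n):
    """Hypothetical bound F(n): a factorization 'p*q' of an n-digit number has at most n + 2 symbols."""
    return n + 2

def decide(statement):
    """Examine every text of length at most F(n); the statement is a theorem iff some text proves it."""
    bound = proof_length_bound(len(statement))
    for length in range(1, bound + 1):
        for symbols in product(ALPHABET, repeat=length):
            if is_proof("".join(symbols), statement):
                return True
    return False

print(decide("15"), decide("13"))
```

If a computable bound F existed for a system such as Peano Arithmetic, the same loop with that system's proof checker would decide theoremhood, which is the contradiction used in the text.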

Version (i), even allowing for informality, is somewhat misleading. Since the integer n which 
exists by the folk theorem may itself be very large, it may be the case that there are long theorems 
with very, very long proofs. 

Einstein’s dogma, “God does not play dice with the universe” may be translated from physics 
to mathematics in (at least) two ways: 

(a) Short interesting statements are decidable. 

(b) Short interesting theorems have short proofs. 

Fifty years after Gödel, many mathematicians view (a) as false. (The Axiom of Choice and the
Continuum Hypothesis are short, interesting, and known to be undecidable, but let us restrict
ourselves to statements expressible in Peano Arithmetic.) “Are there an infinite number of primes
of the form \(2^n - 1\)?” may well be undecidable. The advent of computer proofs has focused
attention on (b). A decade ago no counterexamples to (b) were known. Now there are two. The
Four-Color Theorem is the first. The cataloging of Finite Simple Groups (in 5000 pages) is the
second, or rather a second class. Short interesting theorems such as “Every 6-transitive
subgroup of \(S_n\) is \((n - 2)\)-transitive” now have proofs of \(5000 + \varepsilon\) pages. In both cases shorter
proofs are being sought and may eventually be found. Or perhaps not. The possible falsity of (b)
is only now seeping into our mathematical consciousness.

Our purpose here is to consider allocation schemes based on fairness criteria for situations
similar to the one just described. We presume that each sponsor’s willingness to pay for intercity 
travel reflects a component of the benefit the sponsor anticipates from the visit. Other benefit 
measures will not be considered explicitly at this time, but when they are available we would like 
to incorporate them in the analysis. Moreover, until the final section, willingness to pay will be 
assumed to equal the direct round-trip cost between the home city and the sponsor’s city. 

An allocation scheme that seems sensible to many people on first impression is a marginal-cost 
procedure that charges each sponsor at least the incremental cost caused by his participation or, if 
these increments sum to more than the total cost, that charges i more than j if i’s increment 
exceeds j’s. Unfortunately, all such procedures are inconsistent with the minimum conditions 
noted above. 

To be precise, suppose \(n \ge 3\) cities are to be visited in fixed order \((1, 2, \ldots, n)\) before returning
home (city 0). Let \(C\) be the intercity travel cost of this tour, and let \(C_i\) be the travel cost if city \(i\) is
omitted but all others are visited in their original order, so that sponsor \(i\)’s incremental cost is
\(C - C_i\). Then there is no general scheme that (1) allocates exactly \(C\) among the \(n\) sponsors, (2)
charges no sponsor more than the direct round-trip cost between home and that sponsor’s city,
and (3) charges \(i\) more than \(j\) when \(C - C_i\) exceeds \(C - C_j\). Fig. 1 shows why this is so. Since
\(C - C_2\) exceeds the other \(C - C_i\), (1) and (3) require sponsor 2 to pay more than $500. But this
greatly exceeds the $220 round-trip cost between home and city 2, thus violating (2). Consequently,
we must look elsewhere if (1) and (2) are to be honored.


[Figure 1: tour diagram starting and ending at the home city 0; only the labels 400, 400 and $860 survive legibly.]

FIG. 1. Incremental costs.


The actual scheme chosen for the tour described in the first paragraph is what we shall call the
proportional WTP (willingness to pay) scheme. Proportional WTP charges each sponsor in direct
proportion to the cost of going directly from the home city to the sponsor’s city. Thus, if cities
\(1, 2, \ldots, n\) are visited before returning home, if \(C\) is the actual air fare, and if \(c_i\) is the direct cost of
going from home to city \(i\), then \((c_i/\sum c_j)\,C\) is charged to the sponsor at city \(i\). The case described
in the opening paragraph is illustrated (without regard to geography) in Fig. 2, where \(x_i\) is the
cost of going directly from \(i\) to \(i + 1\).

Although proportional WTP is only one among many allocation schemes for situations of this 
type, we find it very attractive and will devote most of the paper to it. We will show that simple 
and appealing conditions for fair allocation in conjunction with either a general separability 
axiom or an additivity axiom lead to specialized classes of schemes which contain proportional 
WTP. A further condition that also can be defended as a criterion of fairness then singles out 
proportional WTP. 

[Figure 2: tour diagram; the surviving label appears to be the denominator \(c_1 + c_2 + c_3 + c_4\) of the proportional charges.]

FIG. 2. Proportional WTP scheme.


In connection with Figs. 1 and 2 it should be emphasized that we are not concerned with 
minimizing total intercity travel costs, as in the familiar traveling salesman problem. Rather, we 
take the travel route as given, whether it is a minimum-cost route or not, and proceed on the basis 
of this fixed route. Various factors other than costs can of course affect the choice of route, so to 
retain flexibility we allow all possibilities that satisfy a version of the triangle inequality for 
intercity costs. 

The fixed-route assumption will be relaxed in the final section, where we mention generalizations
of our basic approach. The formulation for this approach and basic fairness criteria are set
forth in the next section. We then examine the three-city (\(n = 2\)) case in detail before extending
the analysis to larger \(n\).

Although the context of our allocation problem differs from most settings examined in the 
literature, many methods proposed for allocating joint project costs to sponsors or departments 
that benefit from the project are based on proportional and/or incremental schemes. An extensive
review with numerous references is provided by Thomas [13]. The proportional WTP scheme 
described above is similar to what is sometimes referred to as SCRB (separable costs-remaining 
benefits) allocation [4], [9]. Kalai [5] shows how a proportional scheme that resembles our WTP 
proposal arises from simple axioms when allocation is treated as a bargaining game after the 
manner of Nash [8]. Other game-theoretic approaches to allocation, based in part on Shapley’s 
theory [10], are discussed by Shubik [12], Sharkey [11], Littlechild [6] and Young [14], among 
others. Two recent axiomatizations of allocation schemes that are based directly on the factors of 
allocation without embedding these in a game-theoretic formulation are Billera and Heath [1] and 
Mirman and Tauman [7]. Our approach resembles theirs. 


2. Formulation. We assume that an individual travels from his home city (city 0) to cities
\(1, 2, \ldots, n\), in that order, and then returns home. Let \(x_{ij}\) denote the cost of traveling directly from
city \(i\) to city \(j\). It is assumed that all \(x_{ij}\) for \(i \ne j\) are positive, symmetric (\(x_{ij} = x_{ji}\)), and obey the
triangle inequality

\[ x_{ij} + x_{jk} \ge x_{ik}. \]

Although violations of this inequality occur, it holds in most cases and will be an integral factor in
our analysis. A relaxation is mentioned in the final section.

For convenience, we let \(c_i = x_{0i}\), the direct cost from home to city \(i\), with \(c = (c_1, \ldots, c_n)\) the
vector of these costs. In addition, \(x_i\) is defined as \(x_{i,i+1}\), with \(x = (x_1, \ldots, x_{n-1})\). The individual’s
intercity travel costs are

\[ C = c_1 + \sum_{i=1}^{n-1} x_i + c_n. \]

Since \(x_i \le c_i + c_{i+1}\) by the triangle inequality, \(C\) never exceeds the sum of the round-trip costs
between home and the other cities, \(2\sum c_i\). Indeed, \(C = 2\sum c_i\) only when \(x_i = c_i + c_{i+1}\) for all \(i\); a
plausible picture for this has the individual alternately flying back and forth over his home city on
each intermediate leg of his trip.

There is a sponsor at each city visited who is to pay part of \(C\). Sponsors at different cities could
be identical. As in the introductory example, the individual himself could be the sponsor at one or
more cities. We identify the sponsor at city \(i\) with index \(i\), it being understood that an entity who
is the sponsor at several cities pays the sum of the costs charged to those cities.

It seems natural within this formulation to propose that the intercity travel costs charged to \(i\)
depend only on the matrix \([x_{ij}]\) of intercity costs. We shall in fact assume more than this, namely
that the charge to \(i\) depends only on \(c\) and \(x\). Thus, it is presumed that direct travel costs between
non-home cities that are not visited in succession are irrelevant. Although we feel that this is
reasonable in view of the fixed-route context, it does rule out certain incremental allocation
schemes when \(n \ge 3\).


Three Fairness Criteria. Henceforth, let \(f_i(c, x)\) be the intercity travel costs charged to \(i\). Three
conditions on the \(f_i\) that we view as basic fairness criteria and that are similar to conditions
discussed elsewhere [12], [13] are, for all applicable \((c, x)\),

A1. \(f_1(c, x) + \cdots + f_n(c, x) = C\);
A2. \(f_i(c, x) \ge 0\) for all \(i\);
A3. \(f_i(c, x) \le 2c_i\) for all \(i\).

Suppose first that the individual himself is not a sponsor. Then the full-allocation axiom A1
says that he neither makes a profit (\(\sum f_i > C\)) nor subsidizes his sponsors (\(\sum f_i < C\)) for his travel
costs, which is fair both to the sponsors and himself. Condition A2 is a fairness condition among
sponsors. Its failure in the context of A1 would mean that some sponsors get “kickbacks” at the
expense of other sponsors.

Suppose next that the individual is a sponsor. Then Al and A2 do not prevent him from 
getting a free ride to his own sponsor city, but they do imply that he does not “profit” from his 
travel costs. Even though common practice appears to sanction free rides, they will be ruled out 
by later conditions. An exception to this appears in the last paragraph of the paper. 

Axiom A3 is a willingness-to-pay condition. On an ethical level, it reflects the judgment that it
would be unfair to i to charge him more than the round-trip cost between home and city i. 
Moreover, sponsors may simply be unwilling to pay more than the round-trip costs to their cities. 

Since \(C \le 2\sum c_i\) by the triangle inequality, A1, A2 and A3 are simultaneously feasible. When
\(x_i = c_i + c_{i+1}\) for \(i = 1, \ldots, n - 1\), A1 and A3 force \(f_i(c, x)\) to equal \(2c_i\) for \(i = 1, \ldots, n\), so that
these two conditions fully determine the allocation at the upper bound on \(x\).
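The three criteria are easy to test mechanically. The sketch below is an illustration only: the function names and the two-city cost figures are hypothetical, and the proportional WTP rule from the introduction is used as the allocation being checked.

```python
def proportional_wtp(c, x):
    """Charge sponsor i the share c_i / sum(c) of the tour cost C = c_1 + sum(x) + c_n."""
    C = c[0] + sum(x) + c[-1]
    total = sum(c)
    return [ci * C / total for ci in c]

def check_axioms(f, c, x, tol=1e-9):
    """Report which of A1 (full allocation), A2 (nonnegativity), A3 (willingness to pay) hold."""
    C = c[0] + sum(x) + c[-1]
    return {
        "A1": abs(sum(f) - C) <= tol,
        "A2": all(fi >= -tol for fi in f),
        "A3": all(fi <= 2 * ci + tol for fi, ci in zip(f, c)),
    }

# Hypothetical costs: two cities, intermediate leg within the triangle-inequality range [50, 250].
c, x = [150.0, 100.0], [120.0]
f = proportional_wtp(c, x)
print(f, check_axioms(f, c, x))

# At the upper bound x = c_1 + c_2 the charges reach the round-trip costs 2 c_i.
f_max = proportional_wtp(c, [250.0])
print(f_max)
```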


Other Fairness Criteria. We shall impose additional conditions on the \(f_i\) as we examine \(n = 2\)
and then larger \(n\) in the next two sections. These conditions reflect notions of neutrality,
monotonicity, homogeneity and other potential properties of allocation schemes that, for the most
part, can be defended as reasonable fairness criteria.

The \(n = 2\) case will be discussed first since it is easily visualized and embodies many features of
the general case.

3. Two Cities Visited. Our tale of two cities visited begins with two trips, a month or so apart.
The first goes from Atlanta (home) to Boston to Chicago and back home. The second reverses the
tour, but the costs remain the same: see Fig. 3. Because of the reversal, Chicago becomes city 1
and Boston becomes city 2 on the second trip. Fair treatment suggests that the named cities ought
to be charged the same on each trip. In particular, the charge to Boston for the first trip,
\(f_1(c_1, c_2, x_1)\), should be the same as the charge to Boston for the second trip, \(f_2(c_2, c_1, x_1)\), where
\(c_1 = 150\) and \(c_2 = 100\). This anonymity/neutrality condition [2] will be identified as

A4. \(f_2(c_2, c_1, x_1) = f_1(c_1, c_2, x_1)\).


[Figure 3: two panels, TRIP 1 and TRIP 2, each showing the triangle Atlanta (home), Boston, Chicago with home-leg costs 150 and 100, toured in opposite directions.]

FIG. 3. Trips with opposite tours.


For convenience in the rest of this section we shall sometimes write \(f_1(c_1, c_2, x_1)\) as \(f(c_1, c_2, x)\).
That is, \(f(c_1, c_2, x)\) is the charge to sponsor 1 when it costs \(x\) to go from 1 to 2. In working
towards the proportional WTP scheme, where

\[ f(c_1, c_2, x) = \frac{c_1}{c_1 + c_2}\,(c_1 + x + c_2), \]

we pursue two approaches that delineate classes of schemes which contain proportional WTP. The
final step to proportional WTP is taken later in the section.


The First Approach. Our first approach uses only one new condition, a separability axiom for
\(f\). As we shall see, this approach is mathematically efficient but lacks some of the intuitive appeal
provided by the second approach. The new condition is

A5. There are real valued functions \(g\) on \((c, x)\) pairs, and \(h\) and \(k\) on \(c\) vectors, such that
\[ g(c_1, c_2, x) = g(c_2, c_1, x) \]
and
\[ f(c_1, c_2, x) = g(c_1, c_2, x)\,h(c_1, c_2) + k(c_1, c_2). \]

This says that the charge to 1 consists of a term that is independent of \(x\) plus a term that
accounts for \(x\) in a multiplicative decomposition that is partly symmetric in \(c_1\) and \(c_2\). Although
the decomposition of \(f\) in A5 retains a great deal of flexibility, our first theorem shows that
previous conditions severely limit its possibilities.

Before stating Theorem 1, we remark that if A5 is modified by taking \(k = 0\), so that
\(f(c_1, c_2, x) = g(c_1, c_2, x)\,h(c_1, c_2)\), then proportional WTP follows directly from the proof of the
theorem. In this case \(F(c_1, c_2)\) in (*) equals \(c_1/(c_1 + c_2)\).

THEOREM 1. Suppose A1, A3, A4 and A5 hold. Then there is a nonnegative real valued function
\(F\) on \(c\) vectors such that
\[ F(c_1, c_2) + F(c_2, c_1) = 1 \]
and
\[ f(c_1, c_2, x) = 2c_1 - (c_1 + c_2 - x)\,F(c_1, c_2) \tag{*} \]
for all applicable \((c, x)\) pairs.

Expression (*) says that the charge to 1 equals 1’s willingness to pay, minus \(F(c_1, c_2)\) times the
difference between the home-leg costs and the cost of the intermediate leg. Alternatively, for fixed
\(c\), 1 pays an amount \(2c_1 - (c_1 + c_2)F(c_1, c_2)\), independent of \(x\), plus an amount \(xF(c_1, c_2)\) that
is linear and nondecreasing in \(x\). Our second approach assumes the monotonicity aspect for \(x\) and
uses this along with an additivity condition to derive linearity in \(x\).

Given (*), the neutrality condition A4 says that the charge to sponsor 2 is
\(2c_2 - (c_1 + c_2 - x)F(c_2, c_1)\). Since \(F(c_1, c_2) + F(c_2, c_1) = 1\), the sum of the charges is
\(C = c_1 + c_2 + x\), which agrees with A1. Since \(c_1 + c_2 - x \ge 0\) by the triangle inequality, and
\(c_1 + c_2 - x > 0\) for some \(x\), A3 forces \(F\) to be nonnegative. Condition A2, which is not used in
Theorem 1, is easily seen to be implied by the other conditions. That is, nonnegative charges for
the sponsors follow from A1, A3, A4, and A5.
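Two of the claims above reduce to one-line computations, written out here as a worked check. The first line substitutes the proportional choice \(F(c_1, c_2) = c_1/(c_1 + c_2)\) into (*); the second adds the two charges for an arbitrary \(F\) with \(F(c_1, c_2) + F(c_2, c_1) = 1\).

```latex
\begin{align*}
f(c_1, c_2, x) &= 2c_1 - (c_1 + c_2 - x)\,\frac{c_1}{c_1 + c_2}
   = c_1 + \frac{c_1}{c_1 + c_2}\,x
   = \frac{c_1}{c_1 + c_2}\,(c_1 + x + c_2), \\
f(c_1, c_2, x) + f(c_2, c_1, x) &= 2(c_1 + c_2) - (c_1 + c_2 - x)\bigl[F(c_1, c_2) + F(c_2, c_1)\bigr]
   = c_1 + c_2 + x = C .
\end{align*}
```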


Proof of Theorem 1. By A5,
\[ f(c_1, c_2, x) = g(c_1, c_2, x)\,h(c_1, c_2) + k(c_1, c_2), \]
\[ f(c_2, c_1, x) = g(c_1, c_2, x)\,h(c_2, c_1) + k(c_2, c_1). \]
By A1 and A4, \(C = f_1(c_1, c_2, x_1) + f_2(c_1, c_2, x_1) = f(c_1, c_2, x) + f(c_2, c_1, x)\). Addition therefore
yields
\[ c_1 + c_2 + x = g(c, x)\,[h(c_1, c_2) + h(c_2, c_1)] + k(c_1, c_2) + k(c_2, c_1). \]
If \(h(c_1, c_2) + h(c_2, c_1) = 0\), then \(c_1 + c_2 + x\) would not depend on \(x\), which is impossible since
all costs are assumed positive and \(x\) can vary from \(|c_1 - c_2|\) to \(c_1 + c_2\). We therefore solve the
preceding equation for \(g\), and substitute this solution into the first equation of the proof to obtain

\[ f(c_1, c_2, x) = (c_1 + c_2 + x)\,F(c_1, c_2) + G(c_1, c_2), \]
where
\[ F(c_1, c_2) = \frac{h(c_1, c_2)}{h(c_1, c_2) + h(c_2, c_1)}, \]
\[ G(c_1, c_2) = k(c_1, c_2) - [k(c_1, c_2) + k(c_2, c_1)]\,F(c_1, c_2), \]

with \(F(c_1, c_2) + F(c_2, c_1) = 1\). As remarked earlier, A1 and A3 imply \(f(c_1, c_2, c_1 + c_2) = 2c_1\),
and therefore \(2c_1 = 2(c_1 + c_2)F(c_1, c_2) + G(c_1, c_2)\). We solve this for \(G\) and substitute into the
preceding equation for \(f\) to obtain (*). As remarked just before this proof, A3 forces \(F\) to be
nonnegative. ∎


The Second Approach. Our second approach uses the following monotonicity and additivity
axioms:

A6. If \(x \ge x'\), then \(f(c_1, c_2, x) \ge f(c_1, c_2, x')\);
A7. \(f(c_1, c_2, x) + f(c_1, c_2, x') = f(2c_1, 2c_2, x + x')\).

It would hardly seem fair to 2 to decrease 1’s allocation when x increases but the home-to-i 
costs remain fixed, especially if 2 would have to bear this decrease plus the increase in x itself, as 
would be required under Al. Given A4, the monotonicity axiom A6 says that neither sponsor pays 
more if the intermediate cost decreases. 

Additivity, A7, considers three trips. The first two are the same except for their intermediate
costs, \(x\) and \(x'\). The third trip doubles the home-to-i costs of the first two and adds the
intermediate costs of the first two to obtain the intermediate cost of the third. Indeed,
\((2c_1, 2c_2, x + x') = (c_1, c_2, x) + (c_1, c_2, x')\). Under these hypotheses, it seems reasonable to us that sponsor
1 (likewise 2 under A4) should pay the same for the third trip as he would pay for the other two
together.

When \(x = x'\) in A7, we obtain the homogeneity property \(f(2c_1, 2c_2, 2x) = 2f(c_1, c_2, x)\). This
is a special case of general degree-1 homogeneity [\(f(\lambda c_1, \lambda c_2, \lambda x) = \lambda f(c_1, c_2, x)\)] for \(f\). Given
A1, A3, and A4, which are used in Theorem 1 and in our next theorem, it is easily seen that the
conjunction of A5 and \(f(2c_1, 2c_2, 2x) = 2f(c_1, c_2, x)\) is equivalent to the conjunction of A6 and
A7. Moreover, as shown in the ensuing proof of Theorem 2, A6 and A7 alone imply A5. On the
other hand, although A6 is implied by the axioms of Theorem 1, A7 is not since there is nothing in
our first approach which requires \(F(2c_1, 2c_2)\) to equal \(F(c_1, c_2)\). Hence the axioms of the next
theorem are stronger than those of Theorem 1, but only because they entail the limited
homogeneity property.


THEOREM 2. Suppose A1, A3, A4, A6, and A7 hold. Then the conclusions of Theorem 1 hold and,
in addition, \(F(2c_1, 2c_2) = F(c_1, c_2)\).


Proof. Since we assumed initially that all \(x_{ij}\) are positive, we begin the proof of Theorem 2 by
supposing that \(c_1 \ne c_2\), so that the lower bound \(|c_1 - c_2|\) allowed for \(x\) by the triangle inequality
is feasible. With \(c_1\) and \(c_2\) fixed and \(c_1 \ne c_2\), let \(x^0 = |c_1 - c_2|\) and \(x^1 = c_1 + c_2\). In addition, let
\(f^0 = f(c_1, c_2, x^0)\) and \(f^1 = f(c_1, c_2, x^1)\). By A6, \(f^1 \ge f^0\).

Let \(x^2 = (x^0 + x^1)/2\). Then, by three applications of A7,

\[ f(c_1, c_2, x^2) = f(c_1/2, c_2/2, x^0/2) + f(c_1/2, c_2/2, x^1/2) = \tfrac12 f^0 + \tfrac12 f^1. \]

Next, let \(x^3 = (x^0 + x^2)/2 = \tfrac34 x^0 + \tfrac14 x^1\) and \(x^4 = (x^1 + x^2)/2 = \tfrac14 x^0 + \tfrac34 x^1\). Then

\[ f(c_1, c_2, x^3) = \tfrac12 f^0 + \tfrac12 f(c_1, c_2, x^2) = \tfrac34 f^0 + \tfrac14 f^1, \]
\[ f(c_1, c_2, x^4) = \tfrac12 f^1 + \tfrac12 f(c_1, c_2, x^2) = \tfrac14 f^0 + \tfrac34 f^1. \]

Further bisections lead to

\[ f(c_1, c_2, \alpha x^0 + (1 - \alpha)x^1) = \alpha f^0 + (1 - \alpha) f^1 \]

for all \(\alpha\) in a subset of \([0, 1]\) that is dense in \([0, 1]\), and monotonicity then forces
\(f(c_1, c_2, \alpha x^0 + (1 - \alpha)x^1) = \alpha f^0 + (1 - \alpha) f^1\) for all \(\alpha \in [0, 1]\). In other words,

\[ f(c_1, c_2, x) = \frac{x^1 - x}{x^1 - x^0}\, f^0 + \frac{x - x^0}{x^1 - x^0}\, f^1
   = \frac{x^1 f^0 - x^0 f^1}{x^1 - x^0} + x\,\frac{f^1 - f^0}{x^1 - x^0} \]

for all \(x \in [x^0, x^1]\).

If \(c_1 = c_2\), a similar linear form holds for all \(x\) in \([\delta, x^1]\), for any \(0 < \delta < x^1\). Since this is true
no matter how small \(\delta\) is taken, it follows in general that there are functions \(h\) and \(k\) on \(c\) vectors
such that

\[ f(c_1, c_2, x) = x\,h(c_1, c_2) + k(c_1, c_2). \]

Thus, A5 holds, and we obtain the conclusions of Theorem 1 from the hypotheses of Theorem 2.

The final conclusion of Theorem 2 follows immediately from (*) and \(f(2c_1, 2c_2, 2x) = 2f(c_1, c_2, x)\). ∎

Proportional WTP. Henceforth in this section we assume that A1, A3, A4, and either A5 or A6
and A7 hold, so that, with \(x = x_1\),

\[ f_1(c_1, c_2, x) = 2c_1 - (c_1 + c_2 - x)\,F(c_1, c_2), \]
\[ f_2(c_1, c_2, x) = 2c_2 - (c_1 + c_2 - x)\,F(c_2, c_1), \]

where \(F \ge 0\) and \(F(c_1, c_2) + F(c_2, c_1) = 1\). There are two situations in which this form
completely determines the \(f_i\). The first is \(x = c_1 + c_2\), whence \(f_i = 2c_i\) for each \(i\). The second arises
when \(c_1 = c_2\), for then \(F(c_1, c_2) = 1/2\), with \(f_i = c_i + \tfrac12 x\). Thus, given \(x < c_1 + c_2\), the sponsors
split the cost of the intermediate leg 50-50 when their home-to-i costs are equal. How should they
divide \(x\) when \(c_1 \ne c_2\)?

One feasible scheme is to split \(x\) to equalize willingness to pay minus actual charge, or,
equivalently, to minimize \((2c_1 - f_1)^2 + (2c_2 - f_2)^2\). Then \(F = 1/2\) with

\[ f_1(c, x) = (3c_1 + x - c_2)/2, \]
\[ f_2(c, x) = (3c_2 + x - c_1)/2. \]

This gives \(f_i \ge c_i\) but has other consequences that seem questionable. In particular, if \(c_1 < c_2\) and
\(x = c_2 - c_1\), then 1 pays only \(c_1\) while 2 bears the entire cost of the intermediate leg. Thus, if New
York (home) to Detroit costs $100, Detroit to Dallas costs $200, and Dallas to New York costs
$300, then the Detroit sponsor pays $100 and the Dallas sponsor pays $500. Since this seems less
equitable to us than an allocation that charges some of the $200 to Detroit, such as $150 for
Detroit and $450 for Dallas, we do not find it very attractive.
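The New York, Detroit, Dallas figures can be reproduced from the general two-city form \(f_1 = 2c_1 - (c_1 + c_2 - x)F\), \(f_2 = 2c_2 - (c_1 + c_2 - x)(1 - F)\). The short sketch below is illustrative; the function name `charges` is not from the paper.

```python
def charges(c1, c2, x, F):
    """Charges under the two-city form: sponsor 1 uses weight F, sponsor 2 uses 1 - F."""
    slack = c1 + c2 - x                # round-trip costs saved by the combined tour
    return 2 * c1 - slack * F, 2 * c2 - slack * (1 - F)

c1, c2, x = 100, 300, 200              # New York-Detroit, New York-Dallas, Detroit-Dallas
equal_split = charges(c1, c2, x, 0.5)                 # the F = 1/2 scheme
proportional = charges(c1, c2, x, c1 / (c1 + c2))     # proportional WTP, F = c1 / (c1 + c2)
print(equal_split, proportional)       # (100.0, 500.0) (150.0, 450.0)
```

With \(F = 1/2\) Detroit pays exactly its one-way home cost, whereas the proportional weight \(F = 1/4\) yields the $150 and $450 split mentioned above.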

Given \(c_1 \ne c_2\) and the general solution in terms of \(F\), it should be clear that if \(f_1(c_1, c_2, x)\) is
specified for any one feasible \(x < c_1 + c_2\), then \(F(c_1, c_2)\) is determined. For example, the
particular solution in the preceding paragraph is fully characterized by \(f_1(c_1, c_2, c_2 - c_1) = c_1\)
whenever \(c_1 < c_2\). In similar fashion, the proportional WTP solution is characterized by
\(f_1(c_1, c_2, c_2 - c_1) = 2c_1c_2/(c_1 + c_2)\) whenever \(c_1 < c_2\), for this gives \(F(c_1, c_2) = c_1/(c_1 + c_2)\)
and

\[ f_1(c_1, c_2, x) = c_1 + \Bigl(\frac{c_1}{c_1 + c_2}\Bigr)x, \qquad
   f_2(c_1, c_2, x) = c_2 + \Bigl(\frac{c_2}{c_1 + c_2}\Bigr)x. \]

Moreover, if \(c_1 > c_2\), then \(f_1(c_2, c_1, c_1 - c_2) = 2c_1c_2/(c_1 + c_2)\) implies that
\(F(c_2, c_1) = c_2/(c_1 + c_2)\), and again we get \(F(c_1, c_2) = c_1/(c_1 + c_2)\) since \(F(c_1, c_2) + F(c_2, c_1) = 1\).

If, given \(c_1 < c_2\) and \(c_2 - c_1\) as the cost of the intermediate leg, it is generally felt that the most
equitable division of this cost between the sponsors is in direct proportion to their \(c_i\), then it
follows that proportional WTP is the most equitable particular solution to our general class of \(F\)
solutions. Although we believe that such a division is quite reasonable, we do not presently have a
compelling argument that it is the most equitable division and would invite others to consider the
matter.

Instead of using a specific value of \(f_1(c_1, c_2, x)\) for \(x < c_1 + c_2\) as a basis for selecting
proportional WTP from the myriad possibilities of the class of \(F\) solutions, one can use a
derivative argument at the upper extreme of \(x\), where \(x = c_1 + c_2\). We have noted that \(f_i = 2c_i\)
when \(x = c_1 + c_2\). Suppose we agree that for \(x\) slightly less than \(c_1 + c_2\), each sponsor should pay
less than \(2c_i\), and that their reductions from \(2c_i\) should be approximately proportional to what
they pay at \(x = c_1 + c_2\), with the proportions becoming exact as \(x\) approaches \(c_1 + c_2\). Then,
since the ratio of their reductions at \(x = c_1 + c_2 - \varepsilon\) is \(F(c_1, c_2)/F(c_2, c_1)\), and the ratio of what
they pay at \(x = c_1 + c_2\) is \(c_1/c_2\), we get \(F(c_1, c_2)/F(c_2, c_1) = c_1/c_2\) and \(F(c_1, c_2) = c_1/(c_1 + c_2)\).

Integration offers a third approach to proportional WTP. If the average charges to the sponsors
over all \(x \in [|c_1 - c_2|, c_1 + c_2]\) are assumed to be proportional to what they pay when \(x = c_1 + c_2\),
then again we find that \(F(c_1, c_2) = c_1/(c_1 + c_2)\).
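The integration argument can be made explicit. In the sketch below, \(\bar f_i\) denotes the average charge to sponsor \(i\) over the feasible interval and \(m = \min(c_1, c_2)\); both symbols are introduced here for convenience and do not appear in the original. The interval has length \(2m\), and the average of \(c_1 + c_2 - x\) over it is \(m\).

```latex
\begin{align*}
\bar f_1 &= \frac{1}{2m} \int_{|c_1 - c_2|}^{c_1 + c_2}
      \bigl[\, 2c_1 - (c_1 + c_2 - x)\,F(c_1, c_2) \,\bigr]\, dx
   = 2c_1 - m\,F(c_1, c_2), \\
\bar f_2 &= 2c_2 - m\,F(c_2, c_1), \\
\frac{\bar f_1}{\bar f_2} = \frac{2c_1}{2c_2}
  &\;\Longrightarrow\; c_2\,F(c_1, c_2) = c_1\,F(c_2, c_1)
  \;\Longrightarrow\; F(c_1, c_2) = \frac{c_1}{c_1 + c_2},
\end{align*}
```

where the last step uses \(F(c_1, c_2) + F(c_2, c_1) = 1\).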

Thus, we see that several distinctly different conditions lead to proportional WTP. However, 
the ones we have identified are all phrased directly in terms of proportions. Can less obvious and 
perhaps more convincing conditions be found? 


4. Many Cities. Several interesting features of our basic allocation problem appear only when
\(n \ge 3\). One of these is the possibility that \(C\) is substantially smaller than \(\sum c_i\) so that, unlike the
case for \(n = 2\), we do not think of assigning \(c_i\) to sponsor \(i\) as his minimum charge. Related to this
is the obvious fact that the individual does not travel directly between home and city \(i\) for
\(2 \le i \le n - 1\), so that there might be a temptation to treat 1 and \(n\) differently than the
intervening sponsors. If one \(x_i\) increases but everything else remains fixed, should the \(f_j\) for \(j\) not
adjacent to \(i\) change? If an intermediate \(c_i\) increases but all else, including \(C\), is unchanged, should
the \(f_j\) change in specified directions?

As explained earlier, our present discussion is motivated by the proportional WTP scheme in
which the allocation to sponsor \(i\) is

\[ f_i(c, x) = \frac{c_i}{c_1 + \cdots + c_n}\,(c_1 + x_1 + \cdots + x_{n-1} + c_n). \]

In this case an increase in any \(x_j\) will increase all \(f_i\), an increase in \(c_i\) for \(2 \le i \le n - 1\) will
increase \(f_i\) and decrease \(f_j\) for all \(j \ne i\) (\(C\) remaining unchanged), while an increase in \(c_1\) will
increase \(f_1\) but can either increase or decrease the other \(f_j\) depending on the sign of \(\sum c_i - C\). Fig. 4
presents a case in which a $100 increase in \(c_1\) increases \(f_1\) by more than $100 and decreases each of
\(f_2\) and \(f_3\) by a few dollars.


[Figure 4: two tour diagrams, before and after \(c_1\) increases from 400 to 500. Charges before: \(f_1 = 436\), \(f_2 = 327\), \(f_3 = 436\). Charges after: \(f_1' = 542\), \(f_2' = 325\), \(f_3' = 433\).]

FIG. 4. A change in \(c_1\); proportional WTP.
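The charges printed in Fig. 4 can be reproduced with a few lines of code. The inputs below are a reading of the damaged figure, not data stated in the text: home-leg costs \(c = (400, 300, 400)\) and intermediate legs \(x = (200, 200)\) are assumed because they give exactly the printed values after rounding to whole dollars.

```python
def proportional_wtp(c, x):
    """Proportional WTP: sponsor i pays c_i / sum(c) of the tour cost C = c_1 + sum(x) + c_n."""
    C = c[0] + sum(x) + c[-1]
    return [ci * C / sum(c) for ci in c]

x = [200, 200]                                     # intermediate legs (inferred from the figure)
before = proportional_wtp([400, 300, 400], x)      # C = 1200, sum(c) = 1100
after = proportional_wtp([500, 300, 400], x)       # c_1 raised by 100: C = 1300, sum(c) = 1200
print([round(v) for v in before], [round(v) for v in after])
# [436, 327, 436] [542, 325, 433]
```

Here \(\sum c_i - C\) is negative before the change, which is why the charges to sponsors 2 and 3 fall while the charge to sponsor 1 rises by more than $100.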


Two Approaches. Our analysis of the n-city case will be similar to the m = 2 analysis of the 
preceding section. In place of A4 and A5 for the first approach for n = 2, we now use a single 
separability condition which directly involves all sponsors. Two versions of the separability 
condition will be considered. We state the stronger version first. 


A5°. There are real valued functions g on (c, x) pairs, and h, on c vectors for i = 1,...,n, 
such that 


fi(ce,x)=g(c,x)hj(c) (i= 1,...,7n). 
THEOREM 3. Axioms Al, A3 and A5° imply proportional WTP. 


Thus, A5° is a very strong condition. A weaker version, which retains the flexibility of A5, is 


A5*. There are real valued functions $g$ on $(c, x)$ pairs, and $h_i$ and $k_i$ on $c$ vectors for 
$i = 1,\ldots,n$, such that 

\[ f_i(c, x) = g(c, x)\,h_i(c) + k_i(c) \qquad (i = 1,\ldots,n). \]

A sense of neutrality is imparted by $g$, which operates on $(c, x)$ in the same way for all $i$, and is 
the only part of the decomposition that involves $x$. 


THEOREM 4. Suppose A1, A3 and A5* hold. Then there are nonnegative real valued functions $F_i$ 
on $c$ vectors for $i = 1,\ldots,n$ such that 

\[ \sum_i F_i(c) = 1 \]

and 

\[ (**) \qquad f_i(c, x) = 2c_i - \Bigl(2\sum_j c_j - C\Bigr) F_i(c) \qquad (i = 1,\ldots,n) \]

for all applicable $(c, x)$ pairs. 


Proofs. The proof of Theorem 4 is similar to the proof of Theorem 1. Add the $f_i$ using A5*, 
solve for $g = [C - \sum k_j]/\sum h_j$ with the aid of A1, then use the extreme solution at $x_j = c_j + c_{j+1}$ 
to obtain (**), where $F_i(c) = h_i(c)/\sum h_j(c)$. Clearly, $\sum F_i = 1$, and $F_i \ge 0$ by (**) and A3. 

When $k_i = 0$ (Theorem 3), $g = C/\sum h_j$, so $f_i(c, x) = (h_i/\sum h_j)C$. With each $x_j$ set at $c_j + c_{j+1}$, 
$f_i = 2c_i = (h_i/\sum h_j)(2\sum c_j)$, so $h_i/\sum h_j = c_i/(c_1 + c_2 + \cdots + c_n)$. ∎ 

As written in Theorem 4, (* *) says that the charge to i equals his willingness to pay, minus a 
fraction of the total difference between all sponsors’ willingness to pay and the actual cost C of 
intercity travel. We noted earlier that (**) yields nonnegative $f_i$ when $n = 2$. However, axiom A2 
is not automatically satisfied by the solution in Theorem 4 when $n \ge 3$. 

Although the axioms used for Theorem 4 do not imply the proportional WTP scheme, they 
imply the phenomenon illustrated for sponsor 1 in Fig. 4. Assume the conclusions of Theorem 4 
hold and, in addition, suppose that an increase of $\delta$ in $c_1$ (or $c_n$), all else unchanged, does not 
increase $f_1$ (or $f_n$) by more than $\delta$. That is, we propose $\partial f_1/\partial c_1 \le 1$ and $\partial f_n/\partial c_n \le 1$. Since 

\[ \partial f_1/\partial c_1 = 2 - \Bigl[2\sum c_j - C\Bigr]\,\partial F_1(c)/\partial c_1 - F_1(c), \]

and since $C$ can be made to equal $2\sum c_j$, $\partial f_1/\partial c_1 \le 1$ implies that $F_1(c) \ge 1$. Similarly, 
$\partial f_n/\partial c_n \le 1$ implies $F_n(c) \ge 1$. But $F_1 + F_n \ge 2$ clearly contradicts the conclusions of the 
theorem. 

Our second approach for the n-city case uses axioms that are similar to A6 and A7. These 
axioms do not yield as specific a form as (**) and hence do not imply A5* when $n \ge 3$, but 
neither are they implied by A5*, since the homogeneity implication of A7* (that $f_i(2c, 2x) = 
2f_i(c, x)$) does not follow from A5*. In A6*, $x \ge x'$ means that $x_j \ge x_j'$ for all $j \le n-1$: 

A6*. If $x \ge x'$, then $f_i(c, x) \ge f_i(c, x')$ for $i = 1,\ldots,n$; 
A7*. $f_i(c, x) + f_i(c, x') = f_i(2c, x + x')$ for $i = 1,\ldots,n$. 


THEOREM 5. Suppose A1, A3, A6* and A7* hold. Then there are nonnegative real valued 
functions $F_{ij}$ on $c$ vectors for $i = 1,\ldots,n$ and $j = 1,\ldots,n-1$, such that 

\[ \sum_{i=1}^{n} F_{ij}(c) = 1 \qquad (j = 1,\ldots,n-1) \]

and 

\[ f_i(c, x) = 2c_i - \sum_{j=1}^{n-1} (c_j + c_{j+1} - x_j)\,F_{ij}(c) \qquad (i = 1,\ldots,n) \]

for all applicable $(c, x)$ pairs. 


For $n \ge 3$, the conclusions of Theorem 5 imply those of Theorem 4 only if $F_{i1} = F_{i2} = \cdots = 
F_{i,n-1}$ for all $i$. The different forms for $f_i$ in Theorems 4 and 5 will be dealt with further in the 
next subsection. A proof of Theorem 5 will be presented elsewhere [3]. 


Proportional WTP. We conclude our examination of the basic case by noting how the 
proportional WTP scheme follows from the forms for $f_i$ in Theorems 4 and 5 through a derivative 
condition at the upper $x$ extreme. 

Recall that $x_i$ is the cost of going from city $i$ to city $i+1$, for $1 \le i \le n-1$. Suppose that, 
beginning at $x_j = c_j + c_{j+1}$ for each $j$, the marginal reductions in the charges to $i$ and $i+1$ that 
accrue from a reduction in $x_i$ are proportional to what $i$ and $i+1$ paid initially: 

\[ \left.\frac{\partial f_i(c, x)/\partial x_i}{\partial f_{i+1}(c, x)/\partial x_i}\right|_{x_j = c_j + c_{j+1}} = \left.\frac{f_i(c, x)}{f_{i+1}(c, x)}\right|_{x_j = c_j + c_{j+1}} \qquad (i = 1,\ldots,n-1). \]

Given A1, A3, and A5*, this implies proportional WTP. For, given (**), our derivative condition 
says that 

\[ \frac{F_i(c)}{F_{i+1}(c)} = \frac{c_i}{c_{i+1}} \qquad \text{for } i = 1,\ldots,n-1, \]

and this along with $\sum F_i = 1$ implies that $F_i(c) = c_i/\sum c_j$ for $i = 1,\ldots,n$. 

The preceding derivative condition does not imply proportional WTP in the context of 
Theorem 5, since there it implies $F_{ii}(c)/F_{i+1,i}(c) = c_i/c_{i+1}$, which is not sufficient to conclude 
that $F_{ij}(c) = c_i/\sum c_k$ for all $i$ and $j$. However, if we assume that, beginning at $x_j = c_j + c_{j+1}$ for 
each $j$, the marginal savings to $i$ and $i+1$ that accrue from a reduction in any $x_j$ are proportional 
to what $i$ and $i+1$ paid initially, then proportional WTP does follow from the form for $f_i$ in 
Theorem 5. This stronger derivative condition gives $F_{ij}(c)/F_{i+1,j}(c) = c_i/c_{i+1}$ for all $i, j \in 
\{1,\ldots,n-1\}$, and the use of $\sum_i F_{ij} = 1$ then yields $F_{ij}(c) = c_i/\sum c_k$ for all $i$ and $j$. 

Proportional WTP also follows from the axioms of Theorem 5 (A1, A3, A6*, A7*) and the 
weaker derivative condition provided that we augment these axioms with an assumption that 
implies $F_{ij}(c) = F_{ik}(c)$ for all relevant $i$, $j$, and $k$. The most obvious assumption for this purpose 
is $f_i(c, x) = f_i(c, x')$ whenever $x$ and $x'$ are identical in all except two places, say $j$ and $k$, where 
$x_j + x_k = x_j' + x_k'$. The form for $f_i$ in Theorem 5 then gives $F_{ij}(c) = F_{ik}(c)$, so that that form 
reduces to (**). 


5. Summary and Generalizations. We have seen that conditions for equitable allocation (including 
full-allocation, willingness-to-pay, monotonicity and additivity) imply that the charge to 
sponsor $i$ is 

\[ f_i(c, x) = 2c_i - \sum_{j=1}^{n-1} (c_j + c_{j+1} - x_j)\,F_{ij}(c) \]

with $F_{ij} \ge 0$ and $\sum_i F_{ij}(c) = 1$ for each $j$. Here $c = (c_1,\ldots,c_n)$, $x = (x_1,\ldots,x_{n-1})$, $c_i$ is the direct 
cost from home to city $i$, and $x_i$ is the cost from city $i$ to city $i+1$. A weak separability axiom in 
place of monotonicity and additivity yields the alternative form 

\[ f_i(c, x) = 2c_i - \Bigl(2\sum_j c_j - C\Bigr) F_i(c), \]

where $C = c_1 + x_1 + x_2 + \cdots + x_{n-1} + c_n$, the total travel cost. An additional boundary-value 
condition implies 

\[ f_i(c, x) = \Bigl(\frac{c_i}{\sum_j c_j}\Bigr) C, \]

the proportional WTP scheme. 

We conclude with brief remarks on three generalizations, the first two of which are analyzed in 
greater detail in [3]. We take $n \ge 3$, as in section 4, and let $x = [x_{ij}]$, as in section 2. It is 
presumed that allocations depend only on $c$ and the $x_{ij}$ involved in the tour under consideration. 

(i). Suppose first that the triangle inequality for costs is replaced by the less-demanding 
inequality 


n-I,n 


Then, because of the greater variety in $x$ that is allowed by this relaxation, the axioms of Theorem 
5 imply the stronger conclusion of Theorem 4. In addition, the final step to proportional WTP 
follows from the appealing condition that $f_i(c, x) = c_i$ for all $i$ when $C = \sum_i c_i$. 

(ii). Suppose next that the triangle inequality applies and all $n!$ tours are considered. A tour is a 
permutation $\sigma$ on $\{1, 2, \ldots, n\}$: $\sigma(i)$ is the $i$th city visited in tour $\sigma$. The cost of $\sigma$ is 

\[ C(\sigma, x) = c_{\sigma(1)} + \sum_{i=1}^{n-1} x_{\sigma(i)\sigma(i+1)} + c_{\sigma(n)}, \]

and the charge to $i$ for tour $\sigma$ is $f_i^{\sigma}(c, x)$. Here we adopt an axiom which says that the set of 
minimum-cost tours is precisely the Pareto optimal set, thus providing an incentive for sponsors to 
adopt a least-cost tour. When this axiom is applied to the $\sigma$-dependent conclusions of Theorem 5, 
it follows that there are nonnegative $F_i(c)$ which sum to 1 such that 

\[ f_i^{\sigma}(c, x) = 2c_i - F_i(c)\Bigl[2\sum_j c_j - C(\sigma, x)\Bigr] \]

for all applicable $\sigma$, $i$, $c$ and $x$. Note in particular that $F_i$ does not depend on $\sigma$. Hence if 
proportional WTP holds for at least one tour, then it holds for all tours. 

(iii). Finally, consider the situation of the preceding paragraph with $2c_i$ replaced by $w_i > 0$ as 
sponsor $i$'s indicated willingness to pay. We assume that $\sum w_i$ exceeds a specified minimum value 
that depends on the $c_i$, and that the triangle inequality holds. Let 

\[ W = \sum w_i \quad \text{and} \quad T(x) = \{\sigma : C(\sigma, x) \le W\}. \]

Then $T(x)$ is the set of cost-recoverable tours when $w$ and $x$ apply. We work only with $\sigma \in T(x)$. 

This case breaks down into two subcases, according to whether $W \le 2\sum c_i$ or $W > 2\sum c_i$. The 
first of these is amenable to modifications of the conditions used in generalization (ii), which lead 
to 

\[ f_i^{\sigma}(c, x) = \frac{w_i}{W}\, C(\sigma, x) \]

as the appropriate proportional WTP form. Similar modifications for the “generous-sponsors” 
subcase, in which $W > 2\sum c_i$, yield the less-specific form 

\[ f_i^{\sigma}(c, x) = F_i(c)\,C(\sigma, x), \qquad F_i(c) \le w_i / 2\sum c_j. \]

Although proportional WTP is feasible for this subcase, it is not uniquely identified by the types 
of axioms that characterize proportional WTP for the other generalizations. 


References 


1. L. J. Billera and D. C. Heath, Allocation of shared costs: A set of axioms yielding a unique procedure, 
Mathematics of Operations Research, 7 (1982) 32-39. 

2. D. E. Campbell and P. C. Fishburn, Anonymity conditions in social choice theory, Theory and Decision, 12 
(1980) 21-39. 

3. P. C. Fishburn and H. O. Pollak, Proportional allocation schemes, mimeographed, Bell Laboratories, 1982. 

4. L. D. James and R. R. Lee, Economics of Water Resource Planning, McGraw-Hill, New York, 1971. 

5. E. Kalai, Proportional solutions to bargaining situations: interpersonal utility comparisons, Econometrica, 45 
(1977) 1623-1630. 

6. S. C. Littlechild, Common costs, fixed charges, clubs and games, Review of Economic Studies, 42 (1975) 
117-124. 

7. L. J. Mirman and Y. Tauman, Demand compatible equitable cost sharing prices, Mathematics of Operations 
Research, 7 (1982) 40-56. 

8. J. F. Nash, The bargaining problem, Econometrica, 18 (1950) 155-162. 

9. J. S. Ransmeier, The Tennessee Valley Authority: A Case Study in the Economics of Multiple Purpose 
Stream Planning, Vanderbilt Univ. Press, Nashville, Tenn., 1942. 

10. L. S. Shapley, A value for n-person games, in H. W. Kuhn and A. W. Tucker (Editors), Contributions to the 
Theory of Games, vol. II, Princeton Univ. Press, Princeton, N.J., 1953, pp. 307-317. 

11. W. W. Sharkey, Suggestions for a game-theoretic approach to public utility pricing, Bell Laboratories 
Economic Discussion Paper 61, 1974. 

12. M. Shubik, Incentives, decentralized control, the assignment of joint costs and internal pricing, Management 
Science, 8 (1962) 325-343. 

13. A. L. Thomas, A Behavioral Analysis of Joint-Cost Allocation and Transfer Pricing, Stipes Publishing Co., 
1980. 

14. H. P. Young, Cost allocation and demand revelation in public enterprises, International Institute of Applied 
Systems Analysis Working Paper WP-80-130, 1980. 


DO SYMMETRIC PROBLEMS HAVE SYMMETRIC SOLUTIONS? 


WILLIAM C. WATERHOUSE 
Department of Mathematics, The Pennsylvania State University, University Park, PA 16802 


There are many problems where the function under study is symmetric in several variables and 
its maximum or minimum occurs when the variables are equal. Here are some examples: 


(1) Of all rectangles with given perimeter, the square has the largest area. 

(2) The triangle of minimum area circumscribed about a given circle is equilateral. 

(3) Take four positive numbers whose product is 16. Then their sum is at least 8. (That is, the 
sum is least when all four numbers are equal.) 

(4) A box is to be made from sheet metal, with rectangular bottom and sides but no top. Then 
the design using least material has a square bottom and height half the width. (To reduce this to a 
symmetric situation, put another copy of the box upside down on top of it.) 

(5) Determine the minimum value of 

\[ (r-1)^2 + \Bigl(\frac{s}{r}-1\Bigr)^2 + \Bigl(\frac{t}{s}-1\Bigr)^2 + \Bigl(\frac{4}{t}-1\Bigr)^2 \]

for all real numbers $r, s, t$ with $1 \le r \le s \le t \le 4$. (This is a problem from the 1981 Putnam 
Competition; the minimum comes when $(r, s, t) = (\sqrt{2}, 2, 2\sqrt{2})$. The problem becomes symmetric 
if we introduce new variables $r$, $s/r$, $t/s$, and $4/t$.) 

(6) For a given mean $\bar{x} = (\sum_1^n x_i)/n$, the value $\sum_1^n x_i^2$ is least when all $x_i$ are equal. 

(7) For any positive $x_1,\ldots,x_n$ we have $(\sum_1^n x_i)/n \ge (x_1 \cdots x_n)^{1/n}$ (the inequality of arithmetic 
and geometric means, a generalization of (3) above). 


It seems clear that some general principle must be lurking here, and I propose to call it the 
Purkiss Principle. This name honors one of the authors who (independently) noticed this principle 
in the middle of the last century. Their discussions were somewhat inconclusive and seem to have 
been completely forgotten; I have not come across any recent references to the idea. Probably this 
is because the principle is not true without qualifications. But still, there is something to it. 


William C. Waterhouse studied at Harvard, receiving his Ph.D. in 1968 under the guidance of John Tate. He 
taught at Cornell until 1975, when he moved to Penn State. His research has been mainly in algebra, algebraic 
geometry, and algebraic number theory. He has also published an Introduction to Affine Group Schemes and several 
articles on the history of mathematics. 

The first person to notice the general pattern behind the specific results was a Frenchman, O. 
Terquem [13]; the year was 1840. Terquem, who was then nearly sixty years old, was more a man 
of scholarship than a creative mathematician, and he is perhaps best remembered for his 
Voltairean credo, “I count all honest men as true believers, and only scoundrels as heretics.” 
(More information about him and our other authors can be found in the biographical appendix 
below.) Terquem’s paper was just a one-page note, and it was prompted by inequalities (6) and (7) 
above, which occur (at different places) in Cauchy’s Cours d’Analyse. He considered the general 
principle obvious, saying that if the numbers $x_i$ “all remain positive, and a given symmetric 
function preserves a given value, it is evident that every other symmetric function... will attain an 
extreme value, maximum or minimum, when all the [numbers] become equal.” His main concern 
then was to distinguish the cases of maximum and minimum, which he did by considering the 
values that occur when one or more of the numbers go to zero. 

Terquem’s reaction seems to be a very natural one. That is, as soon as the symmetry of a 
problem is brought out, people are inclined to say that “by symmetry” the extreme value must 
occur when the variables are equal. But a bit of thought shows that there is no simple symmetry 
argument to this effect. Indeed, there cannot be, because such a symmetry conclusion is sometimes 
false. This was the discovery of our next author, the Russian V. Bouniakovsky [2], in 1854. Unlike 
the other two, Bunyakovskii (to give another common transliteration of his name) was a major 
mathematician, and there is no problem with his results. He looked at a single symmetric 
polynomial, asking whether its maxima or minima (if such exist) occur at points where the 
variables are equal. For polynomials of degree at most three he proved this is true, but in higher 
degrees he gave counterexamples. His basic example is neat and simple, 


\[ f(x, y) = [x^2 + (y-1)^2]\,[(x-1)^2 + y^2]. \]


Obviously this is symmetric, and obviously also it has its minimum value at (1,0) and (0, 1) rather 
than at any point where x = y. 

Bouniakovsky did not deal directly with the constrained extremum problems that were 
Terquem’s concern (and ours), but it is not too hard to find counterexamples in that context as 
well. Take for instance the symmetric function 


\[ f(x, y) = (x^2 + y^2 - 5/8)^2, \]

and consider its values for positive $x$ and $y$ satisfying $x + y = 1$. At the equality point $x = y = 1/2$ 
we get $f(1/2, 1/2) = (1/8)^2$. But $f$ also takes on values larger and smaller than this, for instance 
$f(1/8, 7/8) = (5/32)^2$ and $f(1/4, 3/4) = 0$. Thus it would seem that our principle simply isn’t 
true. 
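
The three values quoted for this counterexample are easy to confirm in exact arithmetic. The following is only a checking sketch; the helper name `f` mirrors the text, and nothing here goes beyond the formula $f(x, y) = (x^2 + y^2 - 5/8)^2$.

```python
from fractions import Fraction as F


def f(x, y):
    """The symmetric function (x^2 + y^2 - 5/8)^2, evaluated exactly."""
    return (x * x + y * y - F(5, 8)) ** 2


# All three sample points lie on the constraint line x + y = 1.
at_equal = f(F(1, 2), F(1, 2))   # the point where the variables are equal
larger = f(F(1, 8), F(7, 8))
smaller = f(F(1, 4), F(3, 4))

print(at_equal, larger, smaller)  # 1/64 25/1024 0
```

Since $1/64 = 16/1024$ lies strictly between $0$ and $25/1024$, the equality point is neither the constrained maximum nor the constrained minimum.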

Still, it is hard to believe that all our original examples had symmetric extrema just by chance. 
It is time, therefore, that we turned to our third author, the Englishman H. J. Purkiss, who 
published his observation in 1862 in the Messenger of Mathematics [9]. This was then a new 
journal designed for English undergraduates, and Purkiss was one of the founders; at that time he 
was in fact a student at Trinity College, Cambridge, and was just twenty years old. Unlike 
Terquem, he realized that his observation required proof. Needless to say, his argument was 
unsatisfactory, but it has one significant feature: it deals only with small variations away from 
equality of the variables. Though Purkiss himself slurred over the distinction between relative and 
absolute maxima, his argument is the first indication of a basic fact: whatever validity the 
principle has will be local. For example, our function $(x^2 + y^2 - 5/8)^2$ on the line $x + y = 1$ 
does indeed have a local maximum at $x = y = 1/2$, even though it reaches greater values far away 
from that point. Thus we can now give a more precise formulation: 


THE PURKISS PRINCIPLE. Let $f(x_1,\ldots,x_n)$ and $g(x_1,\ldots,x_n)$ be two symmetric functions. On the 
set where $g$ stays equal to $g(r,\ldots,r)$, the function $f$ should have a local maximum or minimum at 
$(r,\ldots,r)$. 

Of course we do not yet have any idea why, or indeed whether, the principle should be true. All 
we know (from Bouniakovsky) is that no rudimentary symmetry argument will prove it. But at 
least we can try testing it by the general method to test for a constrained extremum at a point $P$. 
The classical way to remember this “Lagrange multiplier” method is to think of curves $c(t)$ that 
have $c(0) = P$ and lie in the set where $g$ is constant. Differentiating the condition $g(c(t)) =$ 
constant, we find that 

\[ 0 = \sum_i (D_i g)(P)\, c_i'(0), \]

which says that the curve’s tangent vector $c'(0)$ is perpendicular to the gradient vector $(\nabla g)(P)$ 
formed by the $D_i(g) = \partial g/\partial x_i$. Conversely, so long as $(\nabla g)(P)$ is nontrivial, the implicit function 
theorem shows that every vector perpendicular to $(\nabla g)(P)$ occurs as the tangent to some such 
curve $c(t)$. Now if $f$ has a local (constrained) extremum at $P$, each $f(c(t))$ has an extremum at 0, 
and its derivative vanishes there. Hence we have 

\[ 0 = \sum_i (D_i f)(P)\, c_i'(0). \]

This says that $(\nabla f)(P)$ is also perpendicular to all the $c'(0)$, which implies that 

\[ (\nabla f)(P) \text{ is a scalar multiple of } (\nabla g)(P). \]

Points where this condition holds (together with points where $\nabla g = 0$) are the critical points for 
the constrained extremum problem. 

All this is familiar in general. Does something special happen when $f$ and $g$ are symmetric? Let 
us work out an example, say 

\[ f(x_1, x_2, x_3) = x_1 x_2 + x_2 x_3 + x_3 x_1, \qquad g(x_1, x_2, x_3) = (x_1 x_2 x_3)^2. \]

The gradients of these are 

\[ \nabla f = (x_2 + x_3,\; x_1 + x_3,\; x_1 + x_2), \qquad \nabla g = 2 x_1 x_2 x_3\,(x_2 x_3,\; x_1 x_3,\; x_1 x_2). \]

We are interested in points $P$ of the form $(r, r, r)$, and there we have 

\[ \nabla f(r, r, r) = (2r, 2r, 2r) = 2r(1, 1, 1), \qquad \nabla g(r, r, r) = (2r^5, 2r^5, 2r^5) = 2r^5(1, 1, 1). \]

Thus $\nabla f$ is a multiple of $\nabla g$ simply because in each of them all the entries are equal. Once we see 
that this is what we want to prove, we have no difficulty proving it. 


LEMMA 1. Suppose $f(x_1,\ldots,x_n)$ is a symmetric differentiable function. Then at a point $x_1 = \cdots 
= x_n = r$, all the $D_i f$ are equal. 

Proof. Let $\pi$ be a permutation of $1, 2, \ldots, n$. For any function $h$ we can define 

\[ h_\pi(x_1,\ldots,x_n) = h(x_{\pi(1)},\ldots,x_{\pi(n)}), \]

and trivially then 

\[ (D_{\pi(i)} h_\pi)(x_1,\ldots,x_n) = (D_i h)(x_{\pi(1)},\ldots,x_{\pi(n)}). \]

In the present case all $f_\pi$ equal $f$ and all $x_i$ are equal. ∎ 
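
Lemma 1 can be illustrated numerically on the example worked out above. The sketch below approximates the partial derivatives of $f = x_1x_2 + x_2x_3 + x_3x_1$ and $g = (x_1x_2x_3)^2$ by central differences at a point $(r, r, r)$; the helper `gradient`, the step size `h`, and the choice $r = 1.5$ are illustrative assumptions, not part of the original argument.

```python
def f(x):
    # f = x1*x2 + x2*x3 + x3*x1, the symmetric example from the text
    return x[0] * x[1] + x[1] * x[2] + x[2] * x[0]


def g(x):
    # g = (x1*x2*x3)^2, the symmetric constraint from the text
    return (x[0] * x[1] * x[2]) ** 2


def gradient(fn, point, h=1e-6):
    """Approximate each partial derivative D_i fn by a central difference."""
    grad = []
    for i in range(len(point)):
        up = list(point)
        down = list(point)
        up[i] += h
        down[i] -= h
        grad.append((fn(up) - fn(down)) / (2 * h))
    return grad


r = 1.5
grad_f = gradient(f, [r, r, r])   # each entry should be close to 2r
grad_g = gradient(g, [r, r, r])   # each entry should be close to 2r^5

print(grad_f)
print(grad_g)
```

Up to rounding, all entries of each gradient agree, so the two gradients are parallel at $(r, r, r)$ as the lemma predicts.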

With hindsight (and a little charity) we might say that Purkiss also managed to show that the 
points (r,..., 7) are critical points. But then he undeniably fell into an error. When there is only 
one free variable, a critical point is “in general” a local maximum or minimum; that is, it will be 
one or the other except for degenerate cases where the second derivative vanishes. But in several 
variables this is not true, and nondegenerate critical points can equally well be saddle points. 
Purkiss simply ignored this possibility, and thus he left a major gap in his argument. Yet it must 
be admitted that in our original examples we did not in fact encounter any saddle points. Support 
for the principle can also be drawn from George Chrystal, who in 1889 included it in Part 2 of his 
famous textbook on algebra [4, II. 61-63]. Recognizing the inadequacy of Purkiss’ proof, Chrystal 
treated only the case where f and g are symmetric polynomials; these he rewrote in terms of the 
elementary symmetric polynomials, and after some computation he was able to establish the 
principle (apart from some degenerate situations). With this encouragement, then, let us take up 
where Purkiss left off and try to show that for some reason we never get saddle points. 

For this we need the second-order terms in the Lagrange multiplier method. These ought to be 
familiar, but in fact most advanced calculus books seem to skip them. Briefly, then, let us again 
consider one of our curves $c(t)$ lying in the set where $g = g(P)$. We have 

\[ 0 = \left.\frac{d^2}{dt^2}\, g(c(t))\right|_{t=0} = \sum_i (D_i g)(P)\, c_i''(0) + \sum_{i,j} (D_i D_j g)(P)\, c_i'(0)\, c_j'(0) \]

and 

\[ \left.\frac{d^2}{dt^2}\, f(c(t))\right|_{t=0} = \sum_i (D_i f)(P)\, c_i''(0) + \sum_{i,j} (D_i D_j f)(P)\, c_i'(0)\, c_j'(0). \]

We know that at our critical point we have $\nabla f(P) = \lambda(\nabla g)(P)$ for some scalar $\lambda$. Multiplying the 
first equation above by $\lambda$ and subtracting, we get 

\[ \left.\frac{d^2}{dt^2}\, f(c(t))\right|_{t=0} = \sum_{i,j} \bigl[(D_i D_j f)(P) - \lambda (D_i D_j g)(P)\bigr]\, c_i'(0)\, c_j'(0). \]

For a local maximum or minimum, we want these second derivatives to have the same sign for all 
$c(t)$. Thus the extra condition we need is that the quadratic form 

\[ Q(v) = \sum_{i,j} \bigl[(D_i D_j f)(P) - \lambda (D_i D_j g)(P)\bigr]\, v_i v_j \]

should be positive definite or negative definite on the space of all $v$ perpendicular to $\nabla g(P)$. 
Using the implicit function theorem, one can show that this condition is indeed sufficient [5, 154]. 

This tells us what we need, but why should it automatically be true for symmetric $f$ and $g$? Let 
us look again at our example, 

\[ f = x_1 x_2 + x_2 x_3 + x_3 x_1, \qquad g = (x_1 x_2 x_3)^2. \]

At a point $(r, r, r)$ we find that the matrices of second partials are 

\[ (D_i D_j f(P)) = \begin{pmatrix} 0 & 1 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 0 \end{pmatrix}, \qquad (D_i D_j g(P)) = \begin{pmatrix} 2r^4 & 4r^4 & 4r^4 \\ 4r^4 & 2r^4 & 4r^4 \\ 4r^4 & 4r^4 & 2r^4 \end{pmatrix}. \]

What strikes the eye here is that all the diagonal entries are equal and all the off-diagonal entries 
are equal. Again, once we notice this we can easily check it in general. 

LEMMA 2. Suppose $f(x_1,\ldots,x_n)$ is a symmetric twice-differentiable function. Then at a point 
$(r,\ldots,r)$ all $D_i D_i f$ are equal and all $D_i D_j f$ for $i \ne j$ are equal. 

Proof. In the proof of Lemma 1 we saw that $D_{\pi(i)} h_\pi = (D_i h)_\pi$ for any $h$. Applying this rule 
twice, we get 

\[ D_{\pi(i)} D_{\pi(j)} f = D_{\pi(i)} D_{\pi(j)} f_\pi = D_{\pi(i)} \bigl[(D_j f)_\pi\bigr] = [D_i D_j f]_\pi. \]

Take first $i = j$, and choose $\pi$ so $\pi(i) = 1$; this gives us $(D_1 D_1 f)(r,\ldots,r) = (D_i D_i f)(r,\ldots,r)$. 
Take next any $i \ne j$, and choose $\pi$ so $\pi(i) = 1$ and $\pi(j) = 2$; then we get $(D_1 D_2 f)(r,\ldots,r) = 
(D_i D_j f)(r,\ldots,r)$. ∎ 

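A straightforward computation now gives us the final lemma we need: 

LEMMA 3. Suppose a quadratic form $Q$ is given by 

\[ Q(v_1,\ldots,v_n) = \sum_i b\, v_i v_i + \sum_{i \ne j} c\, v_i v_j. \]

Then for all $v$ satisfying $\sum v_i = 0$, we have $Q(v) = (b - c) \sum v_i^2$. (Indeed, squaring $\sum v_i = 0$ gives $\sum_{i \ne j} v_i v_j = -\sum v_i^2$.) 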
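
The identity in Lemma 3 follows by expanding $(\sum v_i)^2 = 0$, which gives $\sum_{i \ne j} v_i v_j = -\sum v_i^2$ and hence $Q(v) = (b - c)\sum v_i^2$. The sketch below spot-checks this on random zero-sum vectors; the function names, the seed, and the sample values of $b$ and $c$ are illustrative choices only.

```python
import random


def quadratic_form(v, b, c):
    """Q(v) = sum_i b*v_i*v_i + sum_{i != j} c*v_i*v_j."""
    n = len(v)
    diagonal = sum(b * v[i] * v[i] for i in range(n))
    off_diagonal = sum(c * v[i] * v[j]
                       for i in range(n) for j in range(n) if i != j)
    return diagonal + off_diagonal


def random_zero_sum_vector(n, rng):
    """Random vector shifted by its mean so that the entries sum to zero."""
    v = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    mean = sum(v) / n
    return [vi - mean for vi in v]


rng = random.Random(0)
b, c = 2.0, 4.0   # e.g. the second partials of g in the text at r = 1
max_error = 0.0
for _ in range(100):
    v = random_zero_sum_vector(5, rng)
    expected = (b - c) * sum(vi * vi for vi in v)
    max_error = max(max_error, abs(quadratic_form(v, b, c) - expected))

print(max_error < 1e-9)
```

In particular the sign of $Q$ on the zero-sum vectors is the sign of $b - c$, so the form is definite there unless $b = c$.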


THEOREM (THE PURKISS PRINCIPLE). Let $f$ and $g$ be symmetric functions with continuous second 
derivatives in the neighborhood of a point $P = (r,\ldots,r)$. On the set where $g$ equals $g(P)$, the 
function $f$ will have a local maximum or minimum at $P$ except in degenerate cases. 

Proof. Assuming $\nabla g(P) \ne 0$, we know from Lemma 1 that $\nabla f(P)$ has the form $\lambda \nabla g(P)$. The 
second partial derivatives of $f$ and $g$ satisfy the equalities in Lemma 2, and hence the terms 
$(D_i D_j f)(P) - \lambda (D_i D_j g)(P)$ also satisfy those equalities. Lemma 1 shows that the vectors $v$ 
perpendicular to $\nabla g(P)$ are those with $\sum v_i = 0$. Lemma 3 then shows that on those $v$ our 
quadratic form (if not identically zero) is positive or negative definite. The result then follows 
from the Lagrange multiplier criterion. ∎ 


Two types of degeneracy were excluded from the proof: we assumed that $\nabla g(P)$ was nonzero, 
and then we assumed that the second-order terms in $f - \lambda g$ did not all vanish in the directions 
perpendicular to $\nabla g(P)$. Such exclusions are indeed necessary, and the Purkiss principle is not 
universally true. To see why a condition on $\nabla g$ is needed, consider the extreme case where $g$ is 
constant everywhere; there is then no constraint, and hardly any symmetric $f$ will satisfy the 
condition. It is a little harder to find an example that fails with the second-order degeneracy, but 
here is one such case. Let the functions be 

\[ f = x_1^4 x_2 x_3 + x_2^4 x_3 x_1 + x_3^4 x_1 x_2, \qquad g = x_1^3 x_2^3 + x_2^3 x_3^3 + x_3^3 x_1^3 \]

around the point $P = (1, 1, 1)$. Consider the curve consisting of points $(st, s, s)$, where $t$ is close to 
1 and $s$ is chosen to keep $g = 3$ (that is, $s^6(2t^3 + 1) = 3$). For such points 

\[ f(st, s, s) = s^6(t^4 + 2t) = 3(t^4 + 2t)/(2t^3 + 1). \]

The derivative of this comes out to be $6(t^3 - 1)^2/(2t^3 + 1)^2$, which is zero at $t = 1$ but positive 
on both sides nearby. Thus $f$ has values less than 3 for $t = 1 - \varepsilon$ and bigger than 3 for $t = 1 + \varepsilon$, 
and the Purkiss principle fails. It is not at all surprising that such exceptions should exist. What is 
surprising is how widely the principle turns out to be correct. 
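
The degenerate example can be checked numerically. The sketch below takes $f = x_1^4x_2x_3 + x_2^4x_3x_1 + x_3^4x_1x_2$ and $g = x_1^3x_2^3 + x_2^3x_3^3 + x_3^3x_1^3$, walks along the curve $(st, s, s)$ with $s^6(2t^3 + 1) = 3$, and confirms that $f$ passes through the value 3 at $t = 1$ instead of turning around. The helper `point_on_curve` and the sample values of $t$ are illustrative assumptions.

```python
def f(x1, x2, x3):
    return x1**4 * x2 * x3 + x2**4 * x3 * x1 + x3**4 * x1 * x2


def g(x1, x2, x3):
    return x1**3 * x2**3 + x2**3 * x3**3 + x3**3 * x1**3


def point_on_curve(t):
    """Point (s*t, s, s) with s > 0 chosen so that g = s^6 (2 t^3 + 1) = 3."""
    s = (3.0 / (2.0 * t**3 + 1.0)) ** (1.0 / 6.0)
    return (s * t, s, s)


for t in (0.9, 0.99, 1.0, 1.01, 1.1):
    p = point_on_curve(t)
    # g stays at 3 along the curve; f increases through 3 as t passes 1.
    print(t, round(g(*p), 9), f(*p))
```

The printed values of $f$ increase steadily with $t$, so $P = (1, 1, 1)$ is neither a constrained maximum nor a constrained minimum.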

Thus far we have come in direct pursuit of the Purkiss principle. By good classical methods we 
have analyzed it; so far as it is true, we have proved it. Here we might stop. But in fact, the scope 
of the principle is much wider than we have yet imagined. For an alert reader, the first sign of this 
should have been Lemma 3. That was a straightforward computation, but in the line of argument 
it seemed to be a miracle, because for no apparent reason it gave us just what we needed. Such a 
seeming miracle should never be accepted on its face, for nine times out of ten it marks the 
presence of a general property not yet recognized or understood. 

Furthermore, there are examples to suggest a more general result. Think for instance of a 90° 
rotation around the vertical axis in three-space. This rotation preserves $x_1^2 + x_2^2 + x_3^2$, and in 
particular it maps the unit sphere to itself, leaving the north and south poles fixed. Let $f$ be any 
other function preserved by the same quarter turn, and consider its values on the unit sphere. It is 
intuitively clear that the north pole and south pole must be critical points for $f$ on the sphere. 
Depending on the strength of your intuition, perhaps you can also see that the poles will be 
extrema rather than saddle points (apart from degenerate cases). Certainly it is easy to check this 
in examples. But this situation is not covered by the original Purkiss principle. 

The geometric language in this example can serve to remind us that symmetry was a geometric 
concept to begin with. The symmetry that comes from interchangeability of the variables is only 
one of many types that a function might possess. What elementary geometry suggests is that we 
should allow other linear changes of variable. (We leave out translations because we are interested 
in fixed points.) As in all symmetry situations, two changes of variable that preserve a function f 
can be composed to get another one, and we will be dealing with a group of transformations. 
There is no reason to bother with continuous families of maps preserving f, since usually then we 
could reduce f to a function of fewer variables. Thus we are led to consider what happens to the 
Purkiss principle when we replace interchanges of variables by some other finite group of linear 
transformations. 

Unavoidably now we must raise the level of discussion to assume some familiarity with linear 
algebra. Our basic structure will be a finite group $G$ of linear transformations on a finite-dimensional 
real vector space $V$. The “symmetric” functions will be those $f$ that satisfy $f(Tv) = f(v)$ for 
all $T$ in $G$; the usual name for these is invariant functions. The points $(r,\ldots,r)$ that we considered 
before are precisely the ones unaffected by interchanges of coordinates; in our general situation 
we should correspondingly consider the fixed points $P$ in $V$, those sent to themselves by every $T$ in 
$G$. We must also make explicit an invariance property that was hidden in the earlier treatment: 
interchanging coordinates simultaneously in two vectors $(x_1,\ldots,x_n)$ and $(y_1,\ldots,y_n)$ does not 
change the dot product $x \cdot y = \sum x_i y_i$. The corresponding tool in general is given to us by the 
following standard result. 


LEMMA 4. There exists a positive definite inner product $( \, , \, )$ on $V$ that is $G$-invariant; in other 
words, $(Sv, Sw) = (v, w)$ for every $S$ in $G$ and $v, w$ in $V$. 

Proof. Let $v \cdot w$ be the dot product in some coordinate system. Define 

\[ (v, w) = \sum_{T \in G} Tv \cdot Tw. \]

Clearly this is symmetric and bilinear and positive definite. For $S$ in $G$ we have $(Sv, Sw) = 
\sum_T TSv \cdot TSw$; and since $G$ is a group, the terms of this sum are just the terms of $(v, w)$ in different 
order. ∎ 
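
The averaging construction in this proof can be tried out on a very small group. The sketch below uses a hypothetical two-element group $\{I, A\}$ acting on the plane, where the matrix $A$ is our own illustrative choice with $A^2 = I$ and is not orthogonal for the ordinary dot product. The summed form is $(v, w) = v^{T} M w$ with $M = \sum_T T^{T} T$.

```python
def transpose(A):
    return [list(row) for row in zip(*A)]


def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]


def add(A, B):
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def apply(T, v):
    return [sum(t * x for t, x in zip(row, v)) for row in T]


IDENTITY = [[1, 0], [0, 1]]
A = [[1, 1], [0, -1]]            # illustrative choice: A*A is the identity
GROUP = [IDENTITY, A]

# Gram matrix of the summed form: (v, w) = sum over T of (Tv . Tw) = v^T M w.
M = [[0, 0], [0, 0]]
for T in GROUP:
    M = add(M, matmul(transpose(T), T))


def inner(v, w):
    """The G-invariant inner product built by summing over the group."""
    return sum(v[i] * M[i][j] * w[j] for i in range(2) for j in range(2))


v, w = [1, 2], [3, -1]
print(M)                                         # [[2, 1], [1, 3]]
print(inner(v, w), inner(apply(A, v), apply(A, w)))   # 5 5
```

The ordinary dot product is not preserved by this $A$ (it sends $v \cdot w = 1$ to $4$), while the summed form is, which is exactly the point of the lemma.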


Consider now a Taylor expansion of a function $f$ around a point $P$: it can be written in the 
form 

\[ f(P + v) = f(P) + (\nabla f(P), v) + Q_P^f(v) + R(v), \]

where $Q_P^f$ is a quadratic form and $R(v)$ is a remainder term satisfying 

\[ \lim_{v \to 0} R(v)/(v, v) = 0. \]

In such an expression the vector $\nabla f(P)$ and the quadratic form $Q_P^f$ are uniquely determined. 
Suppose therefore that we take a fixed point $P$ and an invariant function $f$. For each $T$ in $G$ we 
have then 

\[ f(P + v) = f(T(P + v)) = f(P + Tv) = f(P) + (\nabla f(P), Tv) + Q_P^f(Tv) + R(Tv). \]

By uniqueness we see that $Q_P^f(Tv) = Q_P^f(v)$ and $(\nabla f(P), Tv) = (\nabla f(P), v)$. Invariance of the 
inner product tells us that $(\nabla f(P), Tv) = (T^{-1} \nabla f(P), v)$; and since this equals $(\nabla f(P), v)$ for all 
$v$, we have $T^{-1} \nabla f(P) = \nabla f(P)$. Thus we get the following result, which is as close as we can 
come to Lemmas 1 and 2 in this generality. 

LEMMA 5. Let $f$ be invariant and $P$ fixed. Then $\nabla f(P)$ is fixed, and the quadratic form $Q_P^f$ is 
invariant (that is, $Q_P^f(Tv) = Q_P^f(v)$ for all $T$ in $G$). 


Nothing so strong as the Purkiss principle will be true unless we make some further 
assumptions about our group of transformations. To get the analogue of Lemma 1, for instance, 
we need to know that the fixed vector $\nabla f(P)$ is forced to be a multiple of the fixed vector $\nabla g(P)$. 
Thus we should assume that the fixed vectors in $V$ form a one-dimensional subspace. After that, 
we must separate two types of terms, as in Lemma 2; for this we need an analogue of the space 
where $\sum v_i = 0$. The general concept needed here is that of an invariant subspace, one sent to itself 
by all maps in $G$. We then have the following standard result. 


LEMMA 6. Let U be an invariant subspace. Let W be its orthogonal complement with respect to 
( , ). Then W is also an invariant subspace.


Proof. Take any w in W and T in G. Let u be any element of U. Then (Tw, u) equals
$(w, T^{-1}u)$, and this is zero because $T^{-1}u$ is in U. Thus Tw is orthogonal to U. ∎


In particular, our one-dimensional space $U_0$ of fixed vectors has an orthogonal complement $W_0$,
which is an invariant subspace. Now if we look back at the proof of Lemma 2, we see that it
depended on our having a great many permutations, enough to match most of the matrix entries
with each other. The general analogue of this is irreducibility, which means that there should be no
nontrivial invariant subspaces inside $W_0$. Using this, we can finally establish the general property
behind Lemma 3. 


LEMMA 7. Let G be a finite group of linear transformations acting irreducibly on a finite-dimen- 
sional real vector space W. Let Q(w) be a G-invariant quadratic form on W. Then Q(w) is a scalar 
multiple of (w, w). In particular, Q is positive or negative definite whenever it is not identically zero. 


Proof. We can express the quadratic form Q in terms of the invariant inner product, getting
Q(w) = (Bw, w) for a unique self-adjoint linear map $B: W \to W$. For each T in G we have

$$(Bw, w) = Q(w) = Q(Tw) = (BTw, Tw) = (T^{-1}BTw, w).$$

The uniqueness of B tells us then that $T^{-1}BT = B$. In other words, BT = TB. But now the real
self-adjoint map B has some real eigenvalue b. Let X be {w in W | Bw = bw}, the corresponding
eigenspace. For w in X and T in G we have BTw = TBw = Tbw = bTw, so Tw is in X. Thus X is
an invariant subspace of W. Since it is nontrivial, it must be all of W. Thus $B = b \cdot \mathrm{Id}$, and
Q(w) = (Bw, w) = b(w, w). ∎
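
A small exact computation makes the conclusion of Lemma 7 concrete. The sketch below is an illustration under stated assumptions, not part of the original argument: the group is taken to be $S_3$ permuting three coordinates, W is the plane where the coordinates sum to zero (on which $S_3$ acts irreducibly and the ordinary dot product is invariant), and the symmetric matrix B and the helper names are arbitrary choices. Averaging an arbitrary quadratic form over the group gives an invariant form Q, and the ratio Q(w)/(w, w) turns out to be the same for every nonzero w in W.

```python
# Illustrative sketch of Lemma 7 for the symmetric group S_3 permuting
# coordinates; the matrix B and the helper names are assumptions.
from fractions import Fraction
from itertools import permutations

PERMS = list(permutations(range(3)))

def quad(B, v):
    """Value of the quadratic form with symmetric matrix B at v."""
    return sum(B[i][j] * v[i] * v[j] for i in range(3) for j in range(3))

def averaged_quad(B, v):
    """Average of quad(B, .) over all coordinate permutations of v.
    The result is an S_3-invariant quadratic form in v."""
    total = sum(quad(B, tuple(v[p[i]] for i in range(3))) for p in PERMS)
    return Fraction(total, len(PERMS))

def ratio_on_plane(B, w):
    """Q(w) / (w . w) for a nonzero integer w with w[0] + w[1] + w[2] == 0."""
    assert sum(w) == 0 and any(w)
    return averaged_quad(B, w) / sum(x * x for x in w)

if __name__ == "__main__":
    B = ((1, 2, 0), (2, 5, -3), (0, -3, 7))   # an arbitrary symmetric matrix
    for w in [(1, -1, 0), (2, -1, -1), (3, -5, 2)]:
        # The ratio does not depend on w, as Lemma 7 predicts.
        print(w, ratio_on_plane(B, w))
```

Exact rational arithmetic is used so that the equality of the ratios is not blurred by rounding.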


We have now established the analogues of our three initial lemmas. As before, we must rule out 
degenerate cases: that is, we assume that $\nabla g(P)$ is nonzero and that the second-order terms in
$f - \lambda g$ around P do not vanish on the space orthogonal to $\nabla g(P)$ under ( , ). The proof of the
original theorem then gives us our wider result. 


THEOREM (The EXTENDED PURKISS PRINCIPLE). Let G be a finite group of linear transformations 
of a finite-dimensional real vector space V. Assume that the fixed vectors form a one-dimensional 
subspace, and that G acts irreducibly on the complementary subspace. Let f and g be twice-differentia- 
ble G-invariant functions on a neighborhood of a fixed vector P. Then on the set where g(v) = g(P), 
the function f has a local maximum or minimum at P except in degenerate cases. 
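
As a sanity check on the statement, one can test a classical symmetric problem numerically. The sketch below is only an illustration under assumptions of my own choosing: G is $S_3$ permuting coordinates, f(x, y, z) = xyz, g(x, y, z) = x + y + z, and P = (1, 1, 1). The sampled points P + (a, b, -a - b) all lie on the set g = g(P) = 3, and the function names and grid size are hypothetical.

```python
# Illustrative sketch: f = xyz on the set x + y + z = 3 near P = (1, 1, 1).
# The choice of f, g, P and the grid of test points are assumptions.
from fractions import Fraction

def f(x, y, z):
    return x * y * z

def constrained_values(step=Fraction(1, 20), radius=2):
    """Values of f at points P + (a, b, -a - b), which all satisfy g = 3."""
    values = {}
    for i in range(-radius, radius + 1):
        for j in range(-radius, radius + 1):
            a, b = i * step, j * step
            values[(a, b)] = f(1 + a, 1 + b, 1 - a - b)
    return values

if __name__ == "__main__":
    vals = constrained_values()
    # P itself corresponds to (a, b) = (0, 0); every other sampled point
    # on the constraint set gives a strictly smaller value of f.
    print(all(v < vals[(0, 0)] for k, v in vals.items() if k != (0, 0)))
```

A finite sample like this proves nothing by itself, but it shows what the theorem asserts: the symmetric point is a local extremum (here a maximum) of f on the constraint set.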


Is this now the ultimate generalization of the Purkiss principle? Not at all. Indeed, Purkiss 
himself mentioned a similar result where the variables were subject to several independent 
constraints by different functions $g_1 = g_1(P), \ldots, g_r = g_r(P)$. Other examples will easily come to
mind. If we are to do more than just accumulate further cases, we must now reach for more 
general geometric concepts, those from the rudimentary theory of differentiable manifolds. 
Assuming familiarity with those, we can examine the theorem with a critical eye, distinguishing 
the soul of the principle from its mere outward limbs and flourishes.

First of all, we observe that the actual values of the function g are never mentioned. All that 
matters is the set M of points where g is equal to g(P). Instead of having g symmetric, then, all we
need to require is that our group G should map M to itself. The condition $\nabla g(P) \neq 0$ now is the
familiar condition that allows us to introduce smooth coordinates on M around P. Thus we can
absorb that condition by saying that our set M should be a differentiable manifold. (There is of
course no need for M to be complete; it may be nothing but a smooth piece around P.) The values 
of f at points off M are also irrelevant, and all we need for the statement is that f should be 
defined as a smooth G-invariant function on M. 

Furthermore, there is no need for the maps in G to be induced by linear maps on some ambient 
space. Linearity will enter automatically, because any smooth map of M to M that fixes P will 
induce a linear map on the tangent space to M at P. This tangent space in our earlier situation 
was precisely the orthogonal complement to the span of $\nabla g(P)$, and so this is the space where we
should assume irreducibility. Thus we are led to formulate a modernized version of our theorem: 


THEOREM (The MODERN PURKISS PRINCIPLE). Let M be a differentiable manifold. Let G be a 
finite group of smooth maps from M to M. Let P be a point in M fixed by G, and let f be a 
differentiable function on M invariant under G. Assume that the action induced by G on the tangent 
space at P is (nontrivial and) irreducible. Then P is a critical point of f; and if this critical point is 
nondegenerate, it is a local maximum or minimum of f. 


Proof. This theorem shows nicely how much can be done with the machinery of manifolds; for 
though it is a much more general result, it requires no new ideas in its proof. To begin with, f 
induces a linear function $df_P$ on the tangent space at P. The kernel of $df_P$ will be G-invariant, and
irreducibility thus requires $df_P$ to be zero. (We assume that the action is nontrivial in order to rule
out a counterexample in dimension one.) Hence P is a critical point for f. Consequently, the 
second-order terms of f induce a quadratic form on the tangent space, and clearly this form will be 
G-invariant. By definition one says that the critical point is nondegenerate when this quadratic 
form is nondegenerate. In that case Lemma 7 again shows that the form is positive definite or 
negative definite, and P accordingly gives a local minimum or maximum. 


This theorem has a quite up-to-date sound, and so with it we can end our pursuit of the Purkiss 
principle. I do not happen to have seen this final version anywhere before, but I make no great 
claims for it. More important is that we now see how such a result may stand at the end of a 
search that began with nothing more than some interchangeable variables in calculus problems. 


BIOGRAPHICAL APPENDIX 


Olry Terquem ([3], [7], [8]) was born in Metz, France, in 1782. His native language was a form 
of Yiddish, and his early studies were limited to Hebrew and the Talmud. But the upheaval of the 
French Revolution happened to bring his older brother in contact with the Jewish community at 
Coblentz, and a tutor engaged there led Olry into more general studies. His weakness in French 
made him fail in his first application to the École Polytechnique, but he was admitted on his
second attempt (1801). In 1804 he began to teach higher mathematics at the lycée in Mayence,
moving in 1811 to a professorship in the artillery school there. In 1814 he was called to Paris as
professor attached to the Comité de l’Artillerie; in this post he served as librarian for the Dépôt
Central d’Artillerie and also as a general scientific consultant. Exempted from mandatory
retirement, he continued active until his death at age 80. 

Terquem translated technical works related to artillery and also wrote textbooks on algebra, 
geometry, and mechanics. Of his discoveries in pure mathematics, we might single out his 
computation of the number of normals from a given point to an algebraic surface of given degree. 
His mastery of languages helped make him an authority on the history of mathematics. He also 
wrote many articles on Jewish concerns, advocating for instance prayers in the vernacular 
languages rather than Hebrew. Perhaps his most important contribution to mathematics began in
1842, when he was sixty: he was cofounder of the journal Nouvelles Annales de Mathématiques, 
which he continued to edit until his death. Characteristic of the man are these lines from one of 
his very last letters: “I believe that human intelligence approaches the divine intelligence 
asymptotically. Let us hope!” 


Viktor Yakovlevich Bunyakovskii (1804-1889) is a much better known mathematician than the 
others, and hence less needs to be said about him here [6]. After basic studies in Russia, he 
received his doctorate in mathematics at Paris in 1825. For most of his career he was a professor 
at the University of St. Petersburg (now Leningrad) and a member of the St. Petersburg Academy 
of Sciences; from 1864 to his death he was vice-president of the Academy. He wrote about 150 
published works in mathematics and mechanics. His most famous discovery, of course, was the
inequality
$$\left( \int_a^b fg \right)^2 \le \left( \int_a^b f^2 \right) \left( \int_a^b g^2 \right),$$
which he published in 1859. Unfortunately it appeared only in a separate pamphlet written in
Russian, and the inequality remained unknown in western Europe until it was rediscovered by
Schwarz in 1884.


Henry John Purkiss ([10], [11], [14]) was born in London in 1842. In 1859 he was admitted to
Trinity College, Cambridge, as a scholarship student (sizar). In 1862 he became one of the
founders and editors of The [Oxford, Cambridge, and Dublin] Messenger of Mathematics. He
received his B.A. degree in 1864 as the top mathematics student in his class (Senior Wrangler, 1st
Smith’s Prize). From 1864 to 1865 he was Vice-Principal of the College of Naval Architecture, 
South Kensington, and in 1865 he was named Principal of the Royal School of Naval Architec- 
ture. He drowned on September 17, 1865, in the river Cam. Little more is recorded of his sadly 
short life, but we can list his published papers: 


1. Theorem in maxima and minima, Messenger of Math., 1 (1862) 180-183. 

2. Dynamical note, Messenger of Math., 2 (1864) 228. 

3. The cardioid, Messenger of Math., 2 (1864) 241-249. 

4. Note on the comparative value of Simpson’s two rules, and on 
Dr. Woolley’s rule, Naval Architects’ Trans., 6 (1865) 48-50. 

5. The equation of the tangent, Messenger of Math., 3 (1866) 19-22. 

6. Notes on pedal coordinates, Messenger of Math., 3 (1866) 83-88. 

7. Notions of infinity derived from gnomonic projection, Messenger of 
Math., 3 (1866) 171-172. 

8. On certain formulas of mensuration, Quart. J. Math., 7 (1866) 
235-241. 


George Chrystal (1851-1911) was a Scotsman ([1], [12]). After studies in Aberdeen, he was in
residence at Peterhouse, Cambridge, from 1872 to 1875. As a student there, he spent much time 
with Maxwell in the newly opened Cavendish Laboratory. Many of his friends considered this a 
“waste of time,” since Maxwell’s work was not covered in the Tripos examination; even so, 
Chrystal came out second in his class. He was always inclined toward physics: his first major work 
was an experimental verification of Ohm’s Law, and shortly before his death he did significant 
work on the analysis of seiches (long-lasting waves in lakes). From 1879 until his death he was 
Professor of Mathematics at the University of Edinburgh, where he played a major role both in 
reforming the university curriculum and in raising the level of primary and secondary education in 
Scotland. 

Despite all this, Chrystal will always be best known for his classic algebra textbook, which is 
still in print. Officially he is present in this Appendix because his book gave the first proof of any 
significant form of the Purkiss principle, but in fact he is included because the temptation to quote
him is irresistible. Here is a bit from an address he gave a year before Part I of his book appeared: 


The whole teaching consists in example grinding. What should be merely the help to attain the end has 
become the end itself. The result is that algebra, as we teach it, is neither an art nor a science, but an 
ill-digested farrago of rules whose object is the solution of examination problems.... The end of all 
education nowadays is to fit the student to be examined; the end of every examination not to be an 
educational instrument, but to be an examination which a creditable number of men (however badly taught) 
shall pass. We reap, but we omit to sow... . The cure for all this evil is to give effect to a higher ideal of 
education in general, and of scientific education in particular... . It takes the hand of God to make a great 
mind, but contact with a great mind will make a little mind greater. 


Here is part of the Preface to Part II of his book, which is devoted mainly to a careful treatment 
of power series: 


A practice has sprung up of late (encouraged by demands for premature knowledge in certain examina- 
tions) of hurrying young students into the manipulation of the machinery of the Differential and Integral 
Calculus before they have grasped the preliminary notion of a Limit... on which all the meaning and all the 
uses of the Infinitesimal Calculus are based. Besides being to a large extent an educational sham, this course 
is a sin against the spirit of mathematical progress. 


And here, finally, is part of the famous Preface to Part I: 


The first object I have set before me is to develop Algebra as a science, and thereby to increase its 
usefulness as an educational discipline. I have also endeavoured so to lay the foundations that nothing shall 
have to be unlearned... . It becomes necessary, if algebra is to be anything more than a mere bundle of 
unconnected rules, to lay down generally the fundamental laws of the subject, and to proceed deductively. 


Amen. 
Wadsworth Pub, 1983, x + 578 pp. [ISBN: 0-534-01180-2] The usual topics of a pre-calculus
algebra/trigonometry course are all here, but arranged differently, with different emphasis. Might
be an approach that avoids the student impression that "I’ve seen all this before." AWR 


Elementary, T(13: 1). Operational Mathematics for Business, Second Edition. R.C. Pierce, Jr.,
W.J. Tebeaux. Wadsworth Pub, 1983, xx + 642 pp. [ISBN: 0-534-01235-3] Changes from the First Edi-
tion (TR, December 1980) inspired by the 1981 Tax Act. Also new are chapter tests, expanded prob-
lem sets and expanded chapters on graphing and statistics. JRG


Precalculus, T(13: 1). Precalculus: A Functional Approach to Algebra and Trigonometry. Peter Evano- 
vich, Martin Kerner. Holden-Day, 1982, xiii + 408 pp, $20.50. [ISBN: 0-8162-2715-2] Preliminaries, 
notation kept to a minimum (e.g., ∈ avoided for set membership). An early chapter introduces func-
tions, while inequalities and absolute values are next to last. Organization appears patched
together, with appendices inserted irregularly, and certain explanations seem unclear. Exercises 
seem overly elementary and repetitive. Some applications. RB 


Precalculus, T(13-14: 1). Algebra & Trigonometry with Applications. Jagdish C. Arya, Robin W. 
Lardner. Prentice-Hall, 1983, xiv + 633 pp, $23.95. [ISBN: 0-13-021675-5] A straightforward ("all
signal, no noise") fairly traditional text, the book is somewhat long on algebra (including such
topics as Descartes’ Rule of Signs and Cramer’s Rule) and somewhat short on trigonometry. Attrac-
tive format, good selection of routine exercises and applications. Tables, answers, index. JS


Precalculus, T(13: 1). Basic Mathematics for Calculus, Second Edition. Dennis G. Zill, Jacqueline 
M. Dewar, Warren S. Wright. Wadsworth Pub, 1983, x + 437 pp. [ISBN: 0-534-01197-7] Informal, "no- 
nonsense" approach to precalculus topics. Includes numerous examples, calculator problems, chapter 
reviews and tests, and short appendices for basic math review, mathematical induction, complex 
numbers and axis rotation. Application problems included in most sections. PB 


Precalculus, T(13: 1), S. Mathematics for Technical Education, Second Edition. Dale Ewen, Michael 
A. Topper. Prentice-Hall, 1983, vii + 504 pp, $23.95. [ISBN: 0-13-565168-9] Essentially a precal- 
culus course which includes transcendental functions, and introductions to vectors, complex numbers, 
and analytic geometry. New topics in this edition include properties of determinants and inverse 
trigonometric functions. The text consists primarily of examples and exercises. (First Edition, 
TR, October 1976.) CEC 
Precalculus, T(13-14: 1). College Algebra. Max A. Sobel, Norbert Lerner. Prentice-Hall, 1983, xvi
+ 435 pp, $22.95. [ISBN: 0-13-141796-7] This text especially emphasizes the algebraic skills that 
will be used in calculus: simplifying difference quotients, recognizing composite functions (for the 
chain rule), solving inequalities (for finding signs of derivatives), et cetera. Also stresses 
graphing (geometric transformations induced by algebraic changes). Points out common pitfalls, has 
review exercises, and sample tests. GHM 


Precalculus, T(13: 1). College Algebra with Applications. Jagdish C. Arya, Robin W. Lardner. 
Prentice-Hall, 1983, xiv + 508 pp, $22.95. [ISBN: 0-13-140699-X] A review of basic algebra followed 
by chapters on lines, functions, conics, exponentials and logarithms, polynomials, and matrices and 
other topics from algebra. Some problems and exercises involve applications. FLW 


Precalculus, T*(13: 1). Introduction to Technical Mathematics. Loren Radford, Anthony Vavra, Shir- 
ley Rychlicki. Prindle, Weber & Schmidt, 1983, viii + 680 pp. [ISBN: 0-87150-339-5] Readable, 
student-oriented, comprehensive coverage of linear, quadratic, trigonometric, exponential and loga- 
rithmic functions for readers with basic skills in arithmetic, geometry and algebra. Material is 
related to needs of technologists. Unusual features include chapter on measurements and data han- 
dling, and appendices on graphical treatment of data, data analysis and dimensional analysis. Cal- 
culator use integrated throughout text. Numerous routine exercises with answers to half. JK 


Education, P. The Programmable Hand Calculator: A Teacher’s Tool for Mathematics Classroom Lec- 
tures. Bernard Seckler. Sigma Pr, 1981, 207 pp, (P). Gives programs, with comments and examples, 
for both the Hewlett Packard 29C and the Texas Instruments 58C, which can be used to solve several
college-level mathematics problems. Topics range from factorization of integers to the Runge-Kutta 
method for solving a differential equation. RSK 


Education, T*(16), P*. Teaching Problem-Solving Strategies. Daniel T. Dolan, James Williamson. 
Addison-Wesley, 1983, xiv + 322 pp, $16 (P). [ISBN: 0-201-10231-5] Carefully constructed introduc- 
tion to six problem-solving strategies for integration with junior high math programs. Includes 
introductory lessons, activities and practice problems with worksheets ready for duplicating. 
Activities correlated with strategies and standard topics, thoughtful page layouts and interesting 
mix of topics. PB 


Education, S(16-17). Ideas from the Arithmetic Teacher, Grades 6-8, Middle School. George Immer- 
zeel, Melvin Thomas. NCTM, 1982, ii + 142 pp, $5.40 (P). [ISBN: 0-87353-200-7] Reprinted directly 
from the IDEAS section of this journal. Student activity sheet and teacher suggestions back-to-back 
on perforated pages for ease of use. Good ideas--ideal way to introduce potential middle school
teachers to the Arithmetic Teacher. MW 


Education, S(16-17), P. Teaching Mathematics. Ed: Michael Cornelius. Nichols Pub, 1982, 248 pp, 
$15.00 (P); $27.50. [ISBN: 0-89397-137-5; 0-89397-136-7] Ten essays on current British issues in 
mathematics education. Practical ideas on providing for exceptional or learning disabled children, 
teaching through recreational mathematics, and providing applications in other areas. Discusses 
teaching at primary, middle and secondary level. MW 


History, P, L. Loo-Keng Hua Selected Papers. Ed: H. Halberstam. Springer-Verlag, 1983, xiv + 889 
pp, $42. [ISBN: 0-387-90744-0] A selection of about 50 papers, supplemented by topical surveys cov-
ering number theory, algebra and geometry, and function theory. Includes a brief biography, a com-
plete publication list, and a tantalizing list (without any details) of over 100 industrial problems 
to which Hua has contributed significantly by leading a team of applied mathematicians in the 
development of new mathematical methodology. LAS 


History, T*(14-16: 2), S**, L**. The Nature and Growth of Modern Mathematics. Edna E. Kramer. 
Princeton U Pr, 1982, xxiv + 758 pp, $9.95 (P). [ISBN: 0-691-02372-7] A welcome re-issue of one of 
the best books in its genre: a compelling exposition of mathematics--theories, applications, people, 
societal context--from antiquity to mid-twentieth century. Fully one-third of the book is devoted 
to twentieth century mathematics in terms that any serious non-mathematician can comprehend. That 
in itself makes this book virtually unique. (First Edition, TR, April 1971; Extended Review, April 
1974.) LAS 


Combinatorics, S(17-18), P. Lecture Notes in Mathematics—969: Combinatorial Theory. Ed: D. Jung- 
nickel, K. Vedder. Springer-Verlag, 1982, 326 pp, $16 (P). [ISBN: 0-387-11971-X] A collection of 21 
research papers on a variety of topics in combinatorics that constitute the proceedings of the 
Conference on Combinatorial theory at Schloss Rauischholzhausen in May 1982, marking the 375th
anniversary of Universität Giessen. SS


Combinatorics, P, L. Combinatorics on Words. M. Lothaire. Encyclopedia of Math. & Its Applic., V. 
17. Addison-Wesley, 1983, xix + 238 pp, $32.95. [ISBN: 0-201-13516-7] A collaborative collection of 
articles which thoroughly and nicely survey and unify much recent (and older) research on "words," 
i.e., strings of symbols, as both an algebraic and combinatorial subject. Exposition is suffi- 
ciently elementary for advanced undergraduates, computer scientists and linguists. Many problems, 
extensive bibliography. GHM 


Number Theory, S(13), P, L. The Lore of Prime Numbers. George P. Loweke. Vantage Pr, 1982, viii +
259 pp, $17.95. [ISBN: 533-04347-6] A history of number theory which focuses on the study of primes
up until 1966. The approach is expository so it includes no difficult proofs. Interesting reading.
Well documented. CEC


Number Theory, S(18), P. Number Theory Related to Fermat’s Last Theorem: Proceedings of the Confer-
ence Sponsored by the Vaughn Foundation. Ed: Neal Koblitz. Progress in Math., V. 26. Birkhauser
Boston, 1982, x + 362 pp, $30. [ISBN: 3-7643-3104-6] This volume is the result of a meeting at MIT’s
Endicott House in September, 1981 on current mathematical work relating to Fermat’s last theorem.
CEC 


Linear Algebra, S(16-17), L. Hankel and Toeplitz Matrices and Forms: Algebraic Theory. I.S. 
Iohvidov. Transl: G. Philip A. Thijsse. Birkhauser Boston, 1982, xiii + 231 pp, $24.95. [ISBN: 3- 
7643-3090-2] Translation of 1974 Russian text, with up-dated references; exercises. Hankel (resp. 
Toeplitz) matrices are those whose entries depend only on the sum (difference) of the indices. This 
purely algebraic study is based on classical linear algebra; inertia, signature, etc., are intro- 
duced and applied. RB 


Linear Algebra, T(14-17: 1, 2), S, P, L. Matrices with Applications in Statistics, Second Edition.
Franklin A. Graybill. Wadsworth Pub, 1983, xii + 461 pp, $31.95. [ISBN: 0-534-98038-4] Presupposes 
a course in matrix or linear algebra. Considers topics useful in connection with multivariate 
analysis and the general linear model, including generalized inverses, patterned matrices, integra- 
tion and differentiation, inverse positive matrices, projections. FLW 


Linear Algebra, T(14-15), S. An Introduction to Linear Algebra. L. Mirsky. Dover Pub, 1982, xi + 
433 pp, $8 (P). [ISBN: 0-486-61547-2] Reprint of 1955 text. A classical approach near the beginning 
is gradually supplanted by a more abstract viewpoint; concreteness throughout. Filled with histori- 
cal references. Some dated or non-standard terminology. Numerous exercises extend the presenta- 
tion, and there are many miscellaneous problems. RB 


Linear Algebra, T(14-15: 1). Applied Linear Algebra with APL. Garry Helzer. Little, Brown & Co,
1983, xv + 588 pp, $35 [ISBN: 0-316-35526-7]; Instructor’s Manual, v + 52 pp, (P). An introduction 
to linear algebra emphasizing those ideas of greatest importance in applications using APL as a 
teaching tool. Not as much abstraction as many texts, but more computational algorithms and appli- 
cations than most. AO 


Category Theory, P. Lecture Notes in Mathematics-962: Category Theory. Ed: K.H. Kamps, D. Pumplin, 
W. Tholen. Springer-Verlag, 1982, xv + 322 pp, $16 (P). [ISBN: 0-387-11961-2] These proceedings of 
the international conference held at Gummersbach, July 6-10, 1981 contain applications to algebra, 
logic and topology. JAS 


Algebra, T*(17-18: 1, 2), P. Associative Algebras. Richard S. Pierce. Grad. Texts in Math., V. 
88. Springer-Verlag, 1982, xii + 436 pp, $39. [ISBN: 0-387-90693-2] Designed to bridge from early 
graduate study to the beginning of thesis specialization, starting with Wedderburn theory. After 
structure and representation theorems, second half concerns central simple algebras (studied via 
Brauer groups of fields). Well organized: short, modular chapters and sections, effective symbol 
indexing, sizable bibliography, many exercises. RB 


Algebra, P. Proceedings of Conference on Near-rings and Near-fields. Ed: G. Ferrero, C. Ferrero 
Cotti. Universita degli Studi di Parma, 1981, vii + 224 pp, (P). Lectures, papers, abstracts from 
the week-long conference held in San Benedetto del Tronto (Italy) in 1981. LCL 


Algebra, S(18), P. Lecture Notes in Mathematics-958: Group Extensions, Representations, and the 
Schur Multiplicator. F. Rudolf Beyl, Jurgen Tappe. Springer-Verlag, 1982, iv + 278 pp, $13.50 (P). 
[ISBN: 0-387-11954-X] A unified treatment of several areas of group theory, using the Schur multi-
plicator as the key concept and interpreting it in the light of modern homology and cohomology
theory. Applications to projective representations, isoclinism, and assorted related topics. 
Indexes, bibliography. JS 


Algebra, T(14-16: 1, 2), S. Fundamental Concepts of Algebra. Bruce E. Meserve. Dover Pub, 1981, ix
+ 294 pp, $6 (P). [ISBN: 0-486-61470-0] An unabridged and corrected republication of the second 
printing (1959) of the work originally published in 1953. This pre-"modern-math" text places much 
emphasis on the theory of polynomials and the theory of equations. Other chapters include number 
theory, determinants and matrices, geometrical constructions, and curve sketching. LCL 


Algebra, T(16-18), S, P. A Survey on Congruence Lattice Representations. E. Tamas Schmidt. 
Teubner-Texte, Band 42, BG Teubner, 1982, 115 pp, $5 (P). A survey of the work that has been done 
on the problem of representing a lattice as the congruence lattice of some special type of algebra. 
Exercises, bibliography. LCL 


Algebra, P. Seminar D. Eisenbud/ B. Singh/ W. Vogel/ Volume 2. D. Eisenbud, B. Singh, W. Vogel. 
Teubner-Texte, Band 48. B.G. Teubner, 1982, 108 pp, 14M (P). Eight papers on commutative rings and 
algebraic geometry. JD-B


Finite Mathematics, T(13: 1). College Mathematics with Business Applications, Third Edition. John 
E. Freund, Thomas A. Williams. Prentice-Hall, 1983, xvii + 445 pp, $24.95. [ISBN: 0-13-146498-1] 
Changes from the Second Edition include: more intuitive treatment of limits; review of elementary 
algebra; new chapter on mathematics of finance; chapters on simulation and periodic functions have 
been dropped. New end-of-chapter summaries, glossaries, and supplementary exercises. (First
Edition, TR, August-September 1969; Extended Review, January 1971; Second Edition, TR, November
1975.) JBG


Finite Mathematics, T(13-15: 1). Applied Finite Mathematics for the Managerial and Social Sciences. 
Soo Tang Tan. Prindle, Weber & Schmidt, 1983, ix + 470 pp. [ISBN: 0-87150-336-0] Conventional in 
content and straightforward in presentation, elementary treatment of linear programming (including 
simplex method), probability and statistics, Markov chains and game theory, mathematics of finance. 
Ample, largely routine exercises with answers, tables, index. JS 


Calculus, T(14-16: 1, 2), S, L. Advanced Calculus, Third Edition. Angus E. Taylor, W. Robert Mann. 
Wiley, 1983, xv + 732 pp, $31.95. [ISBN: 0-471-02566-6] A complete and comprehensive text at the 
intermediate-to-advanced undergraduate level. Covers a good deal of analysis, e.g., uniform conver-
gence, infinite series, mean-value theorems. Uses and assumes linear but not multilinear algebra.
Extensive explanation accompanies the theoretical development. Some calculator-aided numerical 
problems (Second Edition, TR, February 1973). Pz 


Calculus, T(13: 1, 2), S. Calculus: Pure and Applied. A.J. Sherlock, E.M. Roebuck, M.G. Godfrey. 
Edward Arnold Pub, 1982, x + 534 pp, $16.95 (P). [ISBN: 0-7131-3446-1] Polynomial calculus (the 
first quarter) followed by algebraic and transcendental functions, with applications interspersed. 
Nontheoretical, computational, special emphasis on numerical methods and differential equations 
(calculus as a language for science). LCL 


Calculus, T(13-14: 3). Calculus and Analytic Geometry. J. Douglas and Barbara Trader Faires. 
Prindle, Weber & Schmidt, 1982, 11 + 1141 pp. [ISBN: 0-87150-323-9] Stand-out features include 
numerous historical notes, over 6000 exercises, many with applications, and about fifty problems 
(but not solutions) from Putnam competitions. Trigonometric functions are introduced early. Read-
able, attractive two-color format, flexible. Complete solutions manual and two-volume student sup-
plement are available. JK


Differential Equations, P. Instability, Nonexistence and Weighted Energy Methods in Fluid Dynamics 
and Related Theories. B. Straughan. Research Notes in Math., No. 74. Pitman Pub, 1982, 169 pp, 
$18.95 (P). [ISBN: 0-273-08564-6] The author illustrates several techniques which lead to nonex-
istence results for solutions to partial differential equations; he also presents variations of a
weighted energy method which has been used to obtain results for a variety of problems in mathemati- 
cal physics. LCL 


Differential Equations, P. Lecture Notes in Mathematics-957: Differential Equations. Ed: D.G. de
Figueiredo, C.S. Hönig. Springer-Verlag, 1982, viii + 301 pp, $16 (P). [ISBN: 0-387-11951-5]
Proceedings of the First Latin American School of Differential Equations held at Sao Paulo, Brazil,
June 29 to July 17, 1981. JAS


Differential Equations, P. The Fundamental Principle for Systems of Convolution Equations. Daniele 
Carlo Struppa. Memoirs No. 273. AMS, 1983, iv + 167 pp, $10 (P). [ISBN: 0-8218-2273-X] Exposition 
of an extension of Ehrenpreis’ "fundamental principle" in the theory of systems of linear partial
differential equations. Extends methods of Berenstein and Taylor to establish admission of Fourier
representation of solutions in certain spaces. Brief discussion of some open problems. Good
bibliography. JK 


Differential Equations, T(15-17: 1), S, L*. Topics in Ordinary Differential Equations. William D. 
Lakin, David A. Sanchez. Dover Pub, 1982, vi + 154 pp, $4 (P). [ISBN: 0-486-61606-1] Unabridged and 
slightly corrected republication of 1970 edition (TR, February 1971). Informal, intuitive approach. 
Little theory. Few proofs. Perturbation methods link five chapters. Prerequisites include intro- 
ductory course in differential equations but not in complex variables. Not intended to be exhaustive 
treatment. Problems. References. JK 


Differential Equations, P. Generic Bifurcations for Involutory Area Preserving Maps. Russell J. 
Rimmer. Memoirs No. 272. AMS, 1983, v + 165 pp, $10 (P). The use of involutory area preserving 
maps in studying generic bifurcations associated with symmetric periodic solutions of Hamiltonian 
systems. JG 


Differential Equations, P. The Stability of Multi-dimensional Shock Fronts. Andrew Majda. Memoirs 
No. 275. AMS, 1983, iv + 95 pp, $6 (P). A systematic study of the linearized stability of
multidimensional shock-fronts, with special emphasis on applying the theory to shock-fronts for the
physical equations of gas dynamics. LCL


Differential Equations, S(18), P. Quasikonforme Abbildungen und elliptische Systeme erster Ordnung 
in der Ebene. Heinrich Renelt. Teubner-Texte, Band 46. B.G. Teubner, 1982, 140 pp, 14,50M (P). 
On systems of first order elliptic partial differential equations in the plane whose solutions are 
composites of analytic functions and quasiconformal mappings (Bers-Nirenberg). JD-B 


Differential Equations, S(17-18), P. Lectures on Exponential Decay of Solutions of Second-Order
Elliptic Equations: Bounds on Eigenfunctions of N-Body Schrödinger Operators. Shmuel Agmon. Math.
Notes, No. 29. Princeton U Pr, 1982, 118 pp, $10.50 (P). A study of estimates for solutions of
second order elliptic equations in unbounded domains. JG


Numerical Analysis, P. Analysis of Global Expansion Methods: Weakly Asymptotically Diagonal
Systems. L.M. Delves, T.L. Freeman. Comp. Math. & Applic. Academic Pr, 1981, x + 275 pp, $49. [ISBN:
0-12-208880-8] WAD systems, used in solving differential and integral equations, concern the
convergence of expansions of f(x) into sums of terms of the form a_i^(N) h_i(x) for i = 1,2,...,N. Some new
results, an account of known but scattered results, and an emphasis on computability and error 
analysis. AWR 


Numerical Analysis, P. Lecture Notes in Mathematics-968: Numerical Integration of Differential
Equations and Large Linear Systems. Ed: Juergen Hinze. Springer-Verlag, 1982, vi + 412 pp, $20 
(P). [ISBN: 0-387-11970-1] Proceedings of two workshops held at the University of Bielefeld in the 
spring of 1980. JAS 


Numerical Analysis, P. Lecture Notes in Mathematics-965: Topics in Numerical Analysis. Ed: P.R. 
Turner. Springer-Verlag, 1982, ix + 202 pp, $11 (P). [ISBN: 0-387-11967-1] Proceedings of a 1981 
summer school at the University of Lancaster: lecture notes from five of the courses, covering Vol- 
terra and Abel integral equations, multivariate approximation methods, splines, finite element 
methods, and piecewise linear homotopy algorithms. LAS 


Numerical Analysis, P. Lecture Notes in Mathematics-960: Multigrid Methods. Ed: W. Hackbusch, U. 
Trottenberg. Springer-Verlag, 1982, vii + 652 pp, $28.80 (P). [ISBN: 0-387-11955-8] Proceedings of 
a November 1981 conference held at Köln-Porz: three systematic introductory papers (with a uniform
notation for essential concepts) followed by 18 specific papers on theory, applications and software 
developments. LAS 


Numerical Analysis, P. Tables for Numerical Integration. H.V. Smith. Macmillan Pub, 1982, vi + 26 
pp, $5.95 (P). [ISBN: 0-02-852670-8] Tables of global error bounds for many of the well-known qua- 
drature formulae; a short introduction includes examples to illustrate their use. LCL 


Numerical Analysis, P. Birkhoff Interpolation. G.G. Lorentz, K. Jetter, S.D. Riemenschneider. 
Ency. of Math. & Its Applic., V. 19. Addison-Wesley, 1983, lv + 237 pp, $32.50. [ISBN: 0-201- 
13518-3] Polynomial interpolation given irregular information about the values of the function and
its derivatives. Applications to approximation, quadrature, splines and Chebyshev systems. Uses 
interpolation matrices, coalescence of knots, independent knots among other techniques. RWN 


Functional Analysis, S**(18), P**, L. A Hilbert Space Problem Book, Second Edition, Revised and 
Enlarged. Paul R. Halmos. Grad. Texts in Math., No. 19. Springer-Verlag, 1982, xvii + 369 pp, $28. 
[ISBN: 0-387-90685-1] A Second Edition of the justly famous original. 250 problems, 60 of them new 
since the First Edition (TR, April 1968), with remarks, hints, and complete solutions. The choice 
of interesting problems and their presentation is outstanding; the author’s remarks on (and beyond) 
the problems are invaluable. PZ 


Functional Analysis, P. Invariant Subspaces and Other Topics. Ed: C. Apostol, et al. Operator 
Theory: Advances & Applic., V. 6. Birkhauser Boston, 1982, 229 pp, $26.95. [ISBN: 3-7643-1360-9] 
The proceedings of the Sixth International Conference on Operator Theory held in Timisoara and Her- 
culane (Romania), June 1-11, 1981. This volume contains eighteen papers contributed by the confer- 
ence participants on invariant subspaces as well as a wide range of other topics (e.g., operator 
algebras, dilation theory). AO 


Functional Analysis, P. Spectral Methods in Linear Transport Theory. H.G. Kaper, C.G. Lekkerk- 
erker, J. Hejtmanek. Operator Theory: Advances & Applic., V. 5. Birkhauser Boston, 1982, xi + 345 
pp, $29.95. [ISBN: 3-7643-1372-2] This monograph provides a mathematical theory for linear transport 
processes in matter (e.g., neutron transport in nuclear reactors, photon transport in planetary and 
stellar atmosphere). The interest is on functional-analytic and spectral-theoretic methods. This 
branch of applied mathematics provides several nontrivial applications of functional analysis: long 
live Hilbert space! LCL 


Algebraic Geometry, P. Lecture Notes in Mathematics-956: Group Actions and Vector Fields. Ed: J.B.
Carrell. Springer-Verlag, 1982, 144 pp, $8 (P). [ISBN: 0-387-11946-9] Proceedings (eleven research 
papers) of the seminar on group actions and vector fields held at the University of British Columbia 
in January-February 1981. LCL 


Algebraic Geometry. Lecture Notes in Mathematics-959: Géométrie Algébrique Réelle et Formes
Quadratiques. Ed: J.-L. Colliot-Thélène, et al. Springer-Verlag, 1982, x + 458 pp, $22 (P). [ISBN:
0-387-11959-0] Proceedings of a conference held at the Université de Rennes in May 1981. JAS


Algebraic Geometry, T(18: 1), S, P. Vorlesungen über algebraische Flächen. Herbert Kurke.
Teubner-Texte, Band 43. BG Teubner, 1982, 204 pp, $7.92 (P). A monograph on the modern theory of 
algebraic surfaces. JD-B 


Algebraic Geometry, P. Curves in Projective Space. Joe Harris, David Eisenbud. Presses U
Montreal, 1982, 138 pp, $13 (P). [ISBN: 2-7606-0603-1] Considers the problem of describing curves in
projective space, emphasizing two methods of parametrizing projective curves. Chow and Hilbert
schemes are discussed, along with two other classificatory approaches: the Brill-Noether theory and
the Castelnuovo theory. SG


Algebraic Geometry, P. Enumerative Geometry and Classical Algebraic Geometry. Ed: Patrick Le Barz,
Yves Hervier. Progress in Math., V. 24. Birkhauser Boston, 1982, x + 252 pp, $20. [ISBN: 3-7643- 
3106-2] Partial proceedings of a conference held at Nice in July, 1981. The papers deal with
low-dimensional projective spaces and curves over algebraically closed fields. SG


Differential Geometry, S(17-18), P, L. Curvature and Homology. Samuel I. Goldberg. Dover Pub, 
1982, xvii + 315 pp, $6.50 (P). [ISBN: 0-486-64314-X] An unabridged and corrected republication of 
the 1962 edition. A study of the curvature and homology of Riemannian manifolds, as well as Kaehler 
manifolds. Topics include Betti numbers, Lie groups, and groups of transformations. JG 


Differential Geometry, T(17), S, L. Calcul différentiel et géométrie. Daniel Leborgne. Presses U 
France, 1982, 262 pp, (P). [ISBN: 2-13-037495-6] A modern approach to differential geometry from the
viewpoint of differentiable manifolds, tangent spaces and differential forms. JG 


Differential Geometry, T(17-18: 1), S, P, L. Lectures on Differential Geometry, Second Edition. 
Shlomo Sternberg. Chelsea Pub, 1983, xv + 442 pp, $25. [ISBN: 0-8284-0316-3] Apart from correction 
of typographical errors, the text is unchanged from the first edition but enhanced by two appen- 
dices: Appendix III is a complete reproduction of a paper co-authored with Guillemin on transitive 
differential geometry, and Appendix IV is a brief survey by the author of relevant developments of 
the past twenty years together with references. JS


Geometry, S(17-18), P. Finite Geometries: Proceedings of a Conference in Honor of T.G. Ostrom. Ed: 
Norman L. Johnson, Michael J. Kallaher, Calvin T. Long. Lect. Notes in Pure & Appl. Math., V. 82. 
Dekker, 1983, xii + 454 pp, $55 (P). [ISBN: 0-8247-1052-5] An extremely interesting festschrift
consisting of 36 papers on finite geometries, related algebraic and combinatorial topics, and their
histories. SS


Geometry, S(13), L. Famous Problems of Geometry and How to Solve Them. Benjamin Bold. Dover Pub,
1982, xii + 112 pp, $3 (P). [ISBN: 0-486-24297-8] An unabridged and slightly corrected version of 
the work originally published in 1969 as Famous Problems of Mathematics: A History of Construction 
with Straight Edge and Compass. Includes several famous classical construction problems and the 
analytic criteria which demonstrate the impossibility of their construction via straight edge and
compass. JNC


Topology, T**(15-17: 1, 2). Differential Topology: An Introduction. David B. Gauld. Pure & Appl.
Math., V. 72. Dekker, 1982, v + 241 pp, $29.75. [ISBN: 0-8247-1709-0] Neither pure point set 
topology nor technical and formal differential topology, but rather geometric topology with dif- 
ferentiation as a tool and as a link to the geometric environment of multivariable analysis. This 
is a well written text with a number of exercises. There is a helpful index of notation, but the
main index is rather brief. JAS 


Topology, P. Beta-homotopy Equivalences Have Alpha-cross Sections. Luis Montejano. Memoirs No. 
274. AMS, 1983, iv + 37 pp, $5 (P).


Operations Research, T(15-17), S. The High Cost of Clean Water: Models for Water Quality Manage- 
ment. Edward Beltrami. UMAP Expos. Mono. Ser. Birkhauser Boston, 1982, 53 pp, $8.95 (P). [ISBN: 
3-7643-3098-8] A prototypical water-pollution case study based on similar studies done on a more 
extensive scale in the Long Island area of New York. Several mathematical models (e.g., using
differential equations, integer programming) are presented to help articulate and analyze the
trade-offs between costs and water quality. The complexities of the real-world problem are discussed
and care is taken in noting simplifying assumptions in the models. Includes exercises with solutions,
as well as suggestions for further research. LCL


Optimization, P. Bonn Workshop on Combinatorial Optimization. Ed: Achim Bachem, Martin Grötschel,
Bernard Korte. Math. Stud., No. 66. Elsevier North-Holland, 1982, ix + 312 pp, $55.75 (P). [ISBN: 
0-444-86366-4] Eighteen papers covering a broad spectrum of optimization problems which arise in 
graph theory (matching, independence, paths). LCL 


Probability, S*(15-17), L*. Probability, Statistics and Truth, Second Revised English Edition.
Richard von Mises. Dover Pub, 1981, xii + 244 pp, $4 (P). [ISBN: 0-486-24214-5] Reprint of the 1957 
English edition of this classic, which was a translation of the definitive 1951 German Third Edi- 
tion. A pioneer effort in the foundations of probability theory. RSK 


Probability, T(16-17: 1, 2), S, P, L. Probability and Random Processes. Geoffrey Grimmett, David 
Stirzaker. Clarendon Pr, 1982, ix + 354 pp, $45; $19.95 (P). [ISBN: 0-19-853184-2; 0-19-853185-0] A 
rigorous introduction to probability theory with minimal reference to measure theory, followed by 
individual chapters on random processes: Markov chains, stationarity and diffusion, renewals, 
queues. A nice balance between formal and informal, theory and application. LCL


Probability, T(18: 1), S, P*. Brownian Motion and Diffusion. David Freedman. Springer-Verlag, 
1983, xii + 231 pp, $24. [ISBN: 0-387-90805-6] Originally published by Holden-Day (1971) (TR, 
October 1971). Brownian motion as a probability on a Borel σ-field. Diffusion is obtained from
Brownian motion by changing time. Emphasizes variation, reflection and invariance principles. 
Includes examples, heuristic arguments, motivation as well as formal proofs. SG 


Statistics, T(15-17: 1, 2), S. Introductory Engineering Statistics, Third Edition. Irwin Guttman,
S.S. Wilks, J. Stuart Hunter. Wiley, 1982, xi + 580 pp, $33.95. [ISBN: 0-471-07859-X] This edition
includes new material on reliability and contrasts, revisions of other topics, and many new problems 
and data sets (Second Edition, TR, April 1972). FLW 


Statistics, S(17-18), P. Theory of Statistical Experiments. H. Heyer. Ser. in Stat. Springer- 
Verlag, 1982, x + 289 pp, $19.80. [ISBN: 0-387-90785-8] A functional analytic treatment of statisti- 
cal decision theory. FLW 


Statistics, P. Higher Order Asymptotics for Simple Linear Rank Statistics. R.J.M.M. Does. Math. 
Centre Tracts, No. 151. Math Centrum, 1982, 91 pp, Dfl. 11,55 (P). [ISBN: 90-6196-243-9] Technical 
monograph dealing with the problem of obtaining Berry-Esseen bounds and Edgeworth expansions for
simple linear rank statistics. RSK


Statistics, P. Lecture Notes in Statistics-13: Contributions to a General Asymptotic Statistical
Theory. J. Pfanzagl, W. Wefelmeyer. Springer-Verlag, 1982, vii + 315 pp, $16.80 (P). [ISBN: 0-387-
90776-9] Presents the beginnings of a unified asymptotic theory, covering both parametric and non- 
parametric models, but restricted to normal approximations. Good set of references. RSK 


Statistics, P. Time Series Analysis: Theory and Practice 1. Ed: O.D. Anderson. Elsevier North-
Holland, 1982, ix + 756 pp, $93. [ISBN: 0-444-86337-0] Proceedings of the International Congress 
held at Valencia, Spain, June 1981. Contains 48 papers, in addition to an introduction by the edi- 
tor. RSK 


Statistics, P. Kernel Discriminant Analysis. D.J. Hand. Electronic & Electrical Engin. Res. 
Stud., V. 2. Research Studies Pr (US Distr: Wiley), 1982, x + 253 pp, $37.95. [ISBN: 0-471-10211-3] 
Provides a general nonparametric method for attacking problems of "classification" (in the areas of 
discriminant analysis and pattern recognition rather than cluster analysis). Includes a good set of 
references to this relatively new technique. RSK 


Statistics, T*(14: 1, 2). Modern Business Statistics. Ronald L. Iman, W.J. Conover. Ser. in Prob. 
& Math. Stat. Wiley, 1983, xxxiv + 777 pp, $27.95 [ISBN: 0-471-09668-7]; Introduction to Modern 
Business Statistics. W.J. Conover, Ronald L. Iman. Ser. in Prob. & Math. Stat. Wiley, 1983, xxxi 
+ 525 pp, $25.95. [ISBN: 0-471-09669-5] Introduction is the first thirteen chapters of Modern
Business Statistics. Both contain contingency tables, correlation, regression, and time series
analysis, in addition to other standard topics. Modern Business Statistics also contains multiple 
regression, analysis of variance and covariance, and decision theory. Novel features include the 
emphasis on assumptions, tests of these assumptions (particularly normality), and parallel presenta- 
tions of nonparametric alternatives. RSK 


Statistics, T(13-14: 1). Elementary Statistics, Second Edition. Mario F. Triola. 
Benjamin/Cummings Pub, 1983, xvi + 496 pp, $21.95. [ISBN: 0-8053-9320-X] Presupposes only high
school algebra. Covers the usual topics, including some nonparametric methods. Many examples use 
real data. Optional computer projects included. FLW 


Statistics, S(15-17), P. Applied Statistics: A Handbook of Techniques. Lothar Sachs. Transl:
Zenon Reynarowych. Series in Stat. Springer-Verlag, 1982, xxviii + 706 pp, $48. [ISBN: 0-387- 
90558-8] Translation and expansion of the 1978 German Fifth Edition. Written for non-mathematicians, 
it has as its main concern practical applications. Particular emphasis is given to small sample 
problems and distribution free methods. Extensive bibliography. RSK 


Statistics, P. A Festschrift for Erich L. Lehmann: In Honor of His Sixty-Fifth Birthday. Ed: Peter 
J. Bickel, Kjell A. Doksum, J.L. Hodges, Jr. Statistics/Probability Ser. Wadsworth Pub, 1983, viii 
+ 461 pp, $39.95. [ISBN: 0-534-98044-9] Twenty-eight statistical papers written by colleagues,
students, and friends of Erich Lehmann. Concludes with a bibliography of Lehmann’s works. RSK


Statistics, T*(15-17: 1, 2), L. Applied Linear Regression Models. John Neter, William Wasserman, 
Michael H. Kutner. Richard D. Irwin, 1983, xv + 547 pp, $27.50. [ISBN: 0-256-02547-9] Revision of 
the regression portion of Neter and Wasserman’s 1974 text Applied Linear Statistical Models (TR, 
December 1974). New topics include detection of multi-collinearity, ridge regression, detection of 
influential observations, and nonlinear regression. Examples from the health and biological sci- 
ences have been added, and problem sets have been substantially expanded. RSK 


Statistics, S(17-18), P. Grundlagen der sequentiellen Statistik. Hartmut Heckendorff. Teubner- 
Texte, Band 45. BG Teubner, 1982, 166 pp, $7.50 (P). A sophisticated introduction to the theory of 
sequential statistical experiments. Contains a good deal on Bayesian decision theory. JD-B 


Statistics, P. Pao-Lu Hsu Collected Papers. Ed: Kai Lai Chung. Springer-Verlag, 1983, xii + 589 
pp, $48. [ISBN: 0-387-90725-4] Contains reprints (in English) of all of Hsu’s published mathematical 
papers. Also includes biographical information and short discussions of Hsu’s work in inference, 
multivariate analysis, and probability. RSK 


Statistics, T(17: 1), P. Multivariate Statistical Methods, An Introduction. Marvin J. Karson. 
Iowa St U Pr, 1982, x + 307 pp, $24.95. [ISBN: 0-8138-1845-1] Assumes background in multivariate 
calculus, matrix algebra, and mathematical statistics, which are briefly reviewed. Includes
chapters on correlations, discriminant analysis and allocation, principal components analysis,
factor analysis, cluster analysis, and discrete multivariate analysis, in addition to the usual
introductory material. Contains many numerical examples. RSK


Statistics, T(14-18: 1), S, P. APL-STAT: A Do-It-Yourself Guide to Computational Statistics Using 
APL. James B. Ramsey, Gerald L. Musgrave. Lifetime Learning Pub, 1981, xi + 340 pp, $17.95 (P).
[ISBN: 0-534-97985-8] Explains how to use APL to carry out statistical calculations. Presupposes 
knowledge of statistics. Not an introduction to APL but the authors say many have learned APL from 
it. FLW 


Statistics, S(15-18), P, L. The Human Side of Statistical Consulting. James R. Boen, Douglas A.
Zahn. Lifetime Learning Pub, 1982, xii + 196 pp, $15.95. [ISBN: 0-534-97949-1] This helpful and 
interesting book deals with the "human, behavioral, and political issues that a consultant confronts
in organizations and in dealing with people." FLW


Statistics, T(15-18: 1, 2), S, P, L. Statistical Methods for Survival Data Analysis. Elisa T. Lee. 
Lifetime Learning Pub, 1980, xiii + 557 pp, $31.95. [ISBN: 0-534-97987-4] Parametric and
nonparametric methods for survival data analysis. Many examples and exercises use real medical data.
46 pages of Fortran program listings. 65 pages of tables. Extensive bibliography. FLW


Computer Literacy, T?(12), L? Armchair BASIC: An Absolute Beginner’s Guide to Programming in BASIC. 
Annie and David Fox. Osborne/McGraw-Hill, 1983, vii + 264 pp, $11.95 (P). [ISBN: 0-931988-92-6] 
Intended to explain popular computer terminology and Basic to "the casually curious as well as the
utterly baffled," without the need for a computer. No mention of program development, program 
structure; short giveaway quizzes, no exercises; few program examples exceed 10 lines. Glossary. 
RB 


Computer Literacy, S, L. Apple II User’s Guide. Lon Poole, Martin McNiff, Steven Cook.
Osborne/McGraw-Hill, 1981, xii + 385 pp (P). [ISBN: 0-931988-46-2] For Apple II and Apple II+,
concerning everything from locating the "on" switch to discussions of graphics, machine language
monitor program, etc. Tutorials introduce Basic from a hardware viewpoint--not suitable for most
introductory courses. A complete description of all Basic commands is included. An unusually accessible
technical reference. RB 


Computer Literacy, T*(13: 1). Introduction to Computers and Data Processing. Gary B. Shelly, Tho- 
mas J. Cashman. Anaheim Pub, 1980, (P). xvi + 458 pp [ISBN: 0-88236-115-5]; Student Workbook and 
Study Guide, 247 pp [ISBN: 0-88236-116-3]; Answer Manual for Student Workbook and Study Guide, 191 
pp; Transparency Masters, 187 pp; Test Bank, 188 pp; Instructor’s Guide and Answer Manual, 173 pp. 
Well organized with many color photographs. An introduction to data processing that emphasizes
state-of-the-art technology and techniques. AO


Computer Programming, S(13-18). Programming in Modula-2. Niklaus Wirth. Springer-Verlag, 1982, 176
pp, $13.50. [ISBN: 0-387-11674-5] Intended for someone who already knows how to program; there is, 
however, an introduction for the beginner. Modula-2 is a direct descendant of Pascal and Modula. 
The notation of the language itself encourages the principles of structured programming. Language 
includes all aspects of Pascal plus the module and multiprogramming concepts. Text includes the
Report, two appendices, and an index. RJA 


Computer Programming, S??. Your IBM Personal Computer: Use, Applications, and BASIC. David E.
Cortesi. Holt, Rinehart & Winston, 1982, xiv + 253 pp, $16.95 (P). [ISBN: 0-03-061979-3] An appalling
book at the junior-high-school level. The style is condescending and the content excessively 
fluffy--even for beginners. For example, "The prefix kilo, which usually means 1,000, here stands 
for a multiple of 1,024. There is a practical but uninteresting reason for this odd usage,..." No 
explanation whatsoever is provided--the issue is dropped. JAS


Computer Programming, T*(13: 1). Structured Problem Solving with Pascal. Lawrence J. Mazlack. 
Holt, Rinehart & Winston, 1983, xiii + 386 pp, $21.95 (P). [ISBN: 0-03-060153-3] An introduction to
top-down analysis and structured programming using Pascal. This book provides a relatively complete 
introduction to the Pascal language. AO


Computer Programming, T(13-14). Introduction to Programming with ESP and Pascal. Allen B. Tucker,
Jr. Holt, Rinehart & Winston, 1983, xvi + 363 pp, $18.95 (P). [ISBN: 0-03-059148-1] ESP = Eight 
Statement Pascal, the subset used first in the text. Overlaps other books of the author’s on Pascal
and Basic. RWN


Computer Programming, T(13: 1). Programming the IBM Personal Computer: BASIC. Neill Graham. Holt, 
Rinehart & Winston, 1982, x + 287 pp, $16.95 (P). [ISBN: 0-03-061911-4] Integrates introductory 
Basic programming with the features of the IBM personal computer. RWN 


Computer Programming, T**(13-14: 1), S, L. Problem Solving with BASIC. Richard Dillman. Holt,
Rinehart & Winston, 1983, x + 326 pp, $18.95 (P). [ISBN: 0-03-061981-5] An introductory text, usable
with micros, with solid emphasis on top-down design, standard programming structures, maintainable 
programs, learning to read manuals, and the value of these principles in commercial programming. 
Effective application of "analysis and design as a prerequisite for coding" from the beginning.
Relatively limited instruction set. A well thought out, convincing book. RB 


Computer Programming, S(13-15). Programming the PL/I Way. Dan Smedley. TAB Books, 1982, x + 288
pp, $9.95 (P). [ISBN: 0-8306-1414-1] An introduction to the programming language PL/I. The text
assumes no prior knowledge of programming and begins with the basic aspects of the language. It
concentrates almost exclusively on the syntactic/semantic aspects of the language and does not
branch out into broader issues of programming in general. It would be appropriate for those
interested specifically in learning the PL/I language. MS


Computer Programming, P. TRS-80 Color Programs. Tom Rugg, Phil Feldman. TAB Books, 1982, 328 pp, 
$25.95. [ISBN: 0-8306-1481-8] This book is simply a collection of 36 Basic programs which can be 
executed on the TRS-80 color computer manufactured by Radio Shack. The programs are in the area of 
games, mathematics, graphics, home use, and educational use. Each program is accompanied by exten- 
sive documentation about how to run it and interpret the output. MS 


Computer Programming, T??(13). Programming the IBM Personal Computer: UCSD Pascal. Seymour V.
Pollack. Holt, Rinehart & Winston, 1983, xi + 323 pp, $18.95 (P). [ISBN: 0-03-062637-4] Suitable for
quick retooling from Fortran, Basic, etc., into Pascal syntax, but not necessarily into the spirit 
of the language. Type declarations are discussed early, procedures and functions are postponed. 
Hardware-inspired understanding of procedure and function action. Not strong on algorithms, style; 
cryptic identifiers. RB 


Computer Programming. The PASCAL Handbook. Jacques Tiberghien. Sybex, 1981, xix + 485 pp, $19.95 
(P). [ISBN: 0-89588-053-9] Each keyword, symbol, standard identifier, identifier in at least one of 
Standard, HP1000, CDC, OMSI, Pascal/Z, and UCSD Pascals, is discussed in alphabetical order. Each 
entry includes syntax, description, implementation dependent features, and an example. Most are not 
worth the space, but others might be helpful to programmers writing software for different machines 
or who are interested in knowing the various extensions for the listed compilers. RBK 


Computer Programming. Introduction to the UCSD p-System. Charles W. Grant, Jon Butah. Sybex, 
1982, xiv + 300 pp, $14.95 (P). [ISBN: 0-89588-061-X] A useful, well-written, well-organized
reference for a p-system user. Explicit examples are given for the most common tasks in editing, filing,
and compiling. Includes chapters on preparing short and large-scale Pascal programs. Examples from
Version II.0, but information is valid for versions through IV. RBK


Computer Programming, T(13), S, L. Professional Programming Techniques--Starting with the BASICs.
Richard Galbraith. TAB Books, 1982, vi + 301 pp, $10.95 (P). [ISBN: 0-8306-0128-7] A readable
introduction to programming in Basic with emphasis on structure, style, and documentation. JG


Computer Programming, S??(13). FORTRAN Programs for Scientists and Engineers. Alan R. Miller.
Sybex, 1982, xvii + 280 pp, $15.95 (P). [ISBN: 0-89588-082-2] This book contains a collection of 
Fortran subroutines suitable for use on a microcomputer which implement simple numerical algorithms 
for such problems as random number generation, the solution of systems of linear equations, least 
squares curve fitting, and numerical integration. The numerical algorithms presented seem to have
been chosen for their simplicity rather than their accuracy. AO 


Data Structures, T*(14-17: 1, 2), S. Design of Computer Data Files. Owen Hanson. Computer Software
Engin. Ser. Computer Sci Pr, 1982, x + 358 pp, $24.95. [ISBN: 0-914894-17-X] Provides single source 
of information on factors influencing file design. Topics include backing storage devices, record 
formats, blocking and buffering, sequential and direct file organization, multiple-key processing. 
Chapter references and questions. Appendices. Index. RJA 


Software Systems, T(15-17: 1), S, P?, L?. Introduction to Arithmetic for Digital Systems Designers. 
Shlomo Waser, Michael J. Flynn. Holt, Rinehart & Winston, 1982, xvii + 308 pp, $35.95. [ISBN: 0- 
03-060571-7] A poorly written assemblage of basic facts and state-of-the-art algorithms for doing 
machine arithmetic (fixed and floating point), stressing "information of practical implementation
value." Abounds with redundancies, non sequiturs, sentence fragments and just plain awkwardness. 
Appendix contains specifications on many currently available MSI and LSI arithmetic chips. GHM 


Computer Science, T(15-16: 1), P. Computer Network Architectures. Anton Meijer, Paul Peeters. 
Computer Sci Pr, 1982, xi + 396 pp, $27.95. [ISBN: 0-914894-41-2] This text reviews the design and 
implementation of distributed computer systems. It uses the ISO Open System Model as its frame of
reference and discusses the various levels of protocol in that model. Part I discusses protocols in 
general, while parts II and III look at specific examples of architectures, both private (e.g., DEC- 
NET and SNA) and public (e.g., X.25). The book is intended for professionals in the field of com- 
puter networks and distributed systems, especially those involved with design and performance 
evaluation of alternative protocols. MS 


Computer Science, T?(13). Using Computers in Mathematics. Gerald H. Elgarten, Alfred S. Posamen- 
tier, Stephen E. Moresh. Addison-Wesley, 1983, 574 pp, $21.44. [ISBN: 0-201-10450-4] Elementary 
Basic through application to various topics in secondary mathematics. Sample programs are followed 
by assignments to produce different output, yet changes required are usually as trivial as altering 
the wording of a question. Mathematics is reduced to formulas, with no attempt to create under- 
standing. Programming is old-fashioned, with emphasis on flowcharts rather than structure. MW 


Computer Science, T(16-18), S, P*, L*. Computer Algebra: Symbolic and Algebraic Computation. Ed: 
B. Buchberger, et al. Computing Supplementum, No. 4. Springer-Verlag, 1982, vii + 283 pp, $44 
(P). [ISBN: 0-387-81684-4] This book of sixteen survey articles (with extensive references) is the 
first attempt at drawing together the disparate results on efficiency and implementation of
algebraic algorithms scattered throughout the literature. The articles are arranged in a "top-down"
manner: starting with high-level problems whose motivation is immediately apparent (e.g., algebraic
simplification), it proceeds to more basic problems whose solution is needed for the problems at the 
higher levels (e.g., factorization of polynomials, computing with groups, computing in extension 
fields, computing by homomorphic images). The material in this volume can be used to supplement a 
traditional algebra course or as a basic reference for a course in computer algebra. LCL 


Computer Science, T(14: 1). The Minicomputer in the Laboratory: With Examples Using the PDP-11,
Second Edition. James W. Cooper. Wiley, 1983, xviii + 381 pp, $29. [ISBN: 0-471-09012-3] An
introductory textbook in computer organization, using the PDP-11 family for all of its examples. It
first introduces the PDP-11 computer system in general, and then concentrates on specific laboratory
projects which can utilize the capabilities of this machine. Some of the sample laboratory projects
would include plotting lab data, averaging signals, performing Fourier transforms, and detecting
signal peaks. MS


Applications, P. Computerised Braille Production: Today and Tomorrow. Ed: D.W. Croisdale, H. Kamp, 
H. Werner. Springer-Verlag, 1983, xii + 422 pp, $17.20 (P). [ISBN: 0-387-12057-2] Proceedings of a 
conference held in London, May 30 to June 1, 1979. JAS 


Applications (Artificial Intelligence), S(16-18), P, L. The Handbook of Artificial Intelligence,
Volume II. Ed: Avron Barr, Edward A. Feigenbaum. William Kaufmann, 1982, xiii + 428 pp, $35. [ISBN:
0-86576-006-3] This volume covers three subfields of artificial intelligence: (i) artificial intel- 
ligence programming languages; (ii) expert systems in science, medicine, and education; (iii) 
automatic programming. List of contributors. Each chapter contains several short articles. 
Chapter overviews. Extensive volume bibliography. Name and subject indexes for Volume II. RJA 


Applications (Biology), P. Lecture Notes in Biomathematics-47: Stochastic Transport Processes in 
Discrete Biological Systems. Eckart Frehland. Springer-Verlag, 1982, viii + 169 pp, $11 (P). 
[ISBN: 0-387-11964-7] "The subject of this volume is not to introduce the mathematics of stochastic 
processes but to present a field of theoretical biophysics [ion transport through biological 
membranes] in which stochastic methods are important." RB 


Applications (Computer Organization), S(13-18), P. System Architecture. John Zarrella. Micropro- 
cessor Software Engin. Concepts Ser. Microcomputer Applic, 1980, vii + 231 pp, $10.95 (P). [ISBN: 
0-935230-02-5] Intended as an introduction to advanced computer design. Integrates ideas from 
software engineering and computer architecture. Topics include data representations, elements, and 
structures; addressing and hardware communication; parallel processing, resource protection, I/O, 
microprogramming, error detection. Important terms are in boldface on the page where first used. 
Good diagrams. Glossary. References. Index. RJA 


Applications (Engineering), T(16: 2), P. Signals and Systems. Alan V. Oppenheim, Alan S. Willsky, 
Ian T. Young. Prentice-Hall, 1983, xix + 796 pp, $32.95. [ISBN: 0-13-809731-3] Designed as an 
advanced undergraduate textbook in the areas of signals and electrical systems. Covers the topics 
of continuous time signals, discrete time signals, filtering, modulation, and both Fourier and 
Laplace transformations. It would be appropriate for a required one or two semester course for 
majors in the field of electrical engineering. Assumes a thorough grounding in calculus. MS 


Applications (Engineering), P. Tutorials in Modern Communications. Ed: Victor B. Lawrence, Joseph 
L. LoCicero, Laurence B. Milstein. Computer Sci Pr, 1983, viii + 348 pp, $33.95. [ISBN: 0-914894- 
48-X] A collection of thirty-seven tutorial articles originally published in the IEEE Communications 
Society Magazine in the following areas: quantization and switching, data and modulation tech- 
niques, computer communications, transmission, signal processing, and secure communications. AO 


Applications (Engineering), P. Annual Review of Fluid Mechanics, Volume 15, 1983. Ed: Milton van 
Dyke, J.V. Wehausen, John L. Lumley. Annual Reviews, 1983, 534 pp, $28. [ISBN: 0-8243-0715-1] 18 
expositions of current research and mathematical models of fluid dynamics, ranging from green plants 
to airfoils. Begins with a survey of Ernst Mach’s contributions to fluid mechanics, including 
numerous plates of experimental artifacts from Mach’s research. LAS 


Applications (Meteorology), P. Dynamic Meteorology: Data Assimilation Methods. Ed: Lennart 
Bengtsson, Michael Ghil, Erland Källén. Appl. Math. Sci., V. 36. Springer-Verlag, 1981, 330 pp, $18 (P). 
[ISBN: 0-387-90632-0] Selected lectures from the 1980 Seminar of the European Centre for Medium 
Range Weather Forecasts. Highly applied in nature. AWR 


Applications (Modelling), S(17-18), P. Lecture Notes in Biomathematics-46: Identifiability of State 
Space Models. Eric Walter. Springer-Verlag, 1982, viii + 202 pp, $13.50 (P). [ISBN: 0-387-11590-0] 
A monograph devoted to tests of identifiability as a necessary step in the modeling process, with 
special attention given to transformation systems (a generalization of compartmental models). LCL 


Applications (Pattern Recognition), T(17-18: 1), P. Fuzzy Techniques in Pattern Recognition. Abra- 
ham Kandel. Wiley, 1982, x + 356 pp, $34.95. [ISBN: 0-471-09136-7] Exposition of an approach to the 
problem of pattern classification based on the theory of fuzzy relations, fuzzy expectation, inexact 
matrices, et cetera. Comprehensive bibliography (3064 entries). GHM 


Applications (Physics), P*. Quantum Mechanics of Large Systems. Walter Thirring. Course in Math. 
Physics, No. 4. Transl: Evans M. Harrell. Springer-Verlag, 1983, x + 290 pp, $32. [ISBN: 0-387- 
81701-8] A rigorous exposition of quantum statistical mechanics using modern techniques (e.g., C*- 
algebras, etc.) which summarizes much of the work in this area done in the last twenty years. A 
valuable addition to the mathematical physics literature. AO 


Applications (Physics), P. Ray Methods for Waves in Elastic Solids: With Applications to Scattering 
by Cracks. J.D. Achenbach, A.K. Gautesen, H. McMaken. Mono. & Stud. in Math., No. 14. Pitman Pub, 
1982, xi + 251 pp, $49.95. [ISBN: 0-273-08453-4] Basic ideas and results of ray theory within the 
context of the theory of elasticity. Includes the necessary methods of applied mathematics, 
together with a complete discussion of the canonical problems. LCL 


Applications (Physics), T(16-17: 1). Introduction to Mathematical Fluid Dynamics. Richard E. 
Meyer. Dover Pub, 1982, vi + 185 pp, $4.50 (P). [ISBN: 0-486-61554-5] Reprint of 1971 text (TR, May 
1972). Presupposes calculus and vector analysis; with problems and appendices included after most 
sections. RB 


Applications (Physics), P. Percolation Theory for Mathematicians. Harry Kesten. Progress in Prob. 
& Stat., V. 2. Birkhäuser Boston, 1982, 423 pp, $30. [ISBN: 3-7643-3107-0] A unified exposition of 
most of the known rigorous results concerning Bernoulli percolation in undirected graphs in two 
dimensions. Some new results are presented as well. AO 


An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
parentheses. 


Southern California Section 


The Fall meeting of the Southern California Section was a joint meeting with SIAM held at Pep- 
perdine University, Malibu, California on November 13, 1982. There were about 120 people in atten- 
dance. 


Luncheon Talk: 


"Information and the Uniqueness of Man," by Raymond Redheffer, University of California, Los 
Angeles. 


Invited Addresses: 


"Computer Memories, Hamming Codes, and the Birthday Surprise," by Robert McEliece, Caltech. 

“Applications of Functional Analysis and Complex Variables in Electrical Engineering," by William 
Helton, University of California, San Diego. 

“Triptych of Singular Perturbation Problems," by Charles Lange, University of California, Los 
Angeles. 

"Parameter Identifiability and Identification," by Stavros Busenberg, Harvey Mudd College. 


Panel Discussion: 

"Undergraduate Mathematics in Non-academic Employment." Roy Danchick, Abacus Programming 
(Moderator); Joe Wertz, Aerospace Corporation; Roger Bourke, Jet Propulsion Laboratory; Edward Posner, 
Caltech; Marvin Blum, Hughes Aircraft. 

Louisiana-Mississippi Section 


The Louisiana-Mississippi Section held its annual meeting at Delta State University, Cleveland, 
Mississippi on February 18-19, 1983. With the reduction in travel funds in both states, there were 
100 registered faculty attendees and 17 student registrants. The meeting was the 60th joint meeting 
with the Louisiana-Mississippi branch of the NCTM. 


Invited Addresses: 


"Pressures for Curriculum Modification," by R.D. Anderson, Past President, MAA. 
"Geometrical Proofs of Some Theorems About Differentiable Functions," by James R. Dorrah, 
Louisiana State University. 


Student Papers: 


"Force and Velocity in the English Frog Sartorius Muscle," by Bob Philpot, Belhaven College. 

"Rose’s Rose," by John A. Bailey, Millsaps College. 

"Trapezoidal Properties," by Scott Bowie, Millsaps College. 

"Sequential Verbalization for the Inverse of a One-to-One Function,” by Laurel C. Eskridge, 
Millsaps College. 

"Fibonacci Numbers and Patterns of Independent Sets in Cycles and Paths," by Mary Harrington, 
University of Mississippi. 


Panel Discussions: 


"Entrance Requirements for Public Universities," by A.J. Hulin, University of New Orleans and John 
L. Tilley, Mississippi State University. 

"Mathematics Placement of Freshmen Students," by Nancy Dunn (Moderator), University of New 
Orleans. 

“Junior College Transfer Students--Follow-Up," by Ken Johnston (Moderator), Hinds Junior College. 


Contributed Papers: 


"The Space of Entire Functions Revisited," by L. Swetharanyam, McNeese State University. 
* "A Generalization of Euler’s Phi-Function," by S. Leigh, University of Southwestern Louisiana. 
* "Fixed Point Theorems for Certain Classes of Multi-Valued Mappings," by H. Kaneko, Mississippi 
State University. 
“Fibonacci Numbers and Graphs," by W. Staton, University of Mississippi. 
* "A Concrete Experiences Approach to Concept Formation in Developmental Arithmetic,” by R. Yellott, 
McNeese State University. 
"Microcomputers in the Junior High School Classroom,” by R. Grantham, University of Mississippi. 
"Microcomputer Aids in Teaching Trigonometry," by D. Cook, University of Mississippi. 
* "Z-Closed Spaces," by T. Thompson, University of Southwestern Louisiana. 
* "Semi-T)-Identification Spaces and S-Essentially Tp Spaces," by C. Dorsett, Louisiana Tech Univer- 
sity. 
* "Orderly Neighborhoods in a Disorderly Row," by M. Maxfield and J. Maxfield, Louisiana Tech 
University. 
"A Discrimination Procedure for Bivariate Probability Models," by B. Asrabadi, Nicholls State 
University. 


Student papers by John A. Bailey and Mary Harrington were awarded cash prizes of $50. 


Southern California Section 


The spring meeting of the Southern California Section was held on March 5, 1983 at the Univer- 
sity of California, San Diego. There were approximately 200 people in attendance. 


Invited Lectures: 


"Topology and Human Greed," by Michael Starbird, University of Texas, Austin. 

"Coding Theory and Finite Geometries," by Jacobus van Lint, Tech. University Eindhoven visiting 
Caltech. 

"Can Probability be Applied to Mathematics," by Richard P. Feynman, Caltech, Nobel Laureate. 

"Synchronization for Deep Space Codes," by Laif Swanson, Jet Propulsion Laboratory, Caltech. 


Report: 


"Diagnostic Testing in Mathematics," by Philip P. Curtis, Jr., University of California, Los 
Angeles. 


Luncheon Talk: 


"Reminiscences in the History of Mathematics," by Ernst Straus, University of California, Los 
Angeles. 


If any one man discovered differential topology, he did. See p. 397. 


Lemma 1 we get 

$$g - f = \sum_{n=1}^{\infty} (g_n - f_n).$$

Let $h_N = g - f - \sum_{n=1}^{N} (g_n - f_n)$; then $h_N \downarrow 0$, so $\Lambda(h_N) \to 0$, i.e., 

$$\nu([f, g)) = \sum_{n=1}^{\infty} \nu([f_n, g_n)).$$

As in the case of “ordinary” intervals, the proof that © is a semiring is an easy exercise. 


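Proof of the theorem. For $\nu$ as defined in Lemma 3, choose $\rho$ according to Lemma 2. Let 
$0 \le f \in F$. For $f_n = (n[f - f \wedge 1]) \wedge 1$ we have 

(A) $[0, \beta f_n) \uparrow \{f > 1\} \times [0, \beta)$ for every $\beta > 0$. 

Hence, 

$$\mu(A) = \rho(A \times [0, 1)), \quad A \in \mathcal{B},$$

is a well-defined measure with the property 

$$\rho(\{f > 1\} \times [0, \beta)) = \beta \mu(f > 1), \quad f \in F, \ \beta > 0.$$

Let $B^n = \bigcup_{i=1}^{n 2^n} (C_i^n - C_{i+1}^n) \times [0, i 2^{-n})$ with $C_i^n = \{i^{-1} 2^n f > 1\}$ for $1 \le i \le n 2^n$ and $C_i^n = \emptyset$ for 
$i = n 2^n + 1$. Then $B^n \uparrow [0, f)$ and we get 

$$\Lambda(f) = \rho([0, f)) = \lim \rho(B^n) = \int f \, d\mu.$$
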
REMARKS. 


(a) It is well known that one cannot dispense with “Stone’s condition”: $f \in F$ implies 
$f \wedge 1 \in F$. (Compare [1], 7XB for a counterexample.) However, any other condition which 
ensures for every $f \in F$ the existence of a sequence $f_n \in F$ with property (A) is also 
appropriate. 

(b) Of course, the above theorem implies the Riesz representation theorem ([3], 56D). This is a 
well-known and immediate consequence of Dini’s theorem. Another short proof for this 
special case has been given by Garling [2] (or [4], §22). Garling’s method is closely related 
to Varadarajan’s [5]. 


References 


1. D. H. Fremlin, Topological Riesz Spaces and Measure Theory, Cambridge University Press, 1974. 
2. D. J. H. Garling, A “short” proof of the Riesz representation theorem, Proc. Cambridge Philos. Soc., 73 
(1973) 459-460. 
3. P. R. Halmos, Measure Theory, Van Nostrand, Princeton, 1950. 
4. R. B. Holmes, Geometric Functional Analysis and its Applications, Springer, New York-Heidelberg-Berlin, 
1975. 
5. V. S. Varadarajan, On a theorem of F. Riesz concerning the form of linear functionals, Fund. Math., XLVI 
(1958) 209-220. 


ANSWER TO PHOTO ON PAGE 388 


Hassler Whitney, one of the recipients of the 1982 Wolf Foundation Prize in Mathematics. 


UNSOLVED PROBLEMS 
EDITED BY RICHARD GUY 


In this department the MONTHLY presents easily stated unsolved problems dealing with notions ordinarily 
encountered in undergraduate mathematics. Each problem should be accompanied by relevant references (if any 
are known to the author) and by a brief description of known partial results. Manuscripts should be sent to 
Richard Guy, Department of Mathematics and Statistics, The University of Calgary, Calgary, Alberta, Canada 
T2N 1N4. 


A CONJECTURE RELATED TO SYLVESTER’S PROBLEM 


PETER BORWEIN AND MICHAEL EDELSTEIN 
Department of Mathematics, Statistics and Computing Science, Dalhousie University, Halifax, Nova Scotia, 
Canada 


If a finite set of points spans the plane, then there exists a line through exactly two of the 
points. Posed as a problem by J. J. Sylvester in 1893, this was not solved until the 1930’s when 
various people, including P. Erdős and T. Gallai, revived interest in it. Motzkin, one of a number 
of mathematicians to contribute to the now considerable list of refinements and variations on 
Sylvester’s Problem, considered higher dimensional analogues. A substantial contribution to this 
higher dimensional problem was provided by Hansen in 1965. (See [5].) The rich literature on this 
problem may be traced through the references in [4], [6] and [7]. 
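
As a small computational illustration of Sylvester's statement (not part of the original note), the following Python sketch searches a finite planar point set for lines through exactly two of its points. The names `collinear` and `ordinary_lines` are hypothetical, and integer coordinates are assumed so that the collinearity test is exact.

```python
from itertools import combinations

def collinear(p, q, r):
    """Exact collinearity test for integer points via a 2x2 determinant."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0]) == 0

def ordinary_lines(points):
    """Pairs (p, q) whose line contains no third point of `points`."""
    return [
        (p, q)
        for p, q in combinations(points, 2)
        if not any(collinear(p, q, r) for r in points if r != p and r != q)
    ]

# The 3-by-3 integer grid spans the plane, so some pair must qualify.
grid = [(x, y) for x in range(3) for y in range(3)]
print(len(ordinary_lines(grid)))
```

The grid has eight lines through three points each, accounting for 24 of its 36 pairs, so 12 pairs remain on lines through exactly two points. Three collinear points, which do not span the plane, yield none.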

Of a distinctly similar flavor is another result of Motzkin. Given two finite and disjoint sets of 
points whose union spans the plane, then there exists a line through at least two points of one of 
the sets that does not intersect the other. Such a line is called monochrome. Essentially different 
proofs of this may be found in [4] and [9]. A beguilingly simple related question asks whether two 
finite and disjoint sets whose union spans $\mathbb{R}^4$ must generate a monochrome plane (a plane through 
at least three noncollinear points of one of the sets that does not intersect the other set). This 
result is false in $\mathbb{R}^3$ and may be easily proved in $\mathbb{R}^6$ using the results in [5]. Another higher 
dimensional variation on Motzkin’s theme is offered by Shannon [9]: if n finite and disjoint sets 
span projective n-space $P^n$, then there exists a line through at least two points of one of the sets 
that misses all the other sets. We propose the following conjecture that, if true, would provide an 
umbrella for the preceding results. 
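
Motzkin's monochrome-line result can likewise be explored by brute force. The sketch below is an illustration only, under the assumption of integer coordinates; `monochrome_lines` is a hypothetical helper that lists every pair of points from one set whose line avoids all points of the other set.

```python
from itertools import combinations

def collinear(p, q, r):
    """Exact collinearity test for integer points via a 2x2 determinant."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0]) == 0

def monochrome_lines(A, B):
    """List (label, p, q): the line pq joins two points of one set and misses the other set."""
    found = []
    for label, own, other in (("A", A, B), ("B", B, A)):
        for p, q in combinations(own, 2):
            if not any(collinear(p, q, r) for r in other):
                found.append((label, p, q))
    return found

# Two disjoint sets whose union spans the plane.
A = [(0, 0), (2, 0)]
B = [(1, 0), (1, 1)]
print(monochrome_lines(A, B))
```

In this example the only line through two points of A passes through the point (1, 0) of B, so the monochrome line guaranteed by the theorem has to come from B.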


CONJECTURE. If A and B are two finite and disjoint sets whose union spans $P^{n+m}$, then either 
there exists an A-monochrome n-flat or there exists a B-monochrome m-flat. 

By an A-monochrome n-flat we mean an affine variety of dimension n spanned by points of A that 
contains no points of B. 


In [3] this conjecture is proved for n = 1. This, by induction, implies Shannon’s result 
concerning n sets in n dimensions. There are various other immediate consequences that would 
follow from the conjecture. For instance, n sets in 2n dimensions would guarantee the existence of 
a monochrome plane. 

In [2] it is shown that any finite set E that spans $P^{2(n+m)}$ generates an (n + m)-flat spanned by 
and containing exactly n + m + 1 points of E. Thus, the conclusion of the above conjecture is 
true if we replace $P^{n+m}$ by $P^{2(n+m)}$. (As is often the case in this area, finding some dimension in 
which the problem is tractable is much simpler than determining the “correct” dimension.) If we 
consider n + m points in general position in $P^{n+m-1}$ and let A be composed of n of these points 
and let B be composed of the remaining m points, we see that we cannot hope to replace $P^{n+m}$ by 
$P^{n+m-1}$ without violating the conjecture. 

An apparently related question concerns compact sets in $\mathbb{R}^{n+m}$ which are also countable. More 
precisely, if we assume that A and B are countable and compact rather than just finite, does the 
conjecture still hold? If not, what dimension, if any, suffices to maintain the conclusion? 

As far as we know, this question is only resolved in the case of two sets in the plane. However, 
Tingley [10] shows that two compact disjoint sets in $\mathbb{R}^3$ generate a monochrome line. Perhaps by 
stepping up a dimension the condition that the sets be countable can in general be dropped. 

A curious feature of a number of results related to the conjecture is that they seem to be more 
easily proved in dual formulation. The latter questions concerning compact sets do not dualize as 
naturally. While the easiest arguments for treating Motzkin and Sylvester-like problems for finite 
sets are geometrical, the proofs of results for infinite sets are topological in nature. A perhaps 
surprising feature of many of these problems is that combinatorial methods do not prove 
particularly useful. 


MISCELLANEA 


106. Metagame (so baptized by Martin Gardner) is an ingenious new twist on the Russell 
paradox. (It was called “hypergame” by its discoverer, William S. Zwicker.) Here is how it goes. 

A two-person game (such as tic-tac-toe or chess) is finite if it always ends in a finite number of 
moves. The first move of metagame is to pick a finite game. If, for example, you and I are playing 
metagame and I have the first move, I can say “Let’s play chess.” Then you make the first move 
in a chess game, and we continue playing till that game ends. 

Is metagame finite? If it is, then, as my first move in a metagame, I can say “Let’s play 
metagame.” It is now your turn, and, as your first move in the metagame that I chose to play, you 
can say “Let’s play metagame.” The process can obviously go on indefinitely, contrary to the 
assumption that metagame is finite. The contradiction forces the conclusion that metagame is not 
finite. It follows that I cannot choose metagame as my first move in metagame; I must choose a 
finite game. Consequence: the game must end in a finite number of moves, and that contradicts 
the proved statement that metagame is not finite. 


—Paraphrased, by permission, from a 
forthcoming book by Raymond Smullyan. 


NOTES 
EDITED BY SABRA S. ANDERSON, SHELDON AXLER, AND J. ARTHUR SEEBACH, JR. 


Material for this department should be sent to Professor J. Arthur Seebach, Jr., Department of Mathematics, St. 
Olaf College, Northfield, MN 55057. 


POINTWISE LIMITS OF ANALYTIC FUNCTIONS 


KENNETH R. DAVIDSON 
Department of Pure Mathematics, University of Waterloo, Waterloo, Ontario, Canada 


What are the possible pointwise limits of analytic functions? Since analytic functions are well 
behaved, you might expect such limits to be “nice” also. Yet it turns out that some bizarre things 
can occur. The results in this paper are accessible to a student of complex analysis, but they do 
not appear to have been collected together before. 


Classical Background. It is easy to construct a sequence of continuous functions which 
converges pointwise to a function with discontinuities. However, not every function is the 
pointwise limit of continuous functions. The limit of a sequence of continuous functions which 
converges uniformly is necessarily continuous. We will be considering continuous functions on an 
open subset $\Omega$ of the complex plane. In this context, it is more natural to consider a topology of 
convergence in between uniform and pointwise convergence: uniform convergence on compact 
subsets (u.c.c.). A sequence of functions $f_n$ on $\Omega$ converges u.c.c. if for every compact subset $K$ of 
$\Omega$, the restrictions $f_n|K$ converge uniformly. If every $f_n$ is continuous, then the limit function is 
continuous on each $K$ and thus is continuous on $\Omega$. If every $f_n$ is analytic, the u.c.c. limit is analytic 
also. 
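
To make the distinction concrete, consider the powers $z^n$ on the open unit disk, an example chosen here for illustration rather than taken from the text. On each compact disk $|z| \le \rho < 1$ the supremum of $|z^n|$ is $\rho^n$, which tends to zero, so the convergence to zero is u.c.c.; yet the points $1 - 1/(2n)$ show that it is not uniform on the whole disk. A minimal Python sketch, with hypothetical helper names:

```python
def sup_on_closed_disk(n, rho):
    """Supremum of |z**n| over the compact disk |z| <= rho (equal to rho**n)."""
    return rho ** n

def witness(n):
    """A point of the open unit disk at which |z**n| stays away from 0."""
    return 1 - 1 / (2 * n)

for n in (1, 10, 100, 1000):
    # Second column shrinks to 0; third column stays near exp(-1/2).
    print(n, sup_on_closed_disk(n, 0.9), witness(n) ** n)
```

The witness values increase toward $e^{-1/2}$, so the supremum of $|z^n|$ over the open disk never drops below one half.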

In both real and complex analysis, mathematicians have looked for and found conditions on a 
family of functions $\mathcal{F}$ on $\Omega$ which guarantee that sequences taken from $\mathcal{F}$ have convergent 
subsequences. These results are closely related to our problem. To explain the classical theorems, 
we need some additional terminology. 

Recall that a function $f$ is continuous at $z_0$ if for each $\varepsilon > 0$, there is a $\delta > 0$ so that $|z - z_0| < \delta$ 
implies $|f(z) - f(z_0)| < \varepsilon$. A family of functions $\mathcal{F}$ on $\Omega$ is equicontinuous if for each $z_0$ in $\Omega$ and 
each $\varepsilon > 0$ the $\delta > 0$ given above can be chosen independent of $f$ in $\mathcal{F}$. That is, $|z - z_0| < \delta$ 
implies $|f(z) - f(z_0)| < \varepsilon$ for all $f$ in $\mathcal{F}$. 

A family of functions $\mathcal{F}$ is pointwise bounded if for each $z_0$ in $\Omega$, the set $\{f(z_0) : f \in \mathcal{F}\}$ is 
bounded. The family $\mathcal{F}$ is locally bounded if each $z_0$ in $\Omega$ is contained in an open ball $U$ so that the 
set $\{f(z) : z \in U, f \in \mathcal{F}\}$ is bounded. Since a convergent sequence is bounded, a sequence of 
continuous functions converging pointwise is pointwise bounded. However, it need not satisfy the 
stronger condition of local boundedness. A sequence converging u.c.c., though, must be locally 
bounded because every $z_0$ is contained in a ball $U$ on which the sequence converges uniformly. 

The following classical theorem from real analysis is proved in many introductory texts. 


THEOREM A (Arzelà-Ascoli). Let $\mathcal{F}$ be a family of continuous functions on $\Omega$. Every sequence in $\mathcal{F}$ 
has a subsequence converging u.c.c. if and only if $\mathcal{F}$ is equicontinuous and pointwise bounded. 


The analogous theorem for analytic functions is: 


THEOREM M (Montel). Let $\mathcal{F}$ be a family of analytic functions on $\Omega$. Every sequence in $\mathcal{F}$ has a 
subsequence converging u.c.c. if and only if $\mathcal{F}$ is locally bounded. 


Notice that the equicontinuity condition is eliminated at the price of a stronger boundedness 
condition. A natural proof of Montel’s Theorem is obtained by deducing equicontinuity and 
applying Theorem A. 

Sketch. Fix $w_0$ in $\Omega$ and let $U$ be a ball of radius $r$ about $w_0$ so that $|f(w)| \le M$ for all $w$ in $U$ 
and $f$ in $\mathcal{F}$. Let $C$ be the circle of radius $r/2$ centered at $w_0$. For every $w$ within $r/4$ of $w_0$, Cauchy’s 
Theorem gives 

$$f(w) - f(w_0) = \frac{1}{2\pi i} \int_C \left[ \frac{f(z)}{z - w} - \frac{f(z)}{z - w_0} \right] dz = \frac{1}{2\pi i} \int_C \frac{f(z)(w - w_0)}{(z - w)(z - w_0)} \, dz.$$

A simple estimate gives $|f(w) - f(w_0)| \le 4Mr^{-1}|w - w_0|$ for all $f$ in $\mathcal{F}$. From this, equicontinuity 
is immediate. □ 
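
The Lipschitz-type estimate $|f(w) - f(w_0)| \le 4Mr^{-1}|w - w_0|$ for $|w - w_0| \le r/4$ can be checked numerically on sample functions. The following Python sketch is only a sanity check under stated assumptions: `check_estimate` is a hypothetical helper, and $M$ is estimated by sampling $|f|$ on the boundary circle of $U$, which can only underestimate the true bound slightly.

```python
import cmath
import math

def check_estimate(f, w0, r, samples=720):
    """Return (worst difference quotient, 4*M/r) for f on the ball of radius r about w0."""
    directions = [cmath.exp(2j * math.pi * k / samples) for k in range(samples)]
    # Sampled maximum of |f| on the circle |z - w0| = r; by the maximum modulus
    # principle this approximates the supremum of |f| over the ball U.
    M = max(abs(f(w0 + r * u)) for u in directions)
    worst = 0.0
    for u in directions:
        for frac in (0.05, 0.15, 0.25):  # sample points with |w - w0| <= r/4
            w = w0 + frac * r * u
            worst = max(worst, abs(f(w) - f(w0)) / abs(w - w0))
    return worst, 4 * M / r

worst, bound = check_estimate(cmath.exp, 0.0, 1.0)
print(worst, bound)
```

For the exponential function on the unit ball the observed quotient is far below the bound, which reflects how crude (and how uniform in $f$) the estimate is; only the uniformity matters for equicontinuity.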

What happens when a family of analytic functions is only pointwise bounded? As we noted 
earlier, this question applies in particular to a sequence of analytic functions which converges 
pointwise on Q. The answers are dramatically different than for Montel’s Theorem. There are 
some positive results, but first we will examine some of the pathology. Another classical result, 
Runge’s Theorem, is needed to provide a way of obtaining analytic functions with prescribed 
behavior. 


THEOREM R (Runge). Let K be a compact subset of the complex plane with connected comple- 
ment. Let f be a function analytic in a neighborhood of K. Then there is a sequence of polynomials 
which converges to f uniformly on K. 


Sketch. Inside the neighborhood $U$ of $K$ on which $f$ is analytic, it is possible to draw a finite 
collection $\mathcal{C}$ of piecewise smooth curves which surround $K$ in such a way that Cauchy’s Theorem is 
valid for $f$. That is, 

$$f(z) = \frac{1}{2\pi i} \int_{\mathcal{C}} \frac{f(w)}{w - z} \, dw \quad \text{for all } z \text{ in } K.$$

This integral can be approximated by Riemann sums $\sum_{j=1}^{n} f(w_j)(w_{j+1} - w_j)/(w_j - z)$. Since $\mathcal{C}$ is 
bounded away from $K$, simple estimates show that these sums converge uniformly on $K$. This 
approximates $f$ by rational functions. In order to get polynomials, it suffices to approximate each 
function $z \mapsto (w - z)^{-1}$ by polynomials uniformly on $K$. 

When $|w|$ is strictly larger than $|z|$ for all $z$ in $K$, the geometric series $\sum_{n=0}^{\infty} z^n/w^{n+1}$ converges 
uniformly to $(w - z)^{-1}$ on $K$. A similar argument can be used to express $(w' - z)^{-1}$ in terms of 
powers of $(w - z)^{-1}$ for $w'$ near $w$. Thus if $(w - z)^{-1}$ can be approximated by polynomials on $K$, 
so can $(w' - z)^{-1}$. Because $\mathbb{C} \setminus K$ is a connected open set, a finite number of repetitions will get 
us from the “big” values of $w$ to any point in $\mathbb{C} \setminus K$. Thus a polynomial approximation is 
obtained. □ 
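
The first step of this argument, approximating $(w - z)^{-1}$ by the partial sums of the geometric series when $|w|$ exceeds $|z|$ on $K$, is easy to see numerically. In the Python sketch below, $K$ is taken (as an assumption for illustration) to be a grid of sample points in the closed unit disk and $w = 2$; `partial_sum` and `max_error` are hypothetical names.

```python
def partial_sum(z, w, N):
    """Polynomial in z: the sum of z**n / w**(n + 1) for n = 0, ..., N."""
    return sum(z ** n / w ** (n + 1) for n in range(N + 1))

def max_error(points, w, N):
    """Largest deviation of the partial sum from 1/(w - z) over the sample points."""
    return max(abs(partial_sum(z, w, N) - 1 / (w - z)) for z in points)

# Sample points of the closed unit disk, standing in for the compact set K.
K = [
    complex(x / 4, y / 4)
    for x in range(-4, 5)
    for y in range(-4, 5)
    if abs(complex(x / 4, y / 4)) <= 1
]

for N in (5, 10, 20):
    print(N, max_error(K, 2.0, N))
```

The error at $z$ equals $|z/w|^{N+1}/|w - z|$, so on this $K$ it is at most $2^{-(N+1)}$, independent of $z$; that independence is exactly the uniformity the sketch of the proof requires.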


Some Examples. 

EXAMPLE 1. Let $K_n$ be the union of the point {0}, the line segment $[1/n, n]$ and the compact 
set $S_n = \{z \in \mathbb{C} : |z| \le n \text{ and } \mathrm{dist}(z, \mathbb{R}_+) \ge 1/n\}$. (See Diagram 1 on p. 393.) Let $g_n$ be an analytic 
function which vanishes in a neighborhood of $S_n$ and $[1/n, n]$ and is constantly one in a ball 
about {0}. Let $p_n$ be a polynomial obtained by using Runge’s Theorem such that $|p_n(z) - g_n(z)| < 1/n$ 
for all $z$ in $K_n$. 

Every point $z$ in the complex plane belongs to $K_n$ for large enough $n$. Thus $\lim_{n \to \infty} p_n(z)$ exists 
pointwise and equals zero everywhere except at {0} where the limit is one! This limit is uniform on 
each set $S_n$ by construction. Hence the convergence is u.c.c. on $\mathbb{C} \setminus [0, \infty)$. The sequence $\{p_n\}$ 
cannot converge uniformly on any neighborhood of {0} because the limit function is not 
continuous at {0}. In fact, it cannot converge uniformly near any point on the positive real axis. 

To see this, notice that if $\{p_n\}$ converges uniformly on a ball $U$ of radius $r$ centered at some 
$x > 0$, then for some $n \ge 2$, $r > 1/n$ and $|p_n(z)| < 1/2$ on $U$. In addition, $|p_n(z)| < 1/n \le 1/2$ 
on $S_n$. But the union of $S_n$ and $U$ contains the circle of radius $x$ about zero. So the maximum 
modulus principle implies that $|p_n(0)| < 1/2$ which is a contradiction. □ 

DIAGRAM 1 


EXAMPLE 2. Let $L_n$ be the union of $K_n$ and the point $\{1/2n\}$. Let $h_n$ be an analytic function on 
a neighborhood of $L_n$ which is zero near $K_n$ and one at $\{1/2n\}$. Again, Runge’s Theorem provides 
polynomials $q_n$ such that $|q_n(z) - h_n(z)| < 1/n$ on $L_n$. Clearly, the sequence $\{q_n\}$ converges 
pointwise to zero everywhere. However, convergence is not uniform near {0} because $q_n(1/2n)$ is 
approximately equal to one. As in Example 1, the convergence is uniform on each $S_n$ but is not 
uniform near any point on the positive real axis. □ 


Some Positive Results. 
PROPOSITION 1. Let $\mathcal{F}$ be a pointwise bounded family of analytic functions on $\Omega$. There is a 
(maximal) dense open set $\Omega_0 \subset \Omega$ such that $\mathcal{F}$ restricted to $\Omega_0$ is locally bounded. 


Proof. For each $z$ in $\Omega$, let $\varphi(z)$ denote the least upper bound for $\{|f(z)| : f \in \mathcal{F}\}$. Because each 
$f$ in $\mathcal{F}$ is continuous, it is easy to see that $K_n = \{z : \varphi(z) \le n\}$ is (relatively) closed in $\Omega$. Since $\varphi(z)$ 
is finite for each $z$, $\Omega$ is the union of the $K_n$. The Baire Category Theorem shows that for every 
ball $U$ in $\Omega$, $K_n \cap U$ has interior for large $n$. Thus, the union $\Omega_0$ of the interiors int $K_n$ is a dense 
open subset of $\Omega$. Obviously, $\mathcal{F}$ is bounded on each int $K_n$, and hence $\mathcal{F}$ is locally bounded on $\Omega_0$. 
Conversely, if $\mathcal{F}$ is bounded by $n$ on an open set $U$, then $U$ is contained in int $K_n$ and thus is in $\Omega_0$. 
□ 


COROLLARY 2. Let $f_n$ be a sequence of analytic functions converging pointwise on $\Omega$. Then the 
limit function $f$ is analytic on a dense open subset of $\Omega$. 


Proof. A sequence of functions converging pointwise is pointwise bounded. By Proposition 1, 
$\{f_n\}$ is locally bounded on $\Omega_0$. By Montel’s Theorem, a subsequence converges u.c.c. to $f$ on $\Omega_0$, 
and hence $f$ is analytic on $\Omega_0$. □ 


PROPOSITION 3. If $f_n$ is a sequence of bounded analytic functions on $\Omega$ converging pointwise to $f$, 
then $f$ is analytic and the convergence is u.c.c. 


Proof. The family $\mathcal{F} = \{f_n\}$ is bounded and thus locally bounded. By Montel’s Theorem, every 
subsequence of $f_n$ has a sub-subsequence which converges u.c.c. This limit is $f$ perforce, so $f$ is 
analytic. If the whole sequence does not converge to $f$ u.c.c., there would be a compact set $K$ in $\Omega$, 
an $\varepsilon > 0$, a subsequence $f_{n_k}$, and points $z_k$ in $K$ such that $|f_{n_k}(z_k) - f(z_k)| \ge \varepsilon$ for $k \ge 1$. Thus no 
subsequence of $f_{n_k}$ could converge uniformly to $f$ on $K$, which is a contradiction. □ 


What Limits Are Possible? How can the set of pointwise limits of analytic functions be 
described? If $f$ is such a limit, then by Proposition 1 there is a dense open set $\Omega_0$ on which the 
convergence is u.c.c. and $f$ is analytic. On the relatively closed, nowhere dense set $C = \Omega \setminus \Omega_0$, $f$ is 
merely the pointwise limit of continuous functions. That is all that can be said, for a modification 
of Example 1 can be arranged to converge to any function with these properties. 




The idea is to choose increasing sequences $K_n$ and $C_n$ of subsets of $\Omega_0$ and $C$ respectively so 
that the complement of $K_n \cup C_n$ is connected and the union of all $K_n$’s and $C_n$’s is $\Omega$. 

Use Runge’s Theorem to approximate $f$ within $1/n$ on $K_n$ by polynomials. On $C$, there is a 
sequence $g_n$ of continuous functions converging pointwise to $f$. A theorem of Lavrentiev can be 
used to approximate $g_n$ within $1/n$ on $C_n$ by a polynomial. Finally, use Runge’s Theorem again to 
approximate both these polynomials simultaneously by a polynomial $f_n$. It is clear that $\{f_n\}$ 
converges to $f$. 

A careful choice of the sequence $g_n$, as in Example 2, allows one to ensure that the convergence 
is not uniform near any point of C. Thus one may achieve an analytic limit f with failure to 
converge u.c.c. on any given closed nowhere dense subset. 

Sequences of univalent functions (one to one and analytic) are much more nicely behaved. A 
sequence of univalent functions converging pointwise to a nonconstant function must in fact 
converge u.c.c.. In particular, the limit is always analytic. These facts will not be proved here. We 
just mention that the proof relies on a deep theorem of Montel which states that a family of 
analytic functions which omits two values is locally bounded. 


A NET OF EXPONENTIALS CONVERGING TO A NONMEASURABLE FUNCTION 


LEE A. RUBEL AND ARISTOMENIS SISKAKIS 
Department of Mathematics, University of Illinois, Urbana IL 61801 


It is a basic fact that the pointwise limit of a sequence of (Lebesgue) measurable functions must 
be measurable. It is not hard to see that when “sequence” is replaced by “net,” the result fails. 
Nevertheless, it is surprising that from the family $T = \{f(x - t) : t \in \mathbb{R}\}$ of translates of $f(x) = \exp(ix^2)$, 
a pointwise convergent net may be drawn whose limit is nonmeasurable. This is just 
another way of saying that in the space of all complex-valued functions on R, in the topology of 
pointwise convergence, the closure of T contains a nonmeasurable function. (For an extended 
discussion of nets and filters in topology, see [BAR]. Essentially, the same fact that we prove here 
is proved in [LUS, Example 8.4.45, p. 218] by other means.) 

To prove that there exists such a net of translates of $\exp(ix^2)$, let $n_k$ be a sequence of positive 
integers approaching $\infty$. There are several ways to see that there exists a subnet $(n_{k_\alpha})$ of $(n_k)$ such 
that for any bounded continuous function $g$ on $\mathbb{R}$, $\varphi(g) = \lim_\alpha g(n_{k_\alpha})$ exists. In one kind of 
language, $\varphi$ is a point of the Stone-Čech compactification of $\mathbb{R}$; see [G, Chapter 6] for more 
details. From another viewpoint, $\varphi$ arises as a weak-star limit point of the functionals of 
evaluation (on the Banach space of continuous bounded functions on $\mathbb{R}$) at the points $n_k$. In any 
case, it is clear that $\varphi$ is a complex homomorphism; that is, $\varphi$ takes complex values and 
$\varphi(g + h) = \varphi(g) + \varphi(h)$ and $\varphi(gh) = \varphi(g) \cdot \varphi(h)$. To stress the dependence on $x$ as the running 
variable, we shall write $\varphi_x(f(x))$ sometimes. We will see that if the sequence $(n_k)$ satisfies 
property P below, then $\varphi_x(f(x + t)) = \lim_\alpha f(t + n_{k_\alpha})$ is not a measurable function of $t$. 

With f(x) = exp(ix”) we have 


(f(x + t)) = 6, (Ce *”) = 6,(e* )o, (eo, (22). 
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The idea is to choose increasing sequences K, and C, of subsets of Q, and C respectively so 
that the complement of K,, U C, is connected and the union of all K,,’s and C,’s is Q. 

Use Runge’s theorem to approximate $f$ within $1/n$ on $K_n$ by polynomials. On $C_n$ there is a 
sequence $g_n$ of continuous functions converging pointwise to $f$. A theorem of Lavrentiev can be 
used to approximate $g_n$ within $1/n$ on $C_n$ by a polynomial. Finally, use Runge’s theorem again to 
approximate both these polynomials simultaneously by a polynomial $f_n$. It is clear that $\{f_n\}$ 
converges to $f$. 

A careful choice of the sequence g,,, as in Example 2, allows one to ensure that the convergence 
is not uniform near any point of C. Thus one may achieve an analytic limit f with failure to 
converge u.c.c. on any given closed nowhere dense subset. 

Sequences of univalent functions (one to one and analytic) are much more nicely behaved. A 
sequence of univalent functions converging pointwise to a nonconstant function must in fact 
converge u.c.c. In particular, the limit is always analytic. These facts will not be proved here. We 
just mention that the proof relies on a deep theorem of Montel which states that a family of 
analytic functions which omits two values is locally bounded. 

Now $\phi_x(\exp(ix^2))$ is just a complex number. It is nonzero since $\exp(i n_{k_\gamma}^2)$ has modulus 1 for all $\gamma$. 
Since $\exp(it^2)$ is a constant function with respect to $x$, we have $\phi_x(\exp(it^2)) = \exp(it^2)$, which is a 
measurable function of $t$. So we can prove that $\phi_x(f(x + t))$ is nonmeasurable if we can prove 
that $v(t) = \phi_x(\exp(2ixt))$ is nonmeasurable. 

It is immediate that $v$ satisfies the functional equation 


(1)  $v(s + t) = v(s)\,v(t)$ 


and that $|v(t)| = 1$ for all $t$, and $v(t)$ is $\pi$-periodic, so we can use the well-known fact that unless 


(2)  $v(t) = e^{i\lambda t}$ 
for some constant $\lambda$, $v$ must be nonmeasurable. (A folklore proof of this is the following: 
Suppose $v$ were measurable. The convolution 


$(u * v)(x) = \frac{1}{\pi} \int_0^{\pi} u(x - t)\,v(t)\,dt$ 


(with the usual abuse of circular notation) of two bounded measurable functions is continuous, 
but 


$(v * v)(x) = \frac{1}{\pi} \int_0^{\pi} v(x - t)\,v(t)\,dt = v(x)$ 


so that $v$ itself would be continuous. Then (2) follows in a few easy lines by studying first rational 
values of $s$ and $t$ in (1) and then using the continuity of $v$.) 


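PROPERTY P. There exist two numbers $t_0$ and $t_1$ in $\mathbb{R}$ that are linearly independent over the 
rational numbers $\mathbb{Q}$, two sequences $(l_k)$ and $(m_k)$ of positive integers, and sequences $(\varepsilon_k)$ and 
$(\delta_k)$ of real numbers with $\varepsilon_k \to 0$ and $\delta_k \to 0$ as $k \to \infty$, so that 


(3)  $2 n_k t_0 = 2\pi l_k + \delta_k,$ 
(4)  $2 n_k t_1 = (2 m_k + 1)\pi + \varepsilon_k.$ 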


Let us suppose for a minute that we have constructed our sequence $(n_k)$ to have property P. Then 
the rest is easy. We have (with the assumption that $v(t)$ is measurable) 


$e^{2i n_{k_\gamma} t} \to e^{i\lambda t}$ for all $t$. 


Taking $t = t_0$ and then $t = t_1$ and solving (3) and (4) for $n_k$ gives 
$e^{i(2\pi l_{k_\gamma} + \delta_{k_\gamma})} \to e^{i\lambda t_0},$ 
$e^{i((2 m_{k_\gamma} + 1)\pi + \varepsilon_{k_\gamma})} \to e^{i\lambda t_1}.$ 


Hence $\exp(i\lambda t_0) = +1$ and $\exp(i\lambda t_1) = -1$, so that $\lambda t_0 = 2m\pi$, $\lambda t_1 = (2l + 1)\pi$ for some 
integers $l$ and $m$, and thus $t_0/t_1 \in \mathbb{Q}$, a contradiction. 

To see that the property P can hold, let n, — oo very rapidly through even values, say 
n, = (k!)!, k = 2,3,.... Let 


t —~ 1 ft 2 (-1)" 

roe ek 
k=2 "k k=2 

where, for each k, a, = 0 or a, = 1. Let 


eo 1 2m +1 (-)" 
Qn. ’ 


j=2 J 


Neo jay Nj 2n, 


and let $\delta_k$ and $\varepsilon_k$ be defined by (3) and (4). It is easy to check that these formulas define integers $l_k$ 
and $m_k$ and that $\delta_k \to 0$ and $\varepsilon_k \to 0$. Finally, there are uncountably many choices of the sequence 
$(a_k)$, each one of them leading to a different $t_1$. Since there are only countably many $t_1$ that are 
rationally dependent on $t_0$, the result is proved. 

As a final remark, we tried, but were unable to prove from our result that there is a net of 
translates of $\sin(x^2)$ that converges to a nonmeasurable function. Could there actually be a net 
$(\sin(x - t_\gamma)^2)$ that converges to a measurable function while $(\cos(x - t_\gamma)^2)$ converges to a 
nonmeasurable function? 

A SIMPLE PROOF OF THE DANIELL-STONE REPRESENTATION THEOREM 


JURGEN KINDLER 
Department of Mathematics, TH Darmstadt, Schlossgartenstr. 7, 6100 Darmstadt, West Germany 


Let $X$ be a nonvoid set, $F$ a linear space of real valued functions on $X$ such that $f, g \in F$ 
implies $f \wedge g \in F$ and $f \wedge 1 \in F$ ($(f \wedge g)(x) = \min\{ f(x), g(x) \}$); let $\mathfrak{B}_F$ denote the smallest 
$\sigma$-ring which contains the sets $\{f > 1\} = \{x \in X : f(x) > 1\}$, $f \in F$; and let $\Lambda : F \to \mathbb{R}$ be a 
positive linear functional such that $f_n \downarrow 0$ (i.e., $f_1(x) \ge f_2(x) \ge \cdots$ and $\lim_{n \to \infty} f_n(x) = 0$ for 
every $x \in X$) implies $\lim_{n \to \infty} \Lambda(f_n) = 0$. 

We present a short and simple proof of the following fundamental theorem: 


THEOREM (Daniell-Stone). There exists a measure $\mu$ on $\mathfrak{B}_F$ such that $\Lambda(f) = \int f \, d\mu$ for every 
$f \in F$. 


In our proof we follow the same lines which are known from the construction of Lebesgue 
measure. Both constructions are based on the following two well-known facts (compare [3], §8D, 
§8 (5), and §13A). The union of disjoint sets $A_1, A_2, \dots$ is denoted by $\sum_{n=1}^{\infty} A_n$. 


LEMMA 1. For all sequences $a_n, b_n \in \mathbb{R}$ with $b_n \ge a_n$, $n \ge 0$, and $[a_0, b_0) = \sum_{n=1}^{\infty} [a_n, b_n)$ we 
have $b_0 - a_0 = \sum_{n=1}^{\infty} (b_n - a_n)$. 


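LEMMA 2. Every $\sigma$-additive nonnegative finite set function $\nu$ on a semiring $\mathfrak{S}$ can be extended to a 
measure $\rho$ on the $\sigma$-ring generated by $\mathfrak{S}$. 
Let us recall that a semiring is a nonvoid system $\mathfrak{S}$ of subsets of a fixed set with the properties 


(i) if $S \in \mathfrak{S}$ and $T \in \mathfrak{S}$, then $S \cap T \in \mathfrak{S}$, and 
(ii) if $S \in \mathfrak{S}$ and $T \in \mathfrak{S}$ and $S \subset T$, then there is a finite class $\{S_0, S_1, \dots, S_n\}$ of sets in $\mathfrak{S}$ 
such that $S = S_0 \subset S_1 \subset \cdots \subset S_n = T$ and $R_i = S_i - S_{i-1} \in \mathfrak{S}$ for $i = 1, \dots, n$. 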
Lemma 2 will be applied to the following situation: 


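LEMMA 3. The system $\mathfrak{S}$ of “intervals” 
$[f, g) = \{(x, t) \in X \times \mathbb{R} : f(x) \le t < g(x)\}$, $f \le g$, $f, g \in F$, 
is a semiring and 
$\nu : \mathfrak{S} \to \mathbb{R}_+$, $\nu([f, g)) = \Lambda(g - f)$ 
is well-defined and $\sigma$-additive. 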


Proof. $[f, g) = \sum_{n=1}^{\infty} [f_n, g_n)$ implies $[f(x), g(x)) = \sum_{n=1}^{\infty} [f_n(x), g_n(x))$ for every $x \in X$. From 
Lemma 1 we get 
$g - f = \sum_{n=1}^{\infty} (g_n - f_n).$ 


Let $h_n = g - f - \sum_{i=1}^{n} (g_i - f_i)$; then $h_n \downarrow 0$ so $\Lambda(h_n) \to 0$, i.e., 


$\nu([f, g)) = \sum_{n=1}^{\infty} \nu([f_n, g_n)).$ 


As in the case of “ordinary” intervals, the proof that $\mathfrak{S}$ is a semiring is an easy exercise. 


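Proof of the theorem. For $\nu$ as defined in Lemma 3, choose $\rho$ according to Lemma 2. Let 
$0 \le f \in F$. For $f_n = (n[f - f \wedge 1]) \wedge 1$ we have 


(A)  $[0, \beta f_n) \uparrow \{f > 1\} \times [0, \beta)$ for every $\beta > 0$. 
Hence, 

$\mu(A) = \rho(A \times [0, 1))$, $A \in \mathfrak{B}_F$ 
is a well-defined measure with the property 


$\rho(\{f > 1\} \times [0, \beta)) = \beta \mu(\{f > 1\})$, $f \in F$, $\beta > 0$. 
Let $B^n = \sum_{i=1}^{n 2^n} (C_i^n - C_{i+1}^n) \times [0, i 2^{-n})$ with $C_i^n = \{i^{-1} 2^n f > 1\}$ for $1 \le i \le n 2^n$ and $C_i^n = \emptyset$ for 
$i = n 2^n + 1$. Then $B^n \uparrow [0, f)$ and we get 


$\Lambda(f) = \rho([0, f)) = \lim \rho(B^n) = \int f \, d\mu.$ 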
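
Property (A) rests on the pointwise behaviour of the functions $f_n$. The following short computation is a sketch added for the reader's convenience and is not part of the original argument; it assumes only $0 \le f \in F$ and the definition $f_n = (n[f - f \wedge 1]) \wedge 1$.

```latex
\[
f_n(x) \;=\; \min\bigl\{\, n\,\bigl(f(x) - \min\{f(x), 1\}\bigr),\; 1 \,\bigr\}
\;=\;
\begin{cases}
0, & f(x) \le 1,\\[2pt]
\min\{\, n\,(f(x) - 1),\; 1 \,\}, & f(x) > 1 .
\end{cases}
\]
% For fixed x the right-hand side is nondecreasing in n, and for f(x) > 1
% it equals 1 as soon as n >= 1/(f(x) - 1).  Hence
\[
f_n(x) \;\uparrow\; \mathbf{1}_{\{f > 1\}}(x)
\qquad\text{and therefore}\qquad
[0, \beta f_n) \;\uparrow\; \{f > 1\} \times [0, \beta)
\quad (\beta > 0).
\]
```

Since each $f_n$ lies in $F$ by Stone's condition, the sets $[0, \beta f_n)$ belong to the semiring of Lemma 3, which is what the proof uses.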
REMARKS. 


(a) It is well known that one cannot dispense with “Stone’s condition”: $f \in F$ implies 
$f \wedge 1 \in F$. (Compare [1], 7XB for a counterexample.) However, any other condition which 
ensures for every $f \in F$ the existence of a sequence $f_n \in F$ with property (A) is also 
appropriate. 

(b) Of course, the above theorem implies the Riesz representation theorem ([3], 56D). This is a 
well-known and immediate consequence of Dini’s theorem. Another short proof for this 
special case has been given by Garling [2] (or [4], §22). Garling’s method is closely related 
to Varadarajan’s [5]. 
Michigan University, Mount Pleasant, MI 48859. 


AN ELEMENTARY APPROACH TO NP-COMPLETENESS 


A. KEITH AUSTIN 
Department of Pure Mathematics, The University, Sheffield S3 7RH, England 


In 1971 Stephen Cook of the University of Toronto showed in [1] that any problem in NP, i.e., 
any problem which can be solved by a nondeterministic Turing machine in polynomial time, could 
be reduced to a problem of consistency in the propositional calculus. As a result of this work, the 
general consistency problem in the propositional calculus was said to be NP-complete. 

Although this is a most significant result in computing science, it does, unfortunately, require 
an understanding of Turing machines to appreciate the general proof. However, the key idea of 
the proof can be illustrated by showing the reduction technique in a particular (but typical) case. 

We will see how the following Traveling Salesman problem can be reduced to a problem in the 
propositional calculus. In the map, the vertices are towns and the lines are roads, each 10 miles 
long. 


FIG. 1. 


PROBLEM: Can the salesman start at 1 and visit all the towns in a journey of only 70 miles? 
To express the problem in the propositional calculus, we write, 


$A_t^m$ = after $10 \times m$ miles he is at town $t$, 


for $m = 0, 1, 2, \dots, 7$, $t = 1, 2, 3, \dots, 8$. Given any journey of 70 miles, each $A_t^m$ is either true or 
false. 

We now express the conditions of the problem as logical statements. 

If he is at 5 after 30 miles, then he is at 3 or 4 after 40 miles, i.e., if $A_5^3$ is true, then either $A_3^4$ or 
$A_4^4$ is true. We write this as 


$A_5^3 \to (A_3^4 \vee A_4^4),$ 
writing $\to$ for “if-then” and $\vee$ for “or.” Denote this expression by $J_5^3$. Similarly we have 
$A_5^m \to (A_3^{m+1} \vee A_4^{m+1}),$ 
and 
AS > (AT*1 Vv AStl V ASt lV ARt LY AB), 


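for $m = 0, 1, 2, \dots, 6$, and so on for each town. Denote each of these by the corresponding $J_t^m$. All 
these have to be true and so we write, 


$J = J_1^0 \,\&\, J_1^1 \,\&\, \cdots \,\&\, J_1^6 \,\&\, J_2^0 \,\&\, J_2^1 \,\&\, \cdots \,\&\, J_2^6 \,\&\, \cdots \,\&\, J_8^6,$ 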


writing & for “and.” 
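Another condition is that each town has to be visited. That town 1 has to be visited can be 
expressed as 


$A_1^0 \vee A_1^1 \vee A_1^2 \vee \cdots \vee A_1^7$ 
and similarly for the other towns. Let 
$V = (A_1^0 \vee \cdots \vee A_1^7) \,\&\, (A_2^0 \vee \cdots \vee A_2^7) \,\&\, \cdots \,\&\, (A_8^0 \vee \cdots \vee A_8^7).$ 


Also he is only at one town at any one time, so we have, e.g., $A_1^3 \to \neg A_2^3$, writing $\neg$ for “not.” 
Let 


$N_1^3 = (A_1^3 \to \neg A_2^3) \,\&\, (A_1^3 \to \neg A_3^3) \,\&\, (A_1^3 \to \neg A_4^3) \,\&\, \cdots \,\&\, (A_1^3 \to \neg A_8^3).$ 
Let 
$N = N_1^0 \,\&\, N_1^1 \,\&\, \cdots \,\&\, N_8^7.$ 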


Finally, he has to start at 1 so we require $A_1^0$ to be true. 
Thus, finding a route which satisfies all the conditions of the problem is equivalent to the 
propositional calculus problem of assigning true or false to each $A_t^m$ so that the expression 


$E = J \,\&\, V \,\&\, N \,\&\, A_1^0$ 


is true. This is a particular case of the general consistency problem for the propositional calculus. 
In the general consistency problem, an expression of the propositional calculus is given, and the 
aim is to assign true or false to the variables present so that the expression is true. 
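
The reduction above can be tried out on a toy instance. The following is a minimal sketch, not the author's construction verbatim: the 4-town map `ROADS` is hypothetical (the map of Fig. 1 is not reproduced in this text), and the names `satisfies`, `consistent_assignments`, and `route` are illustrative. It builds the conditions J, V, N and the starting condition for that map and finds the consistent assignments by trying every assignment in turn.

```python
from itertools import product

# Hypothetical 4-town map (an assumption; NOT the map of Fig. 1).
ROADS = {1: [2, 3], 2: [1, 3, 4], 3: [1, 2], 4: [2]}
TOWNS = sorted(ROADS)
STEPS = len(TOWNS) - 1  # number of 10-mile legs in the journey


def satisfies(A):
    """A[m, t] is True iff the salesman is at town t after m legs."""
    # Start: he is at town 1 at time 0.
    if not A[0, 1]:
        return False
    # N: he is never at two towns at the same time.
    for m in range(STEPS + 1):
        for t in TOWNS:
            if A[m, t] and any(A[m, u] for u in TOWNS if u != t):
                return False
    # J: being at t after m legs forces a neighbour of t after m + 1 legs.
    for m in range(STEPS):
        for t in TOWNS:
            if A[m, t] and not any(A[m + 1, u] for u in ROADS[t]):
                return False
    # V: every town is visited at some time.
    return all(any(A[m, t] for m in range(STEPS + 1)) for t in TOWNS)


def consistent_assignments():
    """Try all true/false assignments to the variables, one after another."""
    keys = [(m, t) for m in range(STEPS + 1) for t in TOWNS]
    for bits in product([False, True], repeat=len(keys)):
        A = dict(zip(keys, bits))
        if satisfies(A):
            yield A


def route(A):
    """Read the sequence of towns off a consistent assignment."""
    return [t for m in range(STEPS + 1) for t in TOWNS if A[m, t]]
```

Calling `[route(A) for A in consistent_assignments()]` lists the admissible journeys for the toy map. With 16 variables the search examines $2^{16}$ assignments, which is feasible only because the instance is tiny.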

Cook’s general proof follows similar lines; he describes the working of a nondeterministic 
Turing machine by means of an expression of the propositional calculus. The technique, in fact, 
goes back to Alan Turing’s original paper of 1936, [2], in which he defined a Turing machine and 
described its working in terms of the predicate calculus, in order to show the undecidability of the 
predicate calculus. 

When we have reduced the traveling salesman or other problems in NP to the propositional 
calculus, we are left with the question of whether this reduction helps us to solve these problems. 
The solution to the consistency problem which first comes to mind is that of simply trying all 
possible assignments of true or false to the $A_t^m$ seriatim, and seeing if one makes $E$ true. However 
there are 64 $A_t^m$ and so the number of assignments is $2^{64}$. Unfortunately, there is no known 
alternative method which reduces the length of the calculation from this exponential function of 
the number of variables to a polynomial function of the number of variables. Thus the present 
situation is that the reduction has not helped us with the solution of the original problem, as it is 
not difficult to find an exponential solution for the traveling salesman in its original form. The 
question is, whether the future will bring a polynomial solution to the general consistency 
problem, or a proof that no such solution exists. 
positive integers, n =n, +n, +n; + Ny, Nn, n;, $N,,n;,, for each permutation (i,, 77,13, i,) of 
(1, 2, 3, 4). 


E 3006. Proposed by J. R. Kuttler, Johns Hopkins University. 


Find a function of the form $\alpha(x) + \beta(y)$ which best approximates $\sin x \sin y$ on the square 
$S = \{(x, y) : 0 \le x \le 2\pi,\ 0 \le y \le 2\pi\}$ in the sup norm, i.e., so that 


$\sup_{(x, y) \in S} |\sin x \sin y - \alpha(x) - \beta(y)|$ 


is as small as possible. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
Sum of Squares of Digits 


E 2925 [1982, 64]. Proposed by T. Keller, Honolulu, HI. 


Let b be a positive integer. How many integers are there, each of which, when expressed in base 
b, is equal to the sum of the squares of its digits? 


Solution by Edmund Butler, New Carrollton, MD. A simple calculation shows that 0 and 1 are 
the only 1-digit solutions, and there are no solutions with 3 or more digits. Let N(b) be the total 
number of solutions and N’(b) the number of 2-digit solutions, so N(b) = N’(b) + 2. 

Then $N'(b)$ is the number of pairs $(a_0, a_1)$ satisfying 


(1)  $a_0 + a_1 b = a_0^2 + a_1^2 \quad (0 \le a_0 \le b - 1,\ 1 \le a_1 \le b - 1).$ 

It is easily verified that there exists a 1-1 correspondence between pairs $(a_0, a_1)$ satisfying (1) 
and pairs $(p, q)$ satisfying 
(2)  $p^2 + q^2 = 1 + b^2 \quad (p \text{ odd},\ 3 \le p \le b,\ |q| \le b - 1)$ 


via the transformations $a_0 = (p + 1)/2$, $a_1 = (b + q)/2$. 
Now we express the number of solutions to (2) in terms of the number of solutions $r(k)$ to 


(3)  $c^2 + d^2 = k.$ 


Suppose $b$ is even. Then $1 + b^2$ is odd, so exactly one of $p$, $q$ is odd. Thus given a solution 
$(p, q)$ to (2) we can generate 3 others that solve (3) via $(c, d) = (-p, q)$; $(q, p)$; and $(q, -p)$. 
We also add the 8 remaining solutions $(\pm 1, \pm b)$; $(\pm b, \pm 1)$. This shows that $r(1 + b^2) = 4N'(b) 
+ 8 = 4N(b)$. 

Now suppose $b$ is odd. Then $1 + b^2 \equiv 2 \pmod 4$; hence both $p$ and $q$ must be odd. Thus from 
any solution $(p, q)$ to (2) we can generate another solution to (3) via $(-p, q)$. We also add the 4 
remaining uncounted solutions $(\pm 1, \pm b)$. This shows 


$r(1 + b^2) = 2N'(b) + 4 = 2N(b).$ 


Now $r(k)$ is a famous quantity and can be computed from a well-known formula (for example, 
Hardy and Wright, Introduction to the Theory of Numbers, 241). Using the fact that no prime of 
the form $4n + 3$ can divide $1 + b^2$, we find that 


$r(1 + b^2) = \begin{cases} 4\tau(1 + b^2), & b \text{ even} \\ 2\tau(1 + b^2), & b \text{ odd} \end{cases}$ 


where $\tau(n)$ is the number of positive integral divisors of $n$. Therefore we find $N(b) = \tau(1 + b^2)$. 
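
The closing formula $N(b) = \tau(1 + b^2)$ is easy to check numerically for small bases. The sketch below is an illustration only; the function names are hypothetical, and the search bound $b^3$ is an assumption justified by the solution's remark that no solution has 3 or more digits (the bound in fact also scans all 3-digit numbers).

```python
def digit_square_sum(n, b):
    """Sum of the squares of the base-b digits of n."""
    total = 0
    while n:
        n, d = divmod(n, b)
        total += d * d
    return total


def count_solutions(b):
    """Brute-force N(b): integers equal to the sum of the squares of their base-b digits.

    Scans 0 <= n < b**3, i.e. every number with at most three base-b digits.
    """
    return sum(1 for n in range(b ** 3) if digit_square_sum(n, b) == n)


def tau(n):
    """Number of positive divisors of n (naive count, fine for small n)."""
    return sum(1 for d in range(1, n + 1) if n % d == 0)
```

For instance, comparing `count_solutions(b)` with `tau(1 + b * b)` for a range of small `b` reproduces the statement of the solution on those bases; it is a sanity check, not a proof.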


Also solved by M. Golomb, Purdue University; R. T. Koether, Hampden-Sydney, VA; O. P. Lossers, 
Netherlands; G. Shulman, Teaneck, NJ, and the proposer. 
ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor G. L. Alexanderson, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by November 30, 1983. The solver’s full 
post-office address should be on each sheet. 


6430. Proposed by J. Borwein, Carnegie-Mellon University. 


Recall that the p-trace average of a positive definite symmetric $n \times n$ matrix $C$ (we write 
$C > 0$) is given by 


$j_p(C) := \left( \frac{1}{n} \sum_{i=1}^{n} \lambda_i^p \right)^{1/p}$ 
for $-\infty < p < \infty$, $p \ne 0$. Here $\lambda_1 \le \lambda_2 \le \cdots \le \lambda_n$ are the eigenvalues of $C$. Extend this 
definition by continuity for $p = 0$, $\infty$, and $-\infty$. Thus 


$j_0(C) := (\det C)^{1/n}$, $j_\infty(C) := \lambda_n$, and $j_{-\infty}(C) := \lambda_1$. 


Let $A(C)$ be the diagonal matrix whose diagonal coincides with the diagonal of $C$. Show that 
(i)  $j_p(C) \le j_p(A(C))$ for $-\infty \le p \le 1$, 


and 
(ii)  $j_p(C) \ge j_p(A(C))$ for $1 \le p \le \infty$. 


Note that for $p = 0$ this is the well-known Hadamard inequality. 


6431. Proposed by Gabor J. Szekely and Andras Zempleni, Budapest, Hungary. 


If X and Y are independent nonnegative integer-valued random variables and XY has a 
Poisson distribution, show that either X or Y takes at most two values, zero and one, with 
probability one. (In this sense the Poisson distribution is irreducible.) 


6432. Proposed by Michael Barr, McGill University. 


Problem 4 on p. 348 of Jacobson’s Basic Algebra I reads, “Call a linear transformation normal 
if it commutes with its adjoint [with respect to a symmetric or alternating, nonsingular bilinear 
form on the vector space V, not its classical adjoint]. ... Show that if U [a subspace of V] is 
stabilized by a normal linear transformation T, then $U^{\perp}$ is stabilized by T.” 

(a) Show by example that the assertion is false. 

(b) Show that if the form is anisotropic on a finite dimensional space (no nonzero vector is 
orthogonal to itself—this requires the form to be symmetric) and the ground field is perfect, then 
T is diagonalizable over the algebraic closure of the ground field. 

(c)* Is it the case that if U is stabilized by a linear transformation T normal with respect to an 
anisotropic form (over a perfect field, if necessary), then $U^{\perp}$ is stabilized by T? 


6433. Proposed by Edmund Butler, New Carrollton, MD. 


Let $X_1, X_2, \dots, X_{n+1}$ be a sequence of independent random variables uniformly distributed on 
$[0, 1]$. For any sequence $\varepsilon_1, \varepsilon_2, \dots, \varepsilon_n$ of $\pm 1$’s let 


$P_n(\varepsilon_1, \varepsilon_2, \dots, \varepsilon_n) = \Pr\{\operatorname{sign}(X_1 - X_2) = \varepsilon_1,\ \operatorname{sign}(X_2 - X_3) = \varepsilon_2,\ \dots,\ \operatorname{sign}(X_n - X_{n+1}) = \varepsilon_n\}.$ 


(a) Show that $M_n = \max_{\varepsilon_i = \pm 1} P_n(\varepsilon_1, \varepsilon_2, \dots, \varepsilon_n)$ is assumed when the $\varepsilon$’s alternate in sign. 
(b) Find $\lim_{n \to \infty} M_n^{1/n}$. 

6434. Proposed by Allen J. Schwenk, University of Waterloo. 


The Petersen graph shown below has 15 edges. Its adjacency matrix $P$ is formed by setting the 
$(i, j)$ entry equal to 1 if vertex $i$ is adjacent to vertex $j$ and to 0 otherwise. 
(a) Show that its characteristic polynomial is 


$(x - 3)(x - 1)^5 (x + 2)^4.$ 


(b) Can the 45 edges of the complete graph $K_{10}$ be partitioned into three copies of the Petersen 
graph? 


6435. Proposed by Thomas Q. Sibley, Cuttington University College, Liberia. 
Let $m$ be any finitely additive extension of Lebesgue measure to all subsets of $[0, 1]$. Are there 
two subsets $A$, $B$ of $[0, 1]$ such that $A \cap B = \emptyset$, $A \cup B = [0, 1]$ and for any $0 \le a < b \le 1$ 


$m(A \cap [a, b]) = m(B \cap [a, b]) = \tfrac{1}{2}(b - a)$? 


SOLUTIONS OF ADVANCED PROBLEMS 
Volume of a Certain Convex Polytope 


5872* [1972, 913]. Proposed by Shmuel Schreiber, Bar-Ilan University, Israel. 


Let $C_n$ denote the region in Euclidean $x$ space defined by $x_i \ge 0$ for $i = 1, \dots, n$ and $y_j \ge 0$ for 
$j = 1, \dots, n$, where 


$x_1 = 1 - 2y_1 + y_2$ 
$x_i = 1 + y_{i-1} - 2y_i + y_{i+1} \quad (2 \le i \le n - 1)$ 
$x_n = 1 + y_{n-1} - 2y_n.$ 


Prove that $C_n$ is a convex polytope of the combinatorial type of a cube and that its volume is 
$(n + 1)^{n-1}/(n!)$. (The result has some use in tournament theory.) 


Solution by H. Debrunner, Universitat Bern, Switzerland. 


1. In contrast to the problem text we prefer to consider C,, in the euclidean space E” given by 
the n-tuples y = (y,,..., y,). The combinatorial type is invariant under the affine mapping x 
relating x to y, but the volume will differ by a factor n + 1, the absolute value of det x (cf. [3]). 
The problem shows only part of cyclic symmetry; in order to complete it we introduce a further 
coordinate $y_{n+1}$ which takes the value 0 on $C_n$. Consequently, we put $m := n + 1$ and we will 
prove that $C_{m-1}$ has the combinatorial type of an $(m - 1)$-cube and has in its $y$-space $(m - 1)$-volume 
$\mathrm{vol}\, C_{m-1} = m^{m-3}/(m - 1)!$. We assume $m \ge 3$ throughout. 

2. For $t \in \mathbb{R}$, let $F_m(t)$ denote the $m \times m$-matrix with elements $f_{ij} = 2$, $f_{ij} = -t$, $f_{ij} = 0$ in the 
cases $j = i$, $j = i \pm 1$, $j \ne i, i \pm 1$, resp., where indices are taken modulo $m$ so that we have cyclic 
symmetry. For $i = 1, \dots, m$, we let $f_i(t) \in E^m$ denote the $i$th row of $F_m(t)$ and $e_i \in E^m$ the 
vector with coordinates $\delta_{ij}$ (Kronecker symbol). We shall consider the halfspaces $H_i^0(t)$ and $H_i^1(t)$ 
of $E^m$ given by $y \cdot e_i \ge 0$ and $y \cdot f_i(t) \le 1$, resp., and also their boundary hyperplanes $\partial H_i^0(t)$, 
$\partial H_i^1(t)$ given by the equations 


$(^0_i)$: $y \cdot e_i = 0$, $\quad (^1_i)$: $y \cdot f_i(t) - 1 = 0 \quad (i = 1, \dots, m).$ 


We introduce the (unbounded) polytopes $A^0(t) := H_1^0(t) \cap \cdots \cap H_m^0(t)$ and $A^1(t) := H_1^1(t) \cap 
\cdots \cap H_m^1(t)$ and their intersection $B(t) := A^0(t) \cap A^1(t)$. Of course $H_i^0(t)$, $\partial H_i^0(t)$ and $A^0(t)$ 
are actually independent of $t$ so that we may also use the shorter notations $H_i^0$, $\partial H_i^0$, $A^0$. 


3. Two values of $t$ merit special interest: for $t = 0$, $B(0)$ is an honest metric $m$-cube of side 
length 1/2; and for $t = 1$, the $C_{m-1}$ of the problem text is the facet $y_m = 0$ of the polytope $B(1)$. 
So we introduce this facet also in the general case by $C(t) := \partial H_m^0 \cap B(t)$. (See Fig. 1.) Then $C(0)$ 
is an $(m - 1)$-cube and $C(1)$ must be shown to have the same combinatorial type. Our method to 
tackle this problem consists in proving that the combinatorial types of $B(t)$ and $C(t)$ remain 
unchanged when $t$ varies from 0 to 1. Actually we will meet the difficulty that the combinatorial 
type of $B(t)$ changes exactly for $t = 1$, since the vertex $(2 - 2t)^{-1}(e_1 + \cdots + e_m)$ of $B(t)$, 
diagonally opposite to the origin, tends to infinity as $t \to 1-$. But we will see that this does not 
affect the type of the facet $C(t)$ and, on the other hand, will help to determine $\mathrm{vol}\, C_{m-1}$. 


4. We will need various determinants related to $F_m(t)$; for any square matrix $P$ we denote by 
$P^*$ its determinant. Let $X_j(t)$ for $j = 1, \dots, m - 1$ denote the principal $j \times j$-minor of $F_m(t)$; note 
that $-X_n(1)$ is the matrix of coefficients appearing in the original problem text. If in $X_j(t)$ we 
replace the first column by $(1, \dots, 1) \in E^j$, the resulting matrix will be denoted by $Y_j(t)$; here we 
remark that the determinant value would be the same (also in sign) if we had replaced the last 
column of $X_j(t)$ by $(1, \dots, 1)$. Now, expanding along the first row and then some of the minors 
along their first columns, we get recursion formulas 

(1)  $X_j^*(t) = 2X_{j-1}^*(t) - t^2 X_{j-2}^*(t)$ 

and 

(2)  $Y_j^*(t) = X_{j-1}^*(t) + t\,Y_{j-1}^*(t).$ 


With the starting values $X_0^*(t) = 1$, $X_1^*(t) = 2$, $X_2^*(t) = 4 - t^2$, $Y_1^*(t) = 1$, $Y_2^*(t) = 2 + t$, 
induction arguments now yield the special values 


(3)  $X_n^*(1) = n + 1, \quad Y_n^*(1) = \tfrac{1}{2} n(n + 1) \quad (n = 1, 2, \dots)$ 
and the inequalities 
(4)  $X_j^*(t) > X_{j-1}^*(t) > 0$ and $Y_j^*(t) > 0$ for $0 \le t \le 1$. 


Since $F_m(t)$ is a circulant matrix, its determinant can easily be determined explicitly (as the 
product of its eigenvalues), cf. [1, p. 73] or [2], namely 


(5)  $F_m^*(t) = 2^{m-1}(2 - 2t) \prod_{j=1}^{m-1} \left(1 - t \cos\frac{2\pi j}{m}\right).$ 
Finally, we consider the symmetric matrix $Z_{m+1}(t)$ resulting from $F_m(t)$ by bordering with a 
foremost row and column $f_0 := (0, 1, \dots, 1) \in E^{m+1}$ intersecting in the common entry 0. If $t \ne 1$, 
we multiply the sum of the last $m$ columns of $Z_{m+1}(t)$ by $(2 - 2t)^{-1}$ and subtract this from the 
foremost column; so we get 


(6)  $Z_{m+1}^*(t) = -(2 - 2t)^{-1}\, m\, F_m^*(t)$ for $t \ne 1$. 
If $t = 1$, we add the last $m - 1$ columns of $Z_{m+1}(1)$ to the column $(1, f_1(1))$ and similarly for the 
rows, and, in view of $\sum_{i=1}^{m} f_i(1) = 0$, we get 


(7)  $Z_{m+1}^*(1) = -m^2 X_{m-1}^*(1) = -m^3.$ 
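
The determinant identities of this section lend themselves to an exact machine check for small sizes. The sketch below is an illustration under stated assumptions, not part of the solution: the helper names `det`, `F`, `X`, `Y`, `Z` are hypothetical, $F_m(t)$ is taken to be the symmetric cyclic matrix with 2 on the diagonal and $-t$ in the two neighbouring positions (indices mod $m$), and all arithmetic is done with rational numbers.

```python
from fractions import Fraction


def det(M):
    """Exact determinant by Gaussian elimination over the rationals."""
    M = [[Fraction(x) for x in row] for row in M]
    n, d = len(M), Fraction(1)
    for c in range(n):
        # Find a pivot row for column c.
        p = next((r for r in range(c, n) if M[r][c] != 0), None)
        if p is None:
            return Fraction(0)
        if p != c:
            M[c], M[p] = M[p], M[c]
            d = -d
        d *= M[c][c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n):
                M[r][k] -= f * M[c][k]
    return d


def F(m, t):
    """Cyclic m x m matrix: 2 on the diagonal, -t next to it (indices mod m)."""
    M = [[0] * m for _ in range(m)]
    for i in range(m):
        M[i][i] = 2
        M[i][(i + 1) % m] = -t
        M[i][(i - 1) % m] = -t
    return M


def X(j, t):
    """Leading principal j x j block of F (size j + 2 avoids the wrap-around entries)."""
    return [row[:j] for row in F(j + 2, t)[:j]]


def Y(j, t):
    """X(j, t) with its first column replaced by ones."""
    return [[1] + row[1:] for row in X(j, t)]


def Z(m, t):
    """F(m, t) bordered by a foremost row and column (0, 1, ..., 1)."""
    return [[0] + [1] * m] + [[1] + row for row in F(m, t)]
```

With these helpers one can compare, for example, `det(X(j, t))` against the recursion (1), or `det(Z(m, 1))` against the value in (7), for a handful of small `j`, `m` and rational `t`.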


5. Obviously our $A^0$ is a simplicial cone with apex 0. Next we show that $A^1(t)$ is, for $0 \le t < 1$, 
a simplicial cone with apex $p_0 := (2 - 2t)^{-1}(e_1 + \cdots + e_m)$ and, for $t = 1$, a prismatic region 
with simplicial cross section and with axis $\mathbb{R}(e_1 + \cdots + e_m)$. This is clear from the inequalities 
defining the $H_i^1(t)$ ($i = 1, \dots, m$) once we know that $f_1(t), \dots, f_m(t)$ are linearly independent for 
$0 \le t < 1$ and span an $(m - 1)$-dimensional subspace orthogonal to $e_1 + \cdots + e_m$ for $t = 1$. The 
orthogonality conditions $f_i(1) \cdot (e_1 + \cdots + e_m) = 0$ are easily checked; the remaining assertion 
follows from rank considerations for $F_m(t)$. In fact, (5) shows $\mathrm{rank}(F_m(t)) = m$ for $0 \le t < 1$, and 
(5) together with (4) shows $\mathrm{rank}(F_m(1)) = m - 1$, since $X_{m-1}(1)$ is a minor of $F_m(1)$. 


6. As ¢t runs from 0 upwards, the combinatorial type of B(t,)) is the same as that of the cube 
B(0), if for each t € [0, to] 

(i) the m hyperplanes 0H;'(t),..., 0H{~(t) (where e; € (0, 1} for i = 1,..., m) have linearly 
independent normal vectors and thus intersect in exactly one point P*(t); this for each of the 2” 
possible choices of ¢ = (€),..., €,,). And in addition: 

(ii) the m + 1 hyperplanes 0H/~*(t), OHf'(t),..., OH{m(t) (where i € (1,..., m} and ¢, € 
(0, 1}) have no common point; this for each of the 2”~'m different choices for i and e i (J # i). By 
continuity the condition (11) guarantees in the presence of (i) that each point P*(t) keeps in the 
interior of the halfspaces H;~ *(t), since for t = 0 it is in this interior. So P*(t,) will be a vertex of 
B(t,), and for t = ¢,) we will have the same incidence relations between vertices and facets of B(t) 
as for t = 0. For the sufficiency of this condition see [3, p. 41, Exercise 3]. The same conditions (1) 
and (ii), restricted to the fixed choice e,, = 0, will guarantee that the combinatorial type of C(t)) 
(= 0H2(t)) N B(to)) is the same as that of the (m — 1)-cube C(0). For the (m — 2)-dimensional 
hyperplanes 0H®(t) A 0H® (i = 1,..., m— 1) in the (m — 1)-plane 0H® intersect just like the 
hyperplanes listed in (1) if €,, = 0. 


7. We have already checked in §5 that condition (1), for e = (1,1,..., 1), 1s true if 0 < ¢ < 1 
and fails if ¢ = 1. We next check that (i) is true in the case L" «, < mif 0 < t < 1. Considering the 
cyclic symmetry of F,,(t), we may assume without loss of generality that one of the vanishing «,’s 
is e, = 0. The matrix N‘(t) which checks the linear independence of the normals to the 
hyperplanes listed in (i) arises from F,,(t) by replacing the ith row by e, ife; = 0 (i = 1,..., m). If 
we assume e,; = 0 exactly for the index values 1 = i, <i, < --- <i,, then with i,,,:= m+ 1, 
we easily see 


det Ni(t) = I] XF-i.-1(4), 
s=1 


so by (5) Ni(t) has rank m for 0 <¢< 1. Summing up, condition (1) is generally true for 
0 < ¢ < 1, and in the restricted case ¢,, = 0 it is true for0 <¢< 1. 


8. To deal with condition (11) we again exploit cyclic symmetry by assuming without loss of 
generality that i = 1. The listed hyperplanes then have no point in common if the system of m + 1 


equations (9) ( ), ae wey (5 (for notation see §2) has no solution y € E”. It amounts to 


the same to say that this system made homogeneous (by writing, e.g., first $-y_i/y_0$ in place of $y_i$ 
($i = 1, \dots, m$) and then multiplying the equations by $y_0$) has only the trivial solution $(0, \dots, 0) \in 
E^{m+1}$. The $(m + 1) \times (m + 1)$-matrix $M^{\varepsilon}(t)$ of this system of equations can be got from the 
matrix $F_m(t)$ by adjoining first a foremost row $e_1 = (1, 0, \dots, 0) \in E^m$, then replacing the $i$th row 
$f_i(t)$ by $e_i$ if $\varepsilon_i = 0$ ($i = 2, \dots, m$), and finally adjoining a foremost column $(0, 1, \varepsilon_2, \varepsilon_3, \dots, \varepsilon_m)$. 
For the computation of |det M*(t)| we may first delete all the rows of M*(t) beginning with an 
entry 0 and the corresponding columns which cross these rows in their only nonvanishing entry 1. 
If then we expand along the first remaining row, which will be (1, —7,0,..., 0,0) or (1, —2,0,..., 
0, —t) according to whether «,, = 0 or €,, = 1, we see (in the second case after some reordering of 
the columns of the algebraic complement to the last entry of the first line) that det M*(t) is a sum 
of products consisting of factors t/, X*(t) and Y*(t) (j, 7, s © {1,2,... }), all the summands being 
of the same sign. So, in view of (5), we have det M*(t) # 0 for 0 < ¢t < 1. Summing up, condition 
(ii) is true for 0 < t < 1 in the general case as well as in the restricted case e,, = 0. Accordingly, 
we now have proved that C,,_, has the combinatorial type of an (m — 1)-cube and, moreover, 
that the same holds for C(t) (0 < t < 1) and that B(t) has the combinatorial type of an m-cube 
(0<t< 1). 


9. To compute the (m — 1)-volume of the facet C(1) we introduce a further halfspace 
Hy = [> e E"ly- die,-> oh 


Its boundary hyperplane 0H) contains the origin, has normal vector (1,1,..., 1) and is the 
subspace spanned by /,(1),..., f,,(1). Each of the sets B(r), A°(t), A'(t), (0), Hg is invariant under 
the orthogonal transformation § which permutes the coordinate axes cyclically. So we have the 
same cyclic symmetry on 0A°(1) A'(1) which consists of the m congruent facets 
C(1), §C(1),..., &"- 'CQ1) of B(1). But orthogonal projection onto 0Hj (along the axis R(e, 
+ +--+ e,) of the prism A'(1)) maps 0A°(1) N A'(1) bijectively onto 0H}  A'(1) and maps 
each of the facets {'C(1) with projection factor m~'”* (= cosx(e,, Le,)). (See Fig. 1 and note that 
the prism A'(1), indicated there by stakes on the edges, has nonregular cross section for m > 3.) 
So if, for 0 < ¢ < 1, we define y,(t) as the (m — 1)-volume of 0H} M A'(t), we have 


(8) ¥o(1) = m- m~'/* vol C(1) = m'” vol C,,_ |. 


To evaluate the left-hand side of (8) by means of yo(1) = lim,_,;_yYo(t), we consider, for 
0 <¢ <1, the m-simplex S(t) = Hj N A'(t). It has the apex py = (2 — 2t)”'Le, of A'(t) as a 
vertex, opposite to the facet 0H} © A'(t), and the corresponding height, which coincides by 
{-symmetry with the distance from O to po, is 


(9) hy = (2—2t) 'm'”. 
We have for the m-volume V(t) of S(t) the equation 
(10) mV(t) = yo(t)ho. 


For i=1,...,m, let p, € 0H) be the vertex of S(t) which is opposite to the ith facet 
0H}(t) M S(t) having outer normal vector f,(t), and let y,(t) denote the (m — 1)-volume of this 
facet, h, the distance from p, to 0H/(t) (ith height) and d; the distance from the origin O to 
0H'(t). We dissect the simplex S(1) into m pyramids each having the origin O (which is a relative 
interior point of the facet 0H} \ S(t)) as its apex and a facet of S(t) as its base. All these 
pyramids are congruent by means of §. So we have 


mV(t)=y7,(t)A, = Y (0d, =my(t)d, (i=1,...,m) 


and, in particular, 
(11) h/d,=m (i=1,...,m). 


If we evaluate the left-hand side of equation (§ for the vertex p,, Hesse’s normal form formula 


yields 
All these incidence relations between vertices and facets of $S(t)$ are contained in the matrix 
equation 


(12) 


It is well known that the determinant of the first matrix is $m!\,V(t)$. That of the second is 
$-Z_{m+1}^*(t)$, see §4. So, with (10) and (6), the determinant relation associated to (12) reads 


$(m - 1)!\,\gamma_0(t)\,h_0\,Z_{m+1}^*(t) = \pm h_0\, m^{m+1/2}$ 
or 
(13)  $\gamma_0(t) = \left| m^{m+1/2} / \big((m - 1)!\, Z_{m+1}^*(t)\big) \right|.$ 
Combining this with (7) and (8), we get 


$\mathrm{vol}\, C_{m-1} = m^{-1/2} \lim_{t \to 1-} \gamma_0(t) = \left| m^{m} / \big((m - 1)!\, Z_{m+1}^*(1)\big) \right| = m^{m-3}/(m - 1)!,$ 
as asserted. 


References 


1. Ph.J. Davis, Circulant Matrices, Wiley, New York, 1979. 
2. E. T. Wong, Polygons, circulant matrices, and Moore-Penrose inverses, this MONTHLY, 88(1981) 509-515. 
3. B. Grunbaum, Convex Polytopes, Wiley, London, 1967. 


Images of Monotone Functions 


6218 [1978, 500; 1982, 134]. Proposed by M. J. Pelling, Balliol College, Oxford, England. 


Let S be a subset of the real line R having cardinality of the continuum. Is there always a 
monotonic f: R → R such that m*(f(S)) > 0, where m* is outer Lebesgue measure? 


Solution by A. W. Miller, University of Texas. This was partially solved by Fred Galvin (see this 
MONTHLY, 89 (1982) 134-135). He showed that the continuum hypothesis (actually SBCT which 
is much weaker) implies that there is a subset S ⊂ R of cardinality the continuum such that every 
monotonic image of S has measure zero. 

A positive answer to Pelling’s problem is also consistent with ZFC (Zermelo-Fraenkel set 
theory with the axiom of choice). A set of reals X has universal measure zero iff for every 
countably additive, nonatomic measure μ on the reals, μ(X) = 0. 

It is an unpublished result of J. Baumgartner and R. Laver that if ZFC is consistent, then so is 
ZFC plus the statement that every set of reals of cardinality the continuum fails to have universal measure zero. 
This result was announced in Laver, R. “On the consistency of Borel’s conjecture,” Acta Math., 
137 (1976) 151-169. A different proof of it will also appear in Miller, A., “Mapping a set of reals 
onto the reals,” to appear in the Journal of Symbolic Logic (1983). This result combined with the 
following construction of Szpilrajn-Marczewski completely settles Pelling’s problem. 


THEOREM (Szpilrajn-Marczewski). Suppose S ⊂ R and μ is a countably additive, nonatomic 
measure such that μ(S) > 0. Then there exists a monotonic function f: R → R such that 
m*(f(S)) > 0. 


Proof. We may assume that μ does not vanish on any interval since we could replace μ by 
½(μ + λ) where λ is Lebesgue measure. Now define f by 


$$f(x) = \mu([0, x]).$$

Since μ vanishes on no intervals and is atomless, we see that f is strictly increasing and continuous 
and thus a homeomorphism. Define 


$$\nu(B) = \mu(f^{-1}(B))$$


for any Borel set B. We are done if we show that ν is Lebesgue measure. Suppose I = [a, b] is any 
interval. Then 


$$f^{-1}(I) = \{y : \mu([0, y]) \in I\} = [c, d]$$

where μ([0, c]) = a and μ([0, d]) = b. But then 

$$\nu(I) = \mu(f^{-1}(I)) = \mu([c, d]) = b - a.$$

Since ν agrees with Lebesgue measure on the intervals and the intervals generate the Borel sets, we 
have that ν is Lebesgue measure. □ 
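
The construction in the proof can be tried on a concrete measure. The sketch below is only an illustration under an assumed choice that does not come from the solution: μ is taken to have density 2x on [0, 1], so that f(x) = μ([0, x]) = x² there. The helper names are hypothetical. It checks that the pushed-forward measure ν assigns length b - a to an interval [a, b].

```python
import math


def mu_interval(c, d):
    """mu([c, d]) for the assumed measure with density 2x on [0, 1]."""
    return d * d - c * c


def f(x):
    """The map from the proof: f(x) = mu([0, x])."""
    return mu_interval(0.0, x)


def f_inverse(y):
    """Inverse of f on [0, 1]; valid because f is strictly increasing there."""
    return math.sqrt(y)


def nu_interval(a, b):
    """nu([a, b]) = mu(f^{-1}([a, b])) = mu([c, d]) with f(c) = a, f(d) = b."""
    return mu_interval(f_inverse(a), f_inverse(b))


if __name__ == "__main__":
    a, b = 0.2, 0.7
    print(nu_interval(a, b), b - a)  # the two numbers should agree
```

For this particular μ the agreement is an algebraic identity; for a general atomless μ the same computation would need a numerical inverse of f.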


Stochastic Matrices 


6366 [1981, 711]. Proposed by Emeric Deutsch, Polytechnic Institute of New York. 


Let A be a row stochastic matrix such that ||A|| = 1, || || being the operator norm induced by 
the Euclidean vector norm. Show that A is doubly stochastic. 


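Solution by Enzo R. Gentile, Universidad de Buenos Aires, Argentina. Let $A = (a_{ij})$ be a real 
n × n matrix. We shall prove that 


$$\|A\| = \max_{x \ne 0} \frac{\|Ax\|}{\|x\|} = 1 \quad\text{and}\quad \sum_{i,j} a_{ij} = n$$


imply that A is doubly stochastic. Since $\|A\| = \|A^T\| = 1$ where $A^T$ is the transpose of A, it will be 
enough to prove that the sum 


$$S = \sum_j \Bigl(1 - \sum_i a_{ij}\Bigr)^2 = \sum_j \Bigl[1 + \Bigl(\sum_i a_{ij}\Bigr)^2 - 2\sum_i a_{ij}\Bigr] = -n + \sum_j \Bigl(\sum_i a_{ij}\Bigr)^2$$


is 0. 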
Taking x = (1, ..., 1), we have 

$$\sum_j \Bigl(\sum_i a_{ij}\Bigr)^2 = \|A^T x\|^2 \le \|x\|^2 = n,$$


from which it follows that S < 0. Hence S = 0. 
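
The quantities in this argument are easy to compute for small matrices. The following sketch uses hypothetical helper names and two 2 × 2 examples chosen only for illustration. It evaluates S together with the ratio ||A^T x||²/||x||² for x = (1, ..., 1), which is a lower bound for ||A||² and equals 1 + S/n whenever the entries sum to n.

```python
def column_sums(a):
    """Column sums of a square matrix given as a list of rows."""
    n = len(a)
    return [sum(a[i][j] for i in range(n)) for j in range(n)]


def defect(a):
    """S = sum over columns j of (1 - column sum j)^2."""
    return sum((1.0 - c) ** 2 for c in column_sums(a))


def norm_sq_lower_bound(a):
    """||A^T x||^2 / ||x||^2 for x = (1, ..., 1), a lower bound for ||A||^2."""
    cols = column_sums(a)
    return sum(c * c for c in cols) / len(a)


if __name__ == "__main__":
    row_only = [[0.5, 0.5], [1.0, 0.0]]   # row stochastic, column sums 1.5 and 0.5
    doubly = [[0.5, 0.5], [0.5, 0.5]]     # doubly stochastic
    for a in (row_only, doubly):
        print(defect(a), norm_sq_lower_bound(a))
```

For the first matrix the bound exceeds 1, so its operator norm cannot be 1; for the second, S vanishes, as the solution requires.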


Also solved by David K. Baxter, A. J. Bosch (The Netherlands), C. S. K. Chetty (India), Jesse Deutsch, H. 
Kestelman (England), Mauri Koskela (Finland), O. P. Lossers (The Netherlands), Marvin Marcus, A. McD. Mercer 
(Canada), Jorma Kaarlo Merikoski (Finland), Marcel F. Neuts, Paul J. Nikolai, Nicholas Passell, Mary Beth Ruskai, 
C. L. Thompson (England), John Tripp, Nam-Kiu Tsing (Hong Kong), Pei Yuan Wu (Taiwan), and the proposer. 


The Error Term in Simpson’s Rule 


6368 [1981, 768]. Proposed by Robert E. Shafer, Berkeley, Calif. 


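Consider the integral 


$$I = \int_{\nu/2}^{(\nu+1)/2} \psi(z)\,dz = \log\left[\Gamma\left(\frac{\nu+1}{2}\right) \Big/ \Gamma\left(\frac{\nu}{2}\right)\right], \quad \operatorname{Re} \nu > 0,$$


where ψ denotes the logarithmic derivative of the gamma function. Simpson’s rule with two 
subintervals gives 

$$I = \frac{1}{12}\psi\left(\frac{\nu}{2}\right) + \frac{1}{3}\psi\left(\frac{\nu + 1/2}{2}\right) + \frac{1}{12}\psi\left(\frac{\nu+1}{2}\right) - E$$


where 


$$E = \frac{1}{92160}\,\psi^{(4)}\left(\frac{\nu+\theta}{2}\right), \quad 0 < \theta < 1.$$

Show that for real ν, E is negative, by deriving the representation 


$$E = \int_0^1 \left[\frac{1}{\log\frac{1+x}{1-x}} - \frac{1 + 2\sqrt{1-x^2}}{6x}\right]\left(\frac{1-x}{1+x}\right)^{\nu-1}\frac{dx}{1+x}.$$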


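Solution by Lajos Takacs, Case Western Reserve University. It is well known that, for Re z > 0, 


$$\log \Gamma(z) = \int_0^1 \left[\frac{1 - u^{z-1}}{1 - u} - z + 1\right]\frac{du}{\log u}$$


and 


$$\psi(z) = \frac{d}{dz}\log \Gamma(z) = -\int_0^1 \left[\frac{u^{z-1}}{1 - u} + \frac{1}{\log u}\right] du.$$


Substituting $u = y^2$ in these integrals, we can express 

$$E = \frac{1}{12}\psi\left(\frac{\nu}{2}\right) + \frac{1}{3}\psi\left(\frac{\nu}{2} + \frac{1}{4}\right) + \frac{1}{12}\psi\left(\frac{\nu}{2} + \frac{1}{2}\right) - \log\left[\Gamma\left(\frac{\nu}{2} + \frac{1}{2}\right) \Big/ \Gamma\left(\frac{\nu}{2}\right)\right]$$

in the form 

$$E = \frac{1}{6}\int_0^1 g\left(\frac{1}{y}\right)\frac{y^{\nu}}{(1 - y^2)\log y}\,dy$$


for Re ν > 0, where 

$$g(u) = (u + 4\sqrt{u} + 1)\log u - 6(u - 1).$$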
Substituting y = (1 − x)/(1 + x) in the above integral, we obtain the desired formula for E. 
Next we observe that, for t > 0, 


$$e^{-t} g(e^{2t}) = 4t\cosh t + 8t - 12\sinh t = 4\sum_{n=2}^{\infty}\frac{t^{2n+1}}{(2n)!}\left(1 - \frac{3}{2n+1}\right) > 0.$$

Hence g(u) > 0 for u > 1, and this implies that E < 0 when ν > 0. 


Also solved by Otto G. Ruehr and the proposer. M. L. Glasser and O. P. Lossers (The Netherlands) derived the 
desired representation for E but did not show that E < 0 for ν > 0. M. E. Muldoon (Canada) observed that an 
explicit representation for E is not required to prove that E < 0 when ν > 0, since this follows from the known 
formula 


$$\psi^{(4)}(x) = -24\sum_{n=0}^{\infty}(x + n)^{-5}, \quad x > 0.$$
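
The sign of E can also be checked numerically. The sketch below is not part of any of the submitted solutions. It assumes a hand-rolled `digamma` helper (the Python standard library has none) built from the recurrence ψ(x) = ψ(x + 1) − 1/x and the usual asymptotic series, and it computes E as the Simpson approximation minus the exact value of the integral obtained from `math.lgamma`.

```python
import math


def digamma(x):
    """psi(x) for real x > 0: shift x upward, then use the asymptotic series."""
    result = 0.0
    while x < 10.0:
        result -= 1.0 / x          # psi(x) = psi(x + 1) - 1/x
        x += 1.0
    inv2 = 1.0 / (x * x)
    # psi(x) ~ log x - 1/(2x) - 1/(12 x^2) + 1/(120 x^4) - 1/(252 x^6)
    result += math.log(x) - 0.5 / x - inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252))
    return result


def simpson_error(nu):
    """E = (Simpson approximation) - (exact integral of psi over [nu/2, (nu+1)/2])."""
    exact = math.lgamma((nu + 1) / 2) - math.lgamma(nu / 2)
    approx = (digamma(nu / 2) / 12
              + digamma(nu / 2 + 0.25) / 3
              + digamma(nu / 2 + 0.5) / 12)
    return approx - exact


if __name__ == "__main__":
    for nu in (0.5, 1.0, 2.0, 5.0, 10.0):
        print(nu, simpson_error(nu))
```

For much larger ν the error E becomes comparable to floating-point rounding, so a check of this kind is only meaningful for moderate values.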


The Sum $\sum_{k=1}^{N} k^{-1/2} \log k$ 


6371 [1981, 769]. Proposed by Bruce C. Berndt, University of Illinois, Urbana-Champaign. 


Show that 

$$\lim_{N\to\infty}\left\{\sum_{k=1}^{N} k^{-1/2}\log k - 2\sqrt{N}\log N + 4\sqrt{N}\right\} = -\zeta(1/2)\{\pi/4 + \gamma/2 + (\log(8\pi))/2\}$$


where ζ(s) denotes the Riemann zeta-function and γ denotes Euler’s constant. Note 

$$\zeta(1/2) = (2^{1/2} + 1)\sum_{n=1}^{\infty}(-1)^n n^{-1/2}.$$


Solution by Otto G. Ruehr, Michigan Technological University, Houghton, Michigan. G. H. 
Hardy (Divergent Series, Oxford, 1949, pp. 333-334) employed the Euler-Maclaurin sum formula 
to show that 


$$(*)\quad \lim_{N\to\infty}\left\{\sum_{k=1}^{N} k^{-s}\log k - \frac{N^{1-s}\log N}{1-s} + \frac{N^{1-s}}{(1-s)^2} - \frac{N^{-s}\log N}{2}\right\} = -\zeta'(s)$$

for Re s > −1. To calculate ζ′(1/2) we differentiate logarithmically the well-known identity 

$$\zeta(s) = 2(2\pi)^{s-1}\sin\frac{\pi s}{2}\,\Gamma(1-s)\,\zeta(1-s)$$

and set s = 1/2, noting that Γ′(1/2)/Γ(1/2) = −γ − log 4. The indicated result follows by 
taking s = 1/2 in (*). 
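
The identity can be sanity-checked with partial sums. In the sketch below the function name and the choice N = 100000 are illustrative assumptions. Both ζ(1/2) and −ζ′(1/2) are approximated by partial sums with their leading Euler-Maclaurin corrections; the correction terms −N^{−s}/2 and −(N^{−s} log N)/2 vanish in the limit and serve only to speed up convergence.

```python
import math

EULER_GAMMA = 0.5772156649015329


def corrected_sums(n_terms, s=0.5):
    """Approximate zeta(s) and -zeta'(s) for 0 < s < 1 from sums up to n_terms."""
    powers = [k ** -s for k in range(1, n_terms + 1)]
    plain = math.fsum(powers)
    weighted = math.fsum(p * math.log(k) for k, p in enumerate(powers, start=1))
    big_n = float(n_terms)
    zeta = plain - big_n ** (1 - s) / (1 - s) - 0.5 * big_n ** -s
    minus_zeta_prime = (weighted
                        - big_n ** (1 - s) * math.log(big_n) / (1 - s)
                        + big_n ** (1 - s) / (1 - s) ** 2
                        - 0.5 * big_n ** -s * math.log(big_n))
    return zeta, minus_zeta_prime


if __name__ == "__main__":
    zeta_half, lhs = corrected_sums(100000)
    rhs = -zeta_half * (math.pi / 4 + EULER_GAMMA / 2 + math.log(8 * math.pi) / 2)
    print(lhs, rhs)  # both should be close to 3.92265
```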


Also solved by Paul S. Bruckman, P. J. Forrester (Australia), M. L. Glasser, Gaston H. Gonnet (Canada), 
Nathaniel Grossman, Sidney Heller, H. Jager (The Netherlands), O. P. Lossers (The Netherlands), William A. 
Newcomb, Helmut Prodinger (Austria), Don Redmond, Hermann Schmidt (West Germany), Robert E. Shafer, 
David Shelupsky, Lajos Takacs, L. Van Hamme (Belgium), and the proposer. 


An $L_p$-Norm Inequality 


6372 [1981, 769]. Proposed by David R. Brillinger, University of California, Berkeley. 
Let n be a positive integer and f in $L_p(R)$ with p = (n + 1)/n. Then 


$$\left|\int\cdots\int f(x_1) f(x_2)\cdots f(x_n)\,f(x_1 + \cdots + x_n)\,dx_1\cdots dx_n\right| \le \|f\|_p^{n+1}.$$

(This inequality yields directly the result of Problem 5314, this MONTHLY, 72 (1965) 795, discussed 
by H. Dym, this MONTHLY, 87 (1980) 53-54.) 


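Solution by Pei Yuan Wu, Department of Applied Mathematics, National Chiao Tung University, 
Hsinchu, Taiwan, Republic of China. By Hölder’s inequality, the integral in question does not 
exceed the following product of n + 1 integrals 


$$\left[\int\cdots\int |f(x_2)\cdots f(x_n)\,f(x_1 + \cdots + x_n)|^p\,dx_1\cdots dx_n\right]^{1/(n+1)} \cdots \left[\int\cdots\int |f(x_1)\cdots f(x_{n-1})\,f(x_1 + \cdots + x_n)|^p\,dx_1\cdots dx_n\right]^{1/(n+1)} \left[\int\cdots\int |f(x_1)\cdots f(x_n)|^p\,dx_1\cdots dx_n\right]^{1/(n+1)}$$

$$= \left(\left[\int |f(x)|^p\,dx\right]^{n/(n+1)}\right)^{n+1} \quad\text{(by Fubini’s theorem)}$$

$$= \|f\|_p^{n+1}.$$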


Also solved by Ulrich Abel (West Germany), K. F. Andersen (Canada), G. A. Edgar, Noel Glick, O. P. Lossers 
(The Netherlands), Nicholas Miller, Michael D. Perlman, Jay Warendorff, and the proposer. 
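
As a numerical illustration of the inequality for n = 2 (so p = 3/2), one can take a Gaussian. The test function f(x) = exp(−x²), the grid parameters, and the function names below are assumptions made only for this sketch. For this f the double integral equals π/√3 and the bound ||f||_p³ equals 2π/3, so the midpoint sums can be compared against closed forms.

```python
import math


def gaussian(x):
    return math.exp(-x * x)


def midpoints(half_width=6.0, step=0.05):
    """Midpoints of a uniform grid on [-half_width, half_width]."""
    count = int(round(2 * half_width / step))
    return [-half_width + (i + 0.5) * step for i in range(count)], step


def double_integral(f):
    """Midpoint-rule value of the integral of f(x) f(y) f(x + y) over the plane."""
    pts, step = midpoints()
    vals = [f(x) for x in pts]
    total = 0.0
    for x, fx in zip(pts, vals):
        total += fx * sum(fy * f(x + y) for y, fy in zip(pts, vals))
    return total * step * step


def lp_norm(f, p):
    """Midpoint-rule value of the L_p norm of f on the real line."""
    pts, step = midpoints()
    return (sum(abs(f(x)) ** p for x in pts) * step) ** (1.0 / p)


if __name__ == "__main__":
    lhs = double_integral(gaussian)
    rhs = lp_norm(gaussian, 1.5) ** 3
    print(lhs, rhs)  # lhs should not exceed rhs
```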
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER, CARLETON COLLEGE 


Turtle Geometry. The Computer as a Medium for Exploring Mathematics. By Harold Abelson and 
Andrea diSessa. The M.I.T. Press, Cambridge, MA, 1981. xx + 477 pp., $20.00. 


GEORGE K. FRANCIS 
Department of Mathematics, University of Illinois, Urbana, IL 61801 


Astronomer, surveyor, mason, and carpenter measure heaven and earth, and create a world of 
parallels and perpendiculars. One imagines Plato handing his students a piece of string and 
instructing them to go and discover the rules of geometry on the dusty grounds of his academy. 
Straight lines connect here to there, and great circular arcs trace across the imagination of pupils 
for 2,000 years as they learn their Euclid. Instruments of measurement improve. Astrolabe and 
telescope chart the cosmos, drafting tools and paper aid artist and architect to explore perspective. 
Euclid yields to Descartes and Riemann. Geometry acquires hyphens: analytic-, projective-, 
algebraic-, differential-, and gives birth to topology. Once called the analysis of place, this 
offspring soon escapes its narrow mission and invades the furthest corners of science, from logic 
(ultrafilters) to linguistics (catastrophe theory). 

Yet, a century after the great synthesis of analysis, geometry and algebra that culminated in the 
work of Klein and Poincaré, the exciting ideas of Thurston et al. have inaugurated a renaissance 
of interest in low-dimensional geometry. In order to appreciate and understand the Thurston 
program, and perhaps contribute to it, a profound familiarity with the most elementary concepts, 
such as curvature, connectivity, and groups of structure-preserving transformations, is essential. 
But how can we provide a cohort of native speakers of a geometrical language which is currently 
learned only in graduate school? 
Abelson and diSessa’s “Turtle Geometry” represents a pedagogical program that is a vital first 
step in this direction. In a word, it says: computergraphics. There is no question that this truly 
new tool of visualization and its associated syntax (procedure oriented recursive programming) 
could be a direct access to an intuitive understanding of matters generally considered very 
difficult. Just how this might be done is expertly explored in this primer of computer based 
geometry. 

Undoubtedly, the authors mean to take their readers by the hand and guide them through a 
wonderful safari to the curved space of General Relativity. But this book is no “Mr. Tompkins in 
Wonderland.” There are eight chapters of sound training in plane, space, and surface geometry 
preceding it. Their turtle, named for Walter’s robot tortoises that were programmed to advance 
and turn like the Milton-Bradley programmable toy “Big Trak,” is secretly a point (= state) in 
the principal SO(n) bundle over a manifold. In the plane (Chapters 1, 2, 4) the turtle has a 
direction as well as a position. It changes these by obeying a string made up of the two atomic 
commands: Fs = “move s units from the current position in the direction of the current heading” 
and Rθ = “rotate the heading θ angular units from the current direction.” Thus, over a turtle 
path (“tour” would have been better because its program is more of an itinerary than a highway), 
it accumulates total turning as well as total displacement. 

To give a sample of turtle pedagogy, let me trace their “poly closing theorem” through the text 
and certain exercises, which they provide with hints and an explanation of their solution. The 
theorem claims that simple iteration of FsRθ returns the turtle to its initial state if and when the 
total turning reaches an integral multiple of 2π. Following ample experiments in support of its 
truth, a first proof is sketched. It depends on the observation that the vertices of the polygonal 
trail left by this tour lie on a common circle of reciprocal radius equal to |2 sin(θ/2)|/s. The 
authors resist the temptation of proving this (elementary) lemma, or of calling this ratio the 
curvature of the circle, for a very good reason. In orthodox turtle thought, based on the radical 
subjectivity of strictly local (intrinsic) experiences, a turtle circle is not so much the locus of 
positions equidistant from a point, but the (ideal) path approximated by tours with decreasing 
parameters but constant ratio θ/s. Though called “curvature” early in the book, the authors avoid 
this term in relation to curves because they do not intend to define surface curvature in terms of 
the Gauss map [5]. Just how they manage to sidestep the Theorema Egregium and motivate 
intrinsic curvature from the start is one of the more ingenious if controversial features of Turtle 
Geometry. 
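
The poly closing theorem and the circle lemma are easy to try out. The sketch below is written in Python rather than the book's Turtle Procedure Language, and its function names are hypothetical. It iterates FsRθ from the origin and compares the distance of each vertex from the centroid with s/|2 sin(θ/2)|.

```python
import math


def poly_tour(side, turn_degrees, steps):
    """Iterate 'forward side, right turn_degrees'; return the vertices and final heading."""
    x, y, heading = 0.0, 0.0, 0.0
    vertices = [(x, y)]
    for _ in range(steps):
        x += side * math.cos(math.radians(heading))
        y += side * math.sin(math.radians(heading))
        heading -= turn_degrees  # a right turn decreases the heading angle
        vertices.append((x, y))
    return vertices, heading


def circle_radius(side, turn_degrees):
    """Radius of the common circle: the reciprocal of |2 sin(theta / 2)| / s."""
    return side / abs(2.0 * math.sin(math.radians(turn_degrees) / 2.0))


if __name__ == "__main__":
    for theta in (72.0, 144.0):  # pentagon and pentagram, total turning 360 and 720 degrees
        verts, heading = poly_tour(1.0, theta, 5)
        cx = sum(v[0] for v in verts[:5]) / 5
        cy = sum(v[1] for v in verts[:5]) / 5
        radii = [math.hypot(vx - cx, vy - cy) for vx, vy in verts[:5]]
        print(theta, verts[-1], heading, radii, circle_radius(1.0, theta))
```

In both cases the last vertex coincides with the starting point up to rounding, and every vertex lies at the predicted distance from the centroid.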

The second proof argues (by contradiction) that if after n steps, headings agree (as they must 
for θ a proper rational fraction of 2π) but at different positions, P0 ≠ Pn = (FsRθ)^n(P0), then 
the turtle must wander off to infinity. It does so inside a strip along the line P0 Pn, which it meets 
every n steps, never straying farther than ns/2 from it in between. On the other hand, applying the 
argument to the initial state (P1, H1 = H0 + θ) instead obliges the turtle to wander off along 
another road rotated from the former, which no plane turtle can do. 

After two chapters of thorough motivation, vector geometry is presented as the natural 
algebraic tool for implementing turtle programs on real computers. Now the classic vector proof, 
using isotropy and linearity of rotations can be explored. But real geometric insight is generated in 
an exercise (with hint and answer) concerning the failure of this theorem in three space. Here an 
aquatic turtle, or perhaps a bird, has an orthogonal frame for its heading, and rotations have three 
primitives appropriately called pitch, roll and yaw. The second argument here leads not to a 
contradiction but the existence of invariant axes for elements of SO(3). The polybird happily 
spirals off to infinity along a helix. 

The central character of Turtle Geometry, however, is the “simple closed path theorem” to the 
effect that whatever complicated program generates a turtle loop that never crosses itself, the total 
turning is ±2π; compare Hopf [6]. Its demonstrable failure on a sphere leads to the notion of 
angular excess and eventually to intrinsic curvature density. The local form of the Gauss-Bonnet 
theorem [3, p. 268] serves as a quasi definition of the total curvature of a simply connected surface 
patch. Labor is rewarded with a thoroughly convincing demonstration of the Kronecker-Dyck 
theorem: for a closed surface the total curvature is 2π times the Euler characteristic [7], [4]. 

One may well wonder how all this can be done without reference to the calculus and with any 
semblance of precision. On the matter of rigor, always a touchy point with “elementary” books, 
let me suggest a humanistic criterion. An explanation is a discourse that satisfies the curiosity of 
the hearer and a proof is an argument that removes the obstacles to belief. Those definitions and 
proofs that fit the level of combinatorial sophistication expected of the reader are as sound as a 
fiddle. Among these is an algorithm teaching the turtle to escape a maze based on Hopf’s theorem. 
Another is a proof of Hopf’s theorem for a noncrossing right angled turtle loop by collapsing it to 
a square. Other, more heuristic explanations arouse an appetite and prepare the way for the 
analysis and topology needed to cement the gaps. One reason the authors can maintain 
plausibility lies in the digitized nature of turtle paths. Since the computer turtle always has some 
heading, the ideal curve “pointed to” by the traces on the (flat!) computer screen is piecewise 
regular (= nonzero velocity) in the sense of Hopf [6] and Whitney [12]. A broken geodesic on a 
curved or polyhedral surface becomes an “even strided turtle walk,” or better, a wheeled turtle 
that turns its right and left wheels at the same speed on the forward motion. Parallel translation of 
a fixed direction relative to its initial heading (the affine connection) is managed by making the 
appropriate additive corrections at the turns. Transition between adjacent charts (... what to do 
when the turtle reaches the screen edge) in an atlas for a manifold is nicely motivated, for 
example, by taking the turtle for a walk on a cube. 

And so on. I will not spoil the fun of the chase by revealing all the geometrical concepts 
amenable to turtle heuristics. The book is magnificently illustrated. A plenitude of problems is 
helped by separate appendices for hints and answers. Turtle Procedure Language, the pseudocode 
used throughout the text, is fairly clear and an appendix ties it to the common speech of 
APPLEBASIC and PASCAL as well as its parent LOGO. The only serious flaw is the absence of 
a bibliography. To the undergraduate veteran of Turtle Geometry I recommend doCarmo’s superb 
text [3]. For the geometrical convert to turtle pedagogy there are several good projects left 
virtually untouched by this volume. One that springs to mind is to explore also the extrinsic 
curvature of surfaces in 3-space, along the lines of Tom Banchoff’s films [1]. 

In closing, I should like to spin a yarn from Gauss to Nelson Max’s computer graphics 
masterpiece on the sphere eversion [8]. One thread of it is taken up in Chapter 4 on deforming 
plane closed turtle paths. The rest could lead to a computer based geometry whose protagonist is 
an entire surface writhing in higher dimensions. 

A succession of small changes in the parameters of a turtle loop simulates a regular homotopy, 
provided that only angles |θ| < π are used in programming the turtle. The resulting deformation 
preserves the total turning of the curve. Graustein’s elegant analytic proof in [12], that (conversely) 
two regular closed plane curves with equal total turning are regularly homotopic, was generalized 
by Smale. In his thesis [10] he identifies the fundamental group, π_1(TS(M^n), *), of the direction 
(= unit sphere) bundle over a Riemannian manifold as the set π_0(Im(S^1, M; *)) of regular 
homotopy classes of immersions of the circle in M with common initial place and heading. For 
n = 2 and M the plane, TS(R^2) = R^2 × S^1 and the invariant is the total turning divided by 2π. 
(This number, called “amplitudo” by Gauss, has acquired many aliases since.) In a sequel [11], 
Smale shows that the regular homotopy classes, π_0(Im(S^2, R^n)), of immersions of the sphere are 
in 1:1 correspondence with π_2(V_{n,2}), where V_{n,k} is the Stiefel manifold of k-frames in n-space. 
Since the cross product of two independent vectors in R^3 determines an independent third, and 
the triplet can be orthonormalized, V_{3,2} retracts to SO(3). Since S^3 doubly covers SO(3) and 
π_2(S^3) is trivial, all immersions of the sphere in space are connected to each other. A regular 
homotopy from the standard inclusion to its central (= antipodal) reflection is called an eversion 
of the sphere. 

The notion of regular homotopy and the Whitney-Graustein theorem have little known 
precursors in the 1901 dissertation of Hilbert’s student Werner Boy [2]. His curves in the plane 
and surfaces in space are still the 19th century loci of algebraic constraints, not the images of 
mappings. His “completely continuous deformations,” restricted only by an upper bound on the 
absolute value of the Gaussian curvature measure, are not always (equivalent to) a regular 
homotopy, though for circles and spheres it is. Boy’s highly visual proof that total curvature 
classifies the deformations of plane closed loops, which could have a pretty turtle version, is 
omitted from his published paper, with good reason. Generally, its surface analog is false. A strip 
of paper joined after one full twist (not a Möbius band) will not deform into an untwisted one, 
though both have zero curvature. (In view of Smale’s theorem, one must seek a counterexample 
for closed surfaces among more complicated ones than spheres.) Boy proceeds with a geometrical 
realization of Gauss’ plan of relating topological connectivity to total curvature. He develops an 
equivalent to curvature that works also for piecewise polyhedral surfaces, reminiscent of Turtle 
Geometry, Chapters 5-8. He constructs his celebrated immersion of the projective plane as the 
missing link (χ = 1, n = 0) in the zoo of known surfaces in real 3-space for which 


C = 2πχ + 4πn 


where C is the total (polyhedral) curvature, χ the Euler-Poincaré characteristic and n the number 
of double curves ending in pinchpoints. This beautiful surface has been with us for 80 years, and 
only recently, Petit and Souriau [9], working out an idea of Bernard Morin, found a parametriza- 
tion that works well on an APPLE II. 
and others. This list doesn’t include the traditional interaction with such fields as operator 
algebras, group representation theory, and partial differential equations, etc. Contemporary 
mathematicians who have become involved in this new interaction include, for instance, M. F. 
Atiyah, S. Novikov, and I. Singer, as well as the author of the book being reviewed. Physicists 
involved in the interaction include, for instance, G. ’t Hooft (elementary particle theory), and R. 
Penrose (relativity theory), as well as many others. 

The interaction between mathematics and physics goes back to the beginnings of both 
intellectual disciplines, and the cross-fertilization has had enormous benefits for both areas. 
However, they are indeed distinct disciplines with their own standards, methodology, and 
fundamental problems. The book under review is written by a mathematician who had been asked 
by a certain forum to talk about the interaction between these two fields, and this published book 
is an edited version of a series of lectures on the subject. The result is a delightful experience 
written in a beautiful style which is not at all pedantic, and which attempts to indicate the 
diversity of the topic as well as the depth and nature of the interaction. In the first paragraph of 
the book the author relates an anecdote to the effect that a well-known professor of logic was in 
the habit of beginning a course in logic by asserting that “logic is the science of laws of thought; 
now I must tell you what the words ‘science’, ‘laws’, and ‘thought’ mean, but I won’t tell you what 
‘of’ means.” Manin, in his book, also doesn’t attempt to explain the word “and” in the title of his 
book. He discusses, in a stream of consciousness fashion, numerous topics from both mathematics 
and physics and the reader can divine for himself the nature of the word “and.” The table of 
contents illustrates the breadth of the book: (1) a bird’s eye view of mathematics; (2) physical 
quantities, dimensions and constants: the source of numbers in physics; (3) a drop of milk: 
observer, observation, observable, and unobservable; (4) space-time as a physical system; (5) action 
and symmetry. In Chapter 1 he talks about set theory, mathematical truth, linearity, curvature, 
formulas and other mathematical notions. In Chapter 2 he discusses in a beautiful manner and in 
a language that mathematicians can appreciate the nature of physical laws and the importance of 
scales and fundamental units. In the next two chapters he takes an interesting excursion through 
the areas of quantum theory and relativity theory, and their interaction. He concludes with a brief 
discussion of symmetry and ends up with a tantalizing comparison between the theorem of Euclid 
on the infinitude of primes and the theorem of Deligne which solves an old, easily-stated 
conjecture of Ramanujan concerning prime numbers. 

The book is beautifully written and will at times tax the reader whose knowledge doesn’t reach 
far enough in a given direction, but the flavor of a vast variety of topics and their significance in 
both mathematics and physics is presented with the skill of a scientist who knows what it means to 
communicate with an audience. The one major defect in the book is the complete lack of 
references (such is the nature of a lecture), but the stimulus for further reading will certainly be 
there after reading the book. It is too bad that the author didn’t attempt to bridge this additional 
gap of communication, as it would have added greatly to the book. Nevertheless, I recommend the 
book highly to the mathematics community (and the physics community as well!) as a significant 
essay on the nature of human knowledge in two areas which have a profound effect on the 
civilization we live in. 


MISCELLANEA 


107. Anyone who cannot cope with mathematics is not fully human. 


—Robert A. Heinlein, Time Enough for Love, 
Berkley Medallion Edition, p. 247. 
LETTERS TO THE EDITOR 


Material for this department should be sent to Professor P. R. Halmos, Department of Mathematics, Indiana 
University, Bloomington, IN 47405. 


Editor: 


The “Atom Bomb” game (MISCELLANEA, November 1982, 687) was misquoted. In the 
correct version (which has no solution), each player picks a number in [0, 1] and the one with the 
greater number kills the other, or in a tie both players survive—unless the sum is greater than or 
equal to 1, in which case the referee kills both. N. J. Fine and I heard this game at the Operations 
Evaluation Group (U. S. Navy) shortly after joining it in late 1945; it is possible that Erdős or 
Kaplansky heard it from one of us. 

Here is a more realistic model I use in lectures, especially as a closing line. Since it deals with 
positive integers rather than the unit interval, general audiences are more comfortable (?) with it. 
Each player picks a positive integer and the player with the greater one kills the other, or in a tie 
both players survive—all this subject to the following condition: there is a number N, whose value 
is known to the referee but is not known to either of the players; if the sum is greater than N, then 
the referee kills them both. 


Leonard Gillman 
Department of Mathematics 
University of Texas at Austin 
Austin, TX 78712 


Editor: 


Does anybody know of any studies which show that a person with a degree in mathematics 
education makes a better high school teacher than a person with a straight degree in mathematics 
with no courses in education? I would also like to pose the same question with regard to 
chemistry, physics, biology, English, etc. 

Since most people with a degree in mathematics education have fewer hours of mathematics 
than a person with a straight mathematics degree, it seems that the person with a straight 
mathematics degree would make a better high school teacher on the average, and particularly so 
from Algebra II onward. It also seems that the substitution of mathematics courses taught in the 
mathematics department in place of education and mathematics education courses taught elsewhere 
would yield a better high school teacher on the average. 

Does anybody have any answers, ideas or comments? 

Charles T. Scarborough 
Department of Mathematics 
and Statistics 

Mississippi State University 
Mississippi State, MS 39762 
Browse through this list and see if your library is missing some of these important books 


published by the Association. 


APPLICATIONS OF UNDERGRADUATE 
MATHEMATICS IN ENGINEERING — written and 
edited by Ben Noble. A collection of articles prepared 
by engineers for the Committee on the Undergraduate 
Program in Mathematics. 364 pp. Hardbound. 

List: $19.00. MAA Member: $13.00. 


ANNOTATED BIBLIOGRAPHY OF EXPOSITORY 
WRITING IN THE MATHEMATICAL SCIENCES, 
prepared by M. P. Gaffney and L. A. Steen. An invaluable 
reference source of expository articles in mathematics. 
282 pp. Paperbound. 

List: $12.00. MAA Member: $8.00. 


AN ANNOTATED BIBLIOGRAPHY OF FILMS AND 
VIDEOTAPES FOR COLLEGE MATHEMATICS, by 
David I. Schneider. An up-to-date listing of films and 
videotapes available for classroom use. 107 pp. 
Paperbound. 

List: $9.00. MAA Members: $6.00. 


A BASIC LIBRARY LIST FOR TWO-YEAR COLLEGES, 
prepared by the Committee on Basic Library Lists. A 
recommended library nucleus for two-year colleges. 
66 pp. Paperbound. 

List: $8.00. MAA Member: $6.00. 


A BASIC LIBRARY LIST FOR FOUR-YEAR 
COLLEGES, prepared by CUPM. Presents listings of 
books and journals that should be in every college 
library. 106 pp. Paperbound. 

List: $9.00. MAA Member: $6.50 


THE CHAUVENET PAPERS. A Collection of Prize 
Winning Expository Papers in Mathematics, edited by 
James C. Abbott. Two volumes of the collected prize 
winning Chauvenet Papers. Vol. 1— 312 pp. 
Hardbound. Vol. 2— 282 pp. Hardbound. 

List: $21.00 each. MAA Member $16.00 each. 

Two volume sets 

List: $36.00. MAA Member: $27.00. 


CRITICAL VARIABLES IN MATHEMATICS 
EDUCATION: Findings from a Survey of the 
Empirical Literature, by E.G. Begle. A joint publication 
of the MAA and the National Council of Teachers of 
Mathematics. 

List: $8.00. MAA Member: $6.40. 


A COMPENDIUM OF CUPM RECOMMENDATIONS. 
Volumes I and II. A collection of the major 
recommendations of the Committee on the 
Undergraduate Program in Mathematics. Two volumes. 
756 pp. Hardbound. 

List: $16.50. MAA Member: $12.00. 


THE WILLIAM LOWELL PUTNAM MATHEMATICAL 
COMPETITION: PROBLEMS AND SOLUTIONS — 
1938-1964. Compiled by R. E. Greenwood, A. M. 
Gleason, and L. M. Kelly. Contains problems and 
solutions to the first 25 Putnam Exams. 652 pp. 
Hardbound. 

List: $35.00. MAA Member: $26.00. 


THE MATHEMATICAL ASSOCIATION OF AMERICA: 
Its First Fifty Years. An historical perspective of the 
Association. 170 pp. Hardbound. 

List: $10.00. MAA Member: $5.00. 


FIFTY YEAR INDEX OF THE MATHEMATICS 
MAGAZINE, edited by Lynn A. Steen, and J. Arthur 
Seebach. Cumulative index of volumes 1-50. 163 pp. 
paperbound. 

List: $10.00. MAA Member: $6.50. 


INDEX OF THE AMERICAN MATHEMATICAL 
MONTHLY. Contains the tables of contents for each 
issue of volumes 1-80 as well as subject and author 
indices. 269 pp. Hardbound. 

List: $19.00. MAA Member: $13.00. 


PRIME-80. Proceedings of a Conference on 
Prospects in Mathematics Education in the 1980's. 
Included are the background of the conference and the 
recommendations that resulted. 

84 pp. Paperbound. $3.50. 


PROFESSIONAL OPPORTUNITIES IN THE 
MATHEMATICAL SCIENCES, Eleventh Edition. 1983. 
Designed for the student interested in a career in 
mathematics. 35 pp. Paperbound. $1.50 each. 95¢ for 
orders of five or more. 


RECOMMENDATIONS ON A GENERAL 
MATHEMATICAL SCIENCES PROGRAM. Prepared by 
the Committee on the Undergraduate Program in 
Mathematics. (CUPM) 102 pp. Paperbound. $3.50 each 


Order From: 


THE MATHEMATICAL ASSOCIATION 
OF AMERICA 

1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 
The Fourth Dimension and Non-Euclidean Geometry in Modern Art 


Linda Dalrymple Henderson 


During the first three decades of this century, the concept of the fourth 
dimension had a liberating effect on artists in nearly every important 
movement, from French Cubism to Surrealism. Linda Dalrymple Hender- 
son, as a prelude to her analysis of modern art, provides a history of the 
popularization of the n-dimensional and non-Euclidean geometries, 
including a discussion of the period writers who developed the philosoph- 
ical and mystical implications of higher spatial dimensions. 

Cloth, $60.00. Limited Paperback Edition, $18.50 


Princeton University 


41 William Street, Princeton, NJ 08540 


EVERY 
a) STUDENT 
BELONGS 


Let the MAA Placement Test Program help you match entering students with begin- 
ning mathematics courses according to training and ability, rather than trans- 
cripts and credentials. PTP tests are constructed by panels representing a broad 
spectrum of institutions and are carefully pretested. Information about pretesting 
scores and placement experience of a variety of participating institutions is pub- 
lished periodically in the PTP Newsletter. 


Your institution can have unlimited use of annually updated MAA Placement Tests on 
• Basic Mathematical Skills • Basic Algebra • Advanced Algebra 
• Trigonometry/Elementary Functions • Calculus Readiness 
and also a subscription to the PTP Newsletter for one modest annual subscription 
fee. 


For information write to: 
The Mathematical Association of America 
Department PTP 
1529 Eighteenth Street, N. W. 
Washington, D.C. 20036 
The Contest Problem 
Book IV, 


Annual High School Mathematics 
Examinations 1973-1982, 


New Mathematical Library #29 


Edited by Anneli Lax. 
Compiled and with solutions by Ralph A. Artino, 
Anthony M. Gaglione, and Niel Shell. 


198 pp. Paperbound. 
List: $10.00 MAA Member: $8.00 


The American High School Mathematics Examination, given each 
year to over 400,000 high school students, helps to identify young 
people with an interest in and a talent for mathematics problem solv- 
ing. This volume contains the Examinations for 1973-1982, and is a 
continuation of Contest Problem Book I, II, and III, published as vol- 
umes 5, 17, and 25 of the New Mathematical Library. The three 
earlier books contain the first twenty-five annual examinations along 
with solutions. 

The problems are designed so that they can be solved with a 
knowledge of precalculus mathematics, with emphasis on intermedi- 
ate algebra and plane geometry. The book includes subject classifica- 
tions that indicate which questions relate to a particular topic in 
mathematics. 

The solutions have been carefully constructed, but students are 
encouraged to attempt their own solutions before looking at the ones 
offered. 

If you teach or counsel high school students, or know of a young 
person interested in mathematics problem solving, take this 
opportunity to order Contest Problem Book IV now. 


ORDER FROM: The Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 


‘Oxford: 


The Nature of 
Mathematical Knowledge 
PHILIP KITCHER, University of Vermont 


What is mathematics about? How does math- 
ematical knowledge grow? This highly origi- 
nal book offers new answers to these ques- 
tions. Most mathematicians and philosophers 
believe that mathematics is a collection of 
truths about some abstract realm, truths that 
are proved once and for all. Mathematics is 
taken to be different from the natural 
sciences, independent of empirical evidence 
or the work of previous generations. Philip 
Kitcher, author of Abusing Science: The Case 
Against Creationism, argues against this 
“mathematical apriorism.” He offers an alter- 
native approach, linking mathematics to natu- 
ral science and portraying mathematics as a 
body of knowledge that evolves through its 
history. 1983 273pp. $25.00 
Now in Paperback: Mathematical Snapshots 

Third American Edition, Revised and Enlarged 
H. STEINHAUS, with a new 
preface by MORRIS KLINE 
For over 30 years, novices of all 
ages have turned to this book for an 
introduction to mathematical concepts. 
Using striking photographs and dia- 
grams, Steinhaus ingeniously explains 
mathematical phenomena, beginning 
with simple puzzles and games and 
moving to more advanced problems. 
“A book to stretch the imagination 
without unduly straining the mind. 
Steinhaus’s book affords an amazing 
display of the richness, the variety and 
especially the interrelatedness of math- 
ematical thought.””—Scientific American 


1983 320 pp., photographs and line drawings $7.95 


At your bookstore or send your check to 


Box 900 


OXFORD UNIVERSITY PRESS 


200 Madison Avenue New York, New York 10016 
from the MAA Studies 
in Mathematics... 


Studies in 
Computer Science 


Edited by Seymour Pollack 
Volume 22, MAA Studies in Mathematics 
408 pp. Cloth. List: $29.00 MAA Member: $22.00 


Studies in Computer Science continues the tradition of excellence of 
the respected series MAA Studies in Mathematics. Written by com- 
puter scientists for mathematicians, the book presents a readable and 
balanced discussion of the role of mathematics in computer science, 
and the contributions of computing to mathematics. 

Opening with an historical overview of the development of com- 
puter science, the narrative continues with three articles on the 
nature of computer programs and the programming process, a dis- 
cussion of computational complexity, and a concluding group of 
articles on the impact of computer science on artificial intelligence, 
numerical analysis, statistics, and simulation. 

In the introduction the editor expresses his hope that the book 
will provide ‘‘an interesting look at an explosive field in the process of 
becoming.’’ What an understatement! The process is well advanced; 
the impact of computer science on mathematics and its teaching is 
already immense. And this book is not just interesting, it is exciting 
and important as background reading for anyone interested in the 
interactions between mathematics and computer science. Teachers 
will find that it enriches their teaching and students will find it 
rewarding supplementary reading. Order your copy today! 


ORDER FROM: The Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 
ITERATIVE PROCESSES IN ELEMENTARY GEOMETRY 


GENG-ZHE CHANG* AND PHILIP J. DAVIS 
Division of Applied Mathematics, Brown University, Providence, Rhode Island 02912 


1. Introduction. Certain constructions of elementary geometry, when iterated, give rise to 
figures that are often of great visual appeal and whose mathematical analysis has attractive 
and difficult features. An example of such a figure is the “nested polygon.” A planar polygon is 
given. Each side is divided into a fixed ratio and the points of division become the vertices of a 
second polygon. This construction is then iterated (Fig. 1). 
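One step of the nested-polygon construction can be sketched in Python as follows. This is only an illustration: it assumes the vertices are stored as complex numbers and that every side is divided in the same ratio r; the function name and the parameter r are not from the original.

```python
def nested_polygon(vertices, r):
    """Return the polygon whose vertices divide each side of `vertices` in ratio r : (1 - r)."""
    n = len(vertices)
    return [(1 - r) * vertices[i] + r * vertices[(i + 1) % n] for i in range(n)]


square = [0 + 0j, 1 + 0j, 1 + 1j, 0 + 1j]
inner = nested_polygon(square, 0.5)  # midpoints of the four sides
```

Applying `nested_polygon` repeatedly to its own output produces the sequence of nested polygons; the centroid of the vertices is unchanged by each step.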

An analysis of this problem and further references may be found in Berlekamp, Gilbert and 
Sinden [2], Davis [4]. An early solution of the problem presented by I. J. Schoenberg [6] pointed 
out the intimate relation to finite Fourier analysis, while Davis [4] placed the problem within the 
context of circulant matrices. 

In this paper we take up three iterative constructions of a different sort, and present a 
mathematical analysis of them. The constructions undoubtedly suggest generalizations and we 
have found numerous difficulties in the way of complete solutions. 


FIG. 1 


2. Projected Simplexes. Let P be an arbitrary point lying in the triangle $T_1$. Determine second, 
third, ... triangles $T_2, T_3, \ldots$ as in Fig. 2. We wish to discuss the convergence properties of the 
figure. (Davis [4], page 3.) We present the solution for the n-simplex. Let $T_0, T_1, \ldots, T_n$ be the 
n + 1 vertices of an n-simplex $T^{(0)}$ (= T). A point P inside the simplex can be expressed by 


(2.1)  $P = \lambda_0 T_0 + \lambda_1 T_1 + \cdots + \lambda_n T_n,$ 


FIG. 2 


where $0 < \lambda_i < 1$ and $\lambda_0 + \lambda_1 + \cdots + \lambda_n = 1$. $(\lambda_0, \lambda_1, \ldots, \lambda_n)$ are called the barycentric 
coordinates of P with respect to the simplex. 


Joining $T_0$ and P by a line segment, the intersection of this segment (extended beyond P) and the face 
opposite to $T_0$ is denoted by $T_0^{(1)}$ (Fig. 3). Since $T_0 = (1, 0, 0, \ldots, 0)$, a typical point on the straight line 
determined by $T_0$ and P is 

$(1 - t)T_0 + tP = [(1 - t) + t\lambda_0,\ t\lambda_1,\ \ldots,\ t\lambda_n].$ 


Making the first barycentric coordinate zero, we see that t must be 

$t = \dfrac{1}{1 - \lambda_0}.$ 


FIG. 3 


Thus we get 

$T_0^{(1)} = -\dfrac{\lambda_0}{1 - \lambda_0}\,T_0 + \dfrac{1}{1 - \lambda_0}\,P = [0, \lambda_1, \lambda_2, \ldots, \lambda_n]/(1 - \lambda_0).$ 


$T_1^{(1)}, T_2^{(1)}, \ldots, T_n^{(1)}$ are similarly determined, and we have 

$T_1^{(1)} = [\lambda_0, 0, \lambda_2, \ldots, \lambda_n]/(1 - \lambda_1),$ 
$\vdots$ 
$T_n^{(1)} = [\lambda_0, \lambda_1, \ldots, \lambda_{n-1}, 0]/(1 - \lambda_n).$ 

The n + 1 equalities above can be rewritten in matrix notation 


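(2.2)  $\begin{pmatrix} T_0^{(1)} \\ T_1^{(1)} \\ \vdots \\ T_n^{(1)} \end{pmatrix} = \operatorname{diag}\left(\dfrac{1}{1-\lambda_0}, \dfrac{1}{1-\lambda_1}, \ldots, \dfrac{1}{1-\lambda_n}\right) \begin{pmatrix} 0 & 1 & 1 & \cdots & 1 \\ 1 & 0 & 1 & \cdots & 1 \\ \vdots & & \ddots & & \vdots \\ 1 & 1 & \cdots & 1 & 0 \end{pmatrix} \operatorname{diag}(\lambda_0, \lambda_1, \ldots, \lambda_n) \begin{pmatrix} T_0 \\ T_1 \\ \vdots \\ T_n \end{pmatrix}.$ 


The $T_i^{(1)}$ are the (n + 1) vertices of a simplex $T^{(1)}$. 
Write J = the (n + 1) × (n + 1) matrix, all of whose entries are 1, and $D = \operatorname{diag}(\lambda_0, \lambda_1, \ldots, \lambda_n)$. 


Then $I - D = \operatorname{diag}(1 - \lambda_0, 1 - \lambda_1, \ldots, 1 - \lambda_n)$ and $(I - D)^{-1} = \operatorname{diag}((1 - \lambda_0)^{-1}, \ldots, (1 - \lambda_n)^{-1})$. 
Thus (2.2) can be written in the form 

(2.3)  $T^{(1)} = (I - D)^{-1}(J - I)\,D\,T.$ 

Since $J^2 = (n + 1)J$, it is easily verified that 

(2.4)  $(J - I)^{-1} = \dfrac{1}{n}J - I.$ 

Hence (2.3) may be inverted to yield 

(2.5)  $T = D^{-1}\left(\dfrac{1}{n}J - I\right)(I - D)\,T^{(1)}.$ 

Now 

$P = (\lambda_0, \lambda_1, \ldots, \lambda_n)\,T$ 

$\phantom{P} = (\lambda_0, \lambda_1, \ldots, \lambda_n)\,D^{-1}\left(\dfrac{1}{n}J - I\right)(I - D)\,T^{(1)}$ 

$\phantom{P} = \dfrac{1}{n}(1 - \lambda_0, 1 - \lambda_1, \ldots, 1 - \lambda_n)\,T^{(1)}.$ 


It follows that the barycentric coordinates of P with respect to $T^{(1)}$ are $\frac{1}{n}(1 - \lambda_0, 1 - \lambda_1, \ldots, 1 - \lambda_n)$. 
Call them $\lambda_0^{(1)}, \ldots, \lambda_n^{(1)}$ (and those of $T = T^{(0)}$, $\lambda_0^{(0)}, \ldots, \lambda_n^{(0)}$) so that 

(2.6)  $\lambda_i^{(1)} = \dfrac{1}{n}\left(1 - \lambda_i^{(0)}\right).$ 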
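For the general iterative step, we have 

(2.7)  $T^{(p+1)} = M_p T^{(p)},$ 

where 

(2.8)  $M_p = (I - D_p)^{-1}(J - I)D_p$ 

with 

(2.9)  $D_p = \operatorname{diag}(\lambda_0^{(p)}, \ldots, \lambda_n^{(p)}).$ 

Also, similarly to (2.6) we have $\lambda_i^{(p+1)} = \frac{1}{n}(1 - \lambda_i^{(p)})$, which can be written in matrix form as 

(2.10)  $D_{p+1} = \dfrac{1}{n}(I - D_p).$ 

From (2.10), $(I - D_p)^{-1} = \frac{1}{n}D_{p+1}^{-1}$, so that 

(2.11)  $M_p = \dfrac{1}{n}D_{p+1}^{-1}(J - I)D_p.$ 

From (2.7) it follows that 

(2.12)  $T^{(p)} = M_{p-1}M_{p-2}\cdots M_1 M_0 T.$ 

Define C = J − I. Then from (2.12) and (2.11) we obtain 

(2.13)  $T^{(p)} = \dfrac{1}{n^p}D_p^{-1}C^p D_0 T.$ 

By induction we obtain from (2.10) 

(2.14)  $D_p = \alpha_p I + \beta_p D_0, \quad p = 0, 1, \ldots,$ 

where 

(2.15)  $\alpha_{p+1} = \dfrac{1}{n}(1 - \alpha_p),\ \alpha_0 = 0; \qquad \beta_{p+1} = -\dfrac{1}{n}\beta_p,\ \beta_0 = 1.$ 

The solutions of the difference equations (2.15) are 

(2.16)  $\alpha_p = \dfrac{1}{n+1}\left[1 - \left(-\dfrac{1}{n}\right)^p\right], \qquad \beta_p = \left(-\dfrac{1}{n}\right)^p,$ 

and 

(2.16′)  $(n + 1)\alpha_p + \beta_p = 1.$ 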

Since the eigenvalues of J are n + 1, 0, 0, ..., 0, those of C = J − I are n, −1, −1, ..., −1. 
Thus, we have the diagonalization (see Davis [4], p. 72) 

(2.17)  $C = J - I = F^*\operatorname{diag}(n, -1, -1, \ldots, -1)\,F,$ 


where F is the Fourier matrix of order n + 1 and * designates the conjugate transpose. (The j, k 
element of $F^*$ is $(n + 1)^{-1/2}\omega^{(j-1)(k-1)}$, $\omega = \exp(2\pi i/(n + 1))$.) From (2.17) and (2.13), we have 


(2.18)  $T^{(p)} = D_p^{-1}F^*\operatorname{diag}\left(1, \left(-\dfrac{1}{n}\right)^p, \ldots, \left(-\dfrac{1}{n}\right)^p\right)F D_0 T.$ 

We may now allow p → ∞ in (2.18). From (2.16), for n ≥ 2, 

$\lim_{p\to\infty}\alpha_p = \dfrac{1}{n+1}, \qquad \lim_{p\to\infty}\beta_p = 0,$ 


so that from (2.14), 


$\lim_{p\to\infty}D_p = \dfrac{1}{n+1}I$ and $\lim_{p\to\infty}D_p^{-1} = (n + 1)I.$ 


Thus, from (2.18), 

(2.19)  $T^{(\infty)} = (n + 1)F^*\operatorname{diag}(1, 0, 0, \ldots, 0)\,F D_0 T.$ 

Since $F^*\operatorname{diag}(1, 0, \ldots, 0)F = \frac{1}{n+1}J$, 

$T^{(\infty)} = J D_0 T = J\begin{pmatrix}\lambda_0 T_0 \\ \vdots \\ \lambda_n T_n\end{pmatrix} = \begin{pmatrix}P \\ \vdots \\ P\end{pmatrix}$ 


and this verifies what is visually obvious, namely, the convergence of the vertices of the simplex to 
the point P. 

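It is of interest to discuss the volumes of the simplexes. Let $\Delta^{(p)}$ and $\Delta$ designate the volumes of 
the simplexes $T^{(p)}$ and T respectively. From (2.18) it follows that 


$\dfrac{\Delta^{(p)}}{\Delta} = \det\left\{D_p^{-1}F^*\operatorname{diag}\left(1, \left(-\dfrac{1}{n}\right)^p, \ldots, \left(-\dfrac{1}{n}\right)^p\right)F D_0\right\} = \left(-\dfrac{1}{n}\right)^{pn}\lambda_0\lambda_1\cdots\lambda_n/\det D_p.$ 


(See, e.g., Benson [1], p. 188.) 
Allowing p → ∞, we obtain 


(2.20)  $\lim_{p\to\infty}(-n)^{pn}\,\dfrac{\Delta^{(p)}}{\Delta} = (n + 1)^{n+1}\lambda_0\lambda_1\cdots\lambda_n.$ 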

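To discuss the rate of convergence, we may proceed as follows: 

$\operatorname{diag}(1, \beta_p, \ldots, \beta_p) = \operatorname{diag}(\beta_p, \ldots, \beta_p) + \operatorname{diag}(1 - \beta_p, 0, 0, \ldots, 0).$ 


Hence, 

$F^*\operatorname{diag}(1, \beta_p, \ldots, \beta_p)\,F = \beta_p I + \dfrac{1 - \beta_p}{n + 1}J.$ 

Thus, from (2.18), 

(2.21)  $T^{(p)} = D_p^{-1}\left(\beta_p I + \dfrac{1 - \beta_p}{n + 1}J\right)D_0 T = D_p^{-1}\left[\beta_p D_0 T + \dfrac{1 - \beta_p}{n + 1}P\right],$ 

where P on the right also stands for the column $J D_0 T$, all of whose entries are P. 
In view of (2.14), (2.16′), 

(2.22)  $D_p = \dfrac{1 - \beta_p}{n + 1}I + \beta_p D_0.$ 


Combining this with (2.21) yields 


$\dfrac{1}{\beta_p}\left(T^{(p)} - P\right) = D_p^{-1}D_0(T - P);$ 


hence 
THEOREM 2.1. 

(2.23)  $\lim_{p\to\infty}\dfrac{1}{\beta_p}\left(T^{(p)} - P\right) = (n + 1)D_0(T - P).$ 


Thus, the vertices of the simplex $T^{(p)}$ converge to P with geometric rapidity (−1/n), and the 
right-hand side of (2.23) identifies the leading coefficient in the asymptotic expansion. 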
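The limit (2.23) can be checked numerically. The sketch below is an illustration rather than part of the paper: it assumes NumPy, stores the vertices of the simplex as the rows of an array, and uses hypothetical function names. Each step moves every vertex through P onto the opposite face, using $T_i^{(1)} = (P - \lambda_i T_i)/(1 - \lambda_i)$.

```python
import numpy as np


def barycentric(vertices, p):
    """Barycentric coordinates of p with respect to the rows of `vertices`."""
    k = len(vertices)
    system = np.vstack([vertices.T, np.ones(k)])  # sum(lam_i * T_i) = p, sum(lam_i) = 1
    return np.linalg.solve(system, np.append(p, 1.0))


def project_simplex(vertices, p):
    """One step of the construction: send each vertex through p onto the opposite face."""
    lam = barycentric(vertices, p)[:, None]
    return (p - lam * vertices) / (1.0 - lam)


T = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])  # a triangle, so n = 2
P = np.array([0.2, 0.3])
lam0 = barycentric(T, P)

current = T.copy()
for _ in range(20):
    current = project_simplex(current, P)

n = len(T) - 1
beta = (-1.0 / n) ** 20
scaled = (current - P) / beta              # left side of (2.23) at p = 20
limit = (n + 1) * lam0[:, None] * (T - P)  # right side of (2.23)
```

For this triangle the rows of `scaled` and `limit` agree to several decimal places, and the distance of each vertex from P roughly halves at every step, in line with the ratio −1/n for n = 2.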


3. Another Generalization of the Plane Case. In this section we consider a certain iterative 
process which starts from a given triangle and which generalizes the plane case in a different way. 
It is particularly convenient in this case to place the triangle in the complex plane. We denote a 
point in the plane and its complex coordinates both by the same letter. For example, the triangle 
abc represents a triangle with the vertices whose complex coordinates are a, b and c. 


FIG. 4 


Assume that a triangle abc is arbitrarily given and that $a_1, b_1, c_1$ are points on its sides 
bc, ca, ab respectively (Fig. 4). The intersection of the line segments $aa_1$ and $b_1c_1$ is denoted by $a_2$. 
Symmetrically we shall use $b_2, c_2$ for the other two intersections. Doing the same thing with the triangle 
$a_1b_1c_1$ and the three points $a_2, b_2, c_2$ on its sides, we obtain a triangle $a_3b_3c_3$. Generally, 
starting from the triangle $a_{n-1}b_{n-1}c_{n-1}$ and three points $a_n, b_n, c_n$ on its sides, a triangle 
$a_{n+1}b_{n+1}c_{n+1}$ will be constructed by the same method. The following questions arise: 

(1) To what points will $a_n, b_n, c_n$ converge as n → ∞? 

(2) How fast is the convergence? 


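Assume that 


(3.1)  $\begin{pmatrix} a_1 \\ b_1 \\ c_1 \end{pmatrix} = \begin{pmatrix} 0 & 1-\alpha & \alpha \\ \beta & 0 & 1-\beta \\ 1-\gamma & \gamma & 0 \end{pmatrix}\begin{pmatrix} a \\ b \\ c \end{pmatrix},$ 


where 0 < α, β, γ < 1. The numbers α, β, γ are completely determined once $a_1, b_1, c_1$ are given. 
To solve these two problems, we must express $a_n, b_n, c_n$ in terms of a, b, c and α, β, γ. We begin 
with the case n = 1. Since $a_2$ is on both line segments $b_1c_1$ and $aa_1$, we can write $a_2$ as 


(3.2)  $a_2 = (1 - \alpha_1)b_1 + \alpha_1 c_1 = (1 - \mu)a + \mu a_1,$ 


where $0 < \alpha_1, \mu < 1$. $\alpha_1$ can be determined as follows. Inserting (3.1) into both sides of (3.2) and 
equating the coefficients of b, c, respectively, we obtain 


(3.3)  $\alpha_1\gamma = \mu(1 - \alpha), \qquad (1 - \alpha_1)(1 - \beta) = \alpha\mu.$ 

Solving for $\alpha_1$ from (3.3), we get 

$\alpha_1 = \dfrac{(1 - \alpha)(1 - \beta)}{(1 - \alpha)(1 - \beta) + \alpha\gamma}.$ 


After similar manipulations have been applied to $b_2$ and $c_2$, we get 


(3.4)  $\begin{pmatrix} a_2 \\ b_2 \\ c_2 \end{pmatrix} = \begin{pmatrix} 0 & 1-\alpha_1 & \alpha_1 \\ \beta_1 & 0 & 1-\beta_1 \\ 1-\gamma_1 & \gamma_1 & 0 \end{pmatrix}\begin{pmatrix} a_1 \\ b_1 \\ c_1 \end{pmatrix},$ 

where 

(3.5)  $\alpha_1 = \dfrac{(1 - \alpha)(1 - \beta)}{(1 - \alpha)(1 - \beta) + \alpha\gamma}, \quad \beta_1 = \dfrac{(1 - \beta)(1 - \gamma)}{(1 - \beta)(1 - \gamma) + \beta\alpha}, \quad \gamma_1 = \dfrac{(1 - \gamma)(1 - \alpha)}{(1 - \gamma)(1 - \alpha) + \gamma\beta}.$ 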

Incidentally, in the above we have employed a standard method for finding the intersection of 
two line segments whose end-points are known in complex coordinates. 
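The intersection computation can also be done numerically. The following Python sketch is an illustration under stated assumptions: points are Python complex numbers, `intersect` is a hypothetical helper name, and the lines are intersected with a cross-product formula rather than by equating coefficients as in (3.2) and (3.3). It then compares the result with the value of $\alpha_1$ in (3.5).

```python
def intersect(p, q, r, s):
    """Intersection of the line through p, q with the line through r, s (complex numbers)."""
    d1, d2 = q - p, s - r
    cross = (d1.conjugate() * d2).imag  # 2-D cross product d1 x d2
    if cross == 0:
        raise ValueError("the lines are parallel")
    t = ((r - p).conjugate() * d2).imag / cross
    return p + t * d1


a, b, c = 0j, 1 + 0j, 1j
alpha, beta, gamma = 0.3, 0.4, 0.6
a1 = (1 - alpha) * b + alpha * c  # first row of (3.1)
b1 = beta * a + (1 - beta) * c    # second row of (3.1)
c1 = (1 - gamma) * a + gamma * b  # third row of (3.1)

a2 = intersect(a, a1, b1, c1)
alpha1 = (1 - alpha) * (1 - beta) / ((1 - alpha) * (1 - beta) + alpha * gamma)
```

With these sample values `a2` coincides, up to rounding, with `(1 - alpha1) * b1 + alpha1 * c1`, as (3.2) requires.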

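When we proceed from the triangle $a_nb_nc_n$ to the triangle $a_{n+1}b_{n+1}c_{n+1}$, no additional effort 
need be made except to raise each subscript by n − 1 in (3.4) and (3.5): 


(3.6)  $\begin{pmatrix} a_{n+1} \\ b_{n+1} \\ c_{n+1} \end{pmatrix} = \begin{pmatrix} 0 & 1-\alpha_n & \alpha_n \\ \beta_n & 0 & 1-\beta_n \\ 1-\gamma_n & \gamma_n & 0 \end{pmatrix}\begin{pmatrix} a_n \\ b_n \\ c_n \end{pmatrix},$ 

where 

(3.7)  $\alpha_n = \dfrac{(1 - \alpha_{n-1})(1 - \beta_{n-1})}{(1 - \alpha_{n-1})(1 - \beta_{n-1}) + \alpha_{n-1}\gamma_{n-1}},$ 

$\beta_n = \dfrac{(1 - \beta_{n-1})(1 - \gamma_{n-1})}{(1 - \beta_{n-1})(1 - \gamma_{n-1}) + \beta_{n-1}\alpha_{n-1}},$ 

$\gamma_n = \dfrac{(1 - \gamma_{n-1})(1 - \alpha_{n-1})}{(1 - \gamma_{n-1})(1 - \alpha_{n-1}) + \gamma_{n-1}\beta_{n-1}},$ 


n = 1, 2, 3, ..., with $\alpha_0 = \alpha$, $\beta_0 = \beta$, $\gamma_0 = \gamma$. 
Let A, B, C be the intersections two by two cyclically of the segments $aa_1, bb_1, cc_1$. By the 
general method shown above we have 


(3.8)  $\begin{pmatrix} A \\ B \\ C \end{pmatrix} = \operatorname{diag}\left(\dfrac{1}{1-\gamma+\beta\gamma}, \dfrac{1}{1-\alpha+\gamma\alpha}, \dfrac{1}{1-\beta+\alpha\beta}\right)\begin{pmatrix} \beta(1-\gamma) & \beta\gamma & (1-\beta)(1-\gamma) \\ (1-\gamma)(1-\alpha) & \gamma(1-\alpha) & \gamma\alpha \\ \alpha\beta & (1-\alpha)(1-\beta) & \alpha(1-\beta) \end{pmatrix}\begin{pmatrix} a \\ b \\ c \end{pmatrix}.$ 


Geometric constructions suggest to us intuitively that the sequence of triangles $a_nb_nc_n$ will 
converge to the triangle ABC in a certain manner. 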

Denote the areas of the triangle ABC and the triangle abc by Δ(ABC) and Δ(abc) respectively. 
From (3.8) it follows that 


$\Delta(ABC) = \dfrac{\Delta(abc)}{(1-\gamma+\beta\gamma)(1-\alpha+\gamma\alpha)(1-\beta+\alpha\beta)}\begin{vmatrix} \beta(1-\gamma) & \beta\gamma & (1-\beta)(1-\gamma) \\ (1-\gamma)(1-\alpha) & \gamma(1-\alpha) & \gamma\alpha \\ \alpha\beta & (1-\alpha)(1-\beta) & \alpha(1-\beta) \end{vmatrix},$ 

(3.9)  $\Delta(ABC) = \dfrac{[\alpha\beta\gamma - (1-\alpha)(1-\beta)(1-\gamma)]^2}{(1-\gamma+\beta\gamma)(1-\alpha+\gamma\alpha)(1-\beta+\alpha\beta)}\,\Delta(abc).$ 


This formula was discovered by E. J. Routh in 1896 (see Introduction to Geometry by Coxeter, p. 
211). Since A = B = C if and only if Δ(ABC) = 0, it follows from (3.9) that A = B = C if and only if 


(3.10)  $\alpha\beta\gamma = (1 - \alpha)(1 - \beta)(1 - \gamma).$ 

This is the famous theorem of Ceva and the lines $aa_1, bb_1, cc_1$ are often called Cevians of the 
triangle abc (see Introduction to Geometry, p. 220). 
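A small numerical check of (3.8), (3.9) and (3.10), written as a Python sketch. The function names are illustrative and points are taken as complex numbers; none of this is from the original.

```python
def routh_ratio(al, be, ga):
    """Area ratio Delta(ABC) / Delta(abc) according to (3.9)."""
    num = (al * be * ga - (1 - al) * (1 - be) * (1 - ga)) ** 2
    den = (1 - ga + be * ga) * (1 - al + ga * al) * (1 - be + al * be)
    return num / den


def inner_triangle(a, b, c, al, be, ga):
    """Vertices A, B, C computed row by row from (3.8)."""
    A = (be * (1 - ga) * a + be * ga * b + (1 - be) * (1 - ga) * c) / (1 - ga + be * ga)
    B = ((1 - ga) * (1 - al) * a + ga * (1 - al) * b + ga * al * c) / (1 - al + ga * al)
    C = (al * be * a + (1 - al) * (1 - be) * b + al * (1 - be) * c) / (1 - be + al * be)
    return A, B, C


def signed_area(p, q, r):
    """Signed area of the triangle pqr, with vertices given as complex numbers."""
    return 0.5 * ((q - p).conjugate() * (r - p)).imag


a, b, c = 0j, 1 + 0j, 1j
A, B, C = inner_triangle(a, b, c, 0.3, 0.4, 0.6)
ratio = signed_area(A, B, C) / signed_area(a, b, c)
```

Here `ratio` agrees with `routh_ratio(0.3, 0.4, 0.6)`. Taking α = β = γ = 1/3 gives the familiar ratio 1/7, and α = β = γ = 1/2 gives 0, which is the concurrent (Ceva) case (3.10).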

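If (3.10) holds, then $aa_1, bb_1, cc_1$ meet at the same point and the problems are reduced to the 
planar case discussed carefully in the previous section. Hence we assume $\alpha\beta\gamma \neq (1 - \alpha)(1 - \beta)(1 - \gamma)$. 
In this case we can invert (3.8) to get 


(3.11)  $\begin{pmatrix} a \\ b \\ c \end{pmatrix} = \dfrac{1}{\alpha\beta\gamma - (1-\alpha)(1-\beta)(1-\gamma)}\begin{pmatrix} 0 & -(1-\beta) & \gamma \\ \alpha & 0 & -(1-\gamma) \\ -(1-\alpha) & \beta & 0 \end{pmatrix}\operatorname{diag}(1-\gamma+\beta\gamma,\ 1-\alpha+\gamma\alpha,\ 1-\beta+\alpha\beta)\begin{pmatrix} A \\ B \\ C \end{pmatrix}.$ 


It seems to us very complicated to deal with the general case, so we assume that α = β = γ ≠ 1/2. 
In this case we have $\alpha_n = \beta_n = \gamma_n$ (n = 0, 1, 2, ...), and (3.6), (3.7), (3.11) are reduced respectively 
to the following simpler forms 


(3.12)  $\begin{pmatrix} a_{n+1} \\ b_{n+1} \\ c_{n+1} \end{pmatrix} = \left[(1 - \alpha_n)\Pi + \alpha_n\Pi^2\right]\begin{pmatrix} a_n \\ b_n \\ c_n \end{pmatrix},$ 


where Π is the simple circulant matrix: 


$\Pi = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{pmatrix};$ 

(3.13)  $\alpha_n = \dfrac{(1 - \alpha_{n-1})^2}{(1 - \alpha_{n-1})^2 + \alpha_{n-1}^2}$ 


for n = 1, 2, 3, ... and 


(3.14)  $\begin{pmatrix} a \\ b \\ c \end{pmatrix} = \dfrac{1}{1 - 2\alpha}\left[(1 - \alpha)\Pi - \alpha\Pi^2\right]\begin{pmatrix} A \\ B \\ C \end{pmatrix}.$ 

We need the following lemmas: 


LEMMA 1. We have 


(3.15)  $\alpha_n = \begin{cases} \dfrac{\alpha^{2^n}}{\alpha^{2^n} + (1-\alpha)^{2^n}}, & n \text{ even}, \\[2ex] \dfrac{(1-\alpha)^{2^n}}{\alpha^{2^n} + (1-\alpha)^{2^n}}, & n \text{ odd}. \end{cases}$ 


The proof is omitted. It is a straightforward mathematical induction. 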
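It will be more convenient to express $a_n, b_n, c_n$ in terms of A, B, C. To make the expressions 
neater, we introduce the following 3 × 3 matrix 

$Q = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}.$ 

Note that $Q^n = I$, the 3 × 3 identity matrix, when n is even and $Q^n = Q$ when n is odd, and that 

(3.16)  $Q\Pi Q = \Pi^2, \qquad Q\Pi^2 Q = \Pi.$ 

LEMMA 2. Let 

(3.17)  $\lambda_n = \left(\dfrac{\alpha}{1 - \alpha}\right)^{2^n}$ 

for n = 0, 1, 2, ...; then we have 

(3.18)  $\begin{pmatrix} a_n \\ b_n \\ c_n \end{pmatrix} = \dfrac{1}{1 - \lambda_n}Q^n(\Pi - \lambda_n\Pi^2)Q^n\begin{pmatrix} A \\ B \\ C \end{pmatrix}.$ 


Proof. By induction. 
For n = 0, (3.18) reduces to (3.14). 
Assume that (3.18) is true for an even integer n. Then by (3.12) and (3.18) we have 


$\begin{pmatrix} a_{n+1} \\ b_{n+1} \\ c_{n+1} \end{pmatrix} = \left[(1 - \alpha_n)\Pi + \alpha_n\Pi^2\right]\dfrac{1}{1 - \lambda_n}(\Pi - \lambda_n\Pi^2)\begin{pmatrix} A \\ B \\ C \end{pmatrix}$ 

$\phantom{\begin{pmatrix} a_{n+1} \end{pmatrix}} = \dfrac{1}{1 - \lambda_n}\left\{[\alpha_n - (1 - \alpha_n)\lambda_n]I - \alpha_n\lambda_n\Pi + (1 - \alpha_n)\Pi^2\right\}\begin{pmatrix} A \\ B \\ C \end{pmatrix}.$ 


Since for even n from (3.17) and (3.15) we see that 

$\alpha_n = \dfrac{\lambda_n}{1 + \lambda_n},$ 

or 

$\alpha_n - (1 - \alpha_n)\lambda_n = 0,$ 

thus 

$1 - \alpha_n = \dfrac{1}{1 + \lambda_n}, \qquad \alpha_n\lambda_n = \dfrac{\lambda_n^2}{1 + \lambda_n},$ 

hence 


$\begin{pmatrix} a_{n+1} \\ b_{n+1} \\ c_{n+1} \end{pmatrix} = \dfrac{1}{(1 - \lambda_n)(1 + \lambda_n)}(-\lambda_n^2\Pi + \Pi^2)\begin{pmatrix} A \\ B \\ C \end{pmatrix}$ 

$\phantom{\begin{pmatrix} a_{n+1} \end{pmatrix}} = \dfrac{1}{1 - \lambda_{n+1}}Q(\Pi - \lambda_{n+1}\Pi^2)Q\begin{pmatrix} A \\ B \\ C \end{pmatrix}$ 

$\phantom{\begin{pmatrix} a_{n+1} \end{pmatrix}} = \dfrac{1}{1 - \lambda_{n+1}}Q^{n+1}(\Pi - \lambda_{n+1}\Pi^2)Q^{n+1}\begin{pmatrix} A \\ B \\ C \end{pmatrix}.$ 


Similarly, if we assume that (3.18) is true for odd n, we can also prove (3.18) is true for n + 1. 
This completes the induction. 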
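From (3.18) we have 


(3.19)  $a_n = \begin{cases} (B - \lambda_n C)/(1 - \lambda_n), & \text{for } n \text{ even}, \\ (C - \lambda_n B)/(1 - \lambda_n), & \text{for } n \text{ odd}. \end{cases}$ 


Without loss of generality we assume that 0 < α < 1/2; in this case 0 < α/(1 − α) < 1. Hence from 
(3.19) it follows that 

(3.20)  $a_n = \begin{cases} B + \lambda_n(B - C) + O(\lambda_n^2), & n \text{ even}, \\ C + \lambda_n(C - B) + O(\lambda_n^2), & n \text{ odd}. \end{cases}$ 


Since $\lambda_n \to 0$ as n → ∞, then by (3.20) we have the convergence of the sequences of the 
vertices, as shown in the following theorem. 


THEOREM 3.1. For 0 < α < 1/2, we have 


$\lim a_n = \begin{cases} B, & \text{even } n \to \infty, \\ C, & \text{odd } n \to \infty; \end{cases}$ 

$\lim b_n = \begin{cases} C, & \text{even } n \to \infty, \\ A, & \text{odd } n \to \infty; \end{cases}$ 

$\lim c_n = \begin{cases} A, & \text{even } n \to \infty, \\ B, & \text{odd } n \to \infty. \end{cases}$ 


From (3.18) we have 

$\Delta(a_nb_nc_n) = \dfrac{1 - \lambda_n^3}{(1 - \lambda_n)^3}\Delta(ABC) = \dfrac{1 + \lambda_n + \lambda_n^2}{(1 - \lambda_n)^2}\Delta(ABC) = \Delta(ABC) + 3\lambda_n\Delta(ABC) + \text{smaller terms}.$ 


Hence problem (2) is answered by 


THEOREM 3.2. Assume that 0 < α < 1/2; then 


(3.21)  $0 < \Delta(a_nb_nc_n) - \Delta(ABC) = O\left(\left(\dfrac{\alpha}{1 - \alpha}\right)^{2^n}\right)$ 


as n → ∞. 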
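The area formula behind Theorem 3.2 can be verified exactly for a sample value of α. The Python sketch below is an illustration with hypothetical helper names: it multiplies the step matrices of (3.12), updates α by (3.13), and records the determinant of the accumulated matrix, which equals Δ(a_n b_n c_n)/Δ(abc).

```python
from fractions import Fraction


def step_matrix(al):
    """The matrix (1 - al) * Pi + al * Pi^2 of (3.12)."""
    return [[0, 1 - al, al], [al, 0, 1 - al], [1 - al, al, 0]]


def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(3)) for j in range(3)] for i in range(3)]


def det3(M):
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))


def area_ratios(alpha, steps):
    """Delta(a_n b_n c_n) / Delta(abc) for n = 0, ..., steps in the case alpha = beta = gamma."""
    al = Fraction(alpha)
    M = [[Fraction(int(i == j)) for j in range(3)] for i in range(3)]
    out = [det3(M)]
    for _ in range(steps):
        M = matmul(step_matrix(al), M)                    # (3.12)
        out.append(det3(M))
        al = (1 - al) ** 2 / ((1 - al) ** 2 + al ** 2)    # (3.13)
    return out


ratios = area_ratios(Fraction(1, 3), 4)
```

For α = 1/3 we have Δ(ABC)/Δ(abc) = 1/7 by (3.9) and λ_n = (1/2)^{2^n}, so the entries of `ratios` should equal (1/7)(1 + λ_n + λ_n²)/(1 − λ_n)²; the first three are 1, 1/3 and 13/75, and the excess over 1/7 shrinks like λ_n.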


The planar cases of Section 2 and Section 3 differ in that in Section 2 the cevians of the starting 
triangle are assumed to be concurrent while in Section 3 they are not. The interesting and 
surprising thing about the asymptotics of convergence is that in the first case, the convergence is 
geometric, while in the second, it is supergeometric. (Numerical analysts would say that the 
convergence is linear in the first and quadratic in the second cases.) 

This difference is reflected in the fact that if one draws the figures with a ruler and pencil, in 
the first case many iterations may be drawn, but in the second, only a few. One might say 
therefore that the difference in the rates of convergence can be intuited visually. 
A STATISTICAL ANALYSIS OF CASINO BLACKJACK 


MARTIN H. MILLMAN 
Department of Mathematics, William Paterson College, Wayne, NJ 07470 


Introduction. A recent (3/82) ruling of the New Jersey Supreme Court, preventing legal 
gambling casinos from expelling “skilled” blackjack players, prompted the initial interest for 
undertaking this study. With a view toward determining what advantage a player (or group of 
players) using correct strategy might obtain over the “house” in casino blackjack, a Vax 11/780 
computer was programmed to simulate the game. The model for this simulation was one “player” 
versus the “dealer,” with the computer executing both roles. The program shuffled and dealt the 
cards, placed the “player’s” bets, played the “player’s” and “dealer’s” hands, settled the bets, kept 
track of the “player’s” capital and, in general, followed actual casino rules* and procedures in 
every essential detail. Separate experiments were performed for the Las Vegas and the Atlantic 
City versions of the game. In Las Vegas blackjack, four standard decks of cards are employed, 
three of which are dealt out before reshuffling. In Atlantic City six decks are employed, four of 
which are dealt out before reshuffling**. In both cases, the rules governing play of the hands are 
identical. 

In order to investigate the expected win for a player using a correct strategy, a “card-counting” 
strategy, developed by Baldwin, et al. [1], Thorp [2], [3], Braun [4] and others, was incorporated in 
the program. The player’s bets and playing decisions were then made by the computer in 
accordance with this strategy. The strategy, as will be seen presently, yields modest wins for the 
player under both versions of the rules. 

The dealer is allowed no playing decisions in casino blackjack. The dealer’s hands were 
executed according to the fixed rule: “hit” totals of 16 or less and “stand” on totals of 17 or more. 
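The dealer's fixed rule is simple enough to state as code. The following Python sketch is not the author's program: it assumes cards are represented by their point values with an ace stored as 1, that an ace is counted as 11 whenever this does not exceed 21, and that the dealer therefore stands on a "soft" 17. The function names are illustrative.

```python
def hand_total(cards):
    """Best total of a hand; cards are point values, with each ace given as 1."""
    total = sum(cards)
    if 1 in cards and total + 10 <= 21:
        return total + 10  # count one ace as 11
    return total


def play_dealer(hand, draw):
    """Hit totals of 16 or less, stand on totals of 17 or more.

    `draw` is any zero-argument callable that returns the next card.
    """
    hand = list(hand)
    while hand_total(hand) <= 16:
        hand.append(draw())
    return hand


deck = iter([5, 10])
final = play_dealer([10, 6], lambda: next(deck))  # 16 must hit; the 5 makes 21
```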


The Player Strategy. The program provides the “player” with a winning strategy that consists 


Martin H. Millman received his B.S. at City College of New York (1962) and his Ph.D. at New York University 
(1968) under J. B. Keller. He has been a postdoctoral fellow at the Courant Institute of Mathematical Sciences, has 
taught at New York University, New Jersey Institute of Technology, and Fairleigh Dickinson University and is 
currently a member of the Mathematics Department at William Paterson College, Wayne, New Jersey. As a result of 
the research reported here, his interest in blackjack has been limited to an occasional game with his little daughter, 
Tasha. Other interests include paddleball, chinese checkers, and computer graphics. 


*Complete rules of casino blackjack may be found in [2]. Our simulation allowed the player to double down on 
split pairs, but did not allow resplitting of split pairs. This is standard practice in Atlantic City casinos and common 
in Las Vegas. 

**A crucial aspect of any simulation of this kind is the ability of the random number generator to supply 
“pseudorandom” sequences of integers. In this experiment such sequences were used to produce “shuffled” decks of 
cards. The generator employed was of the linear congruential type. It was χ²-tested and found acceptable as a source 
of sample integers from a uniform distribution of integers in the interval [1,312]. 
of two components: the “truecount,” which determines the player’s bets from hand to hand, and 
the method for playing the hands. Briefly, the player assigns the “count-value” +1 to each 2, 3, 4, 
5 and 6-point card, the count-value 0 to each 7, 8 and 9-point card, and the count-value −1 to 
each 10-point card and each ace. The player then keeps a running total of these values as cards 
appear during the game. Whenever shuffling occurs, this running total is reset to zero. 
Immediately before each betting opportunity, the player computes the “truecount” by dividing the 
momentary value of the running total by the number of thus far unused decks of cards. The 
truecount measures the “concentration” of high cards (tens and aces) in the cards remaining to be 
played before reshuffle. It is therefore proportional to the probability that the next card dealt will 
be a high one. Braun has shown that the player’s expectation on the next hand is approximately 
proportional to (truecount − 1), if the hand is played correctly. The strategy exploits this fact by 
prescribing a minimum bet on the next hand when the truecount is less than +2, and a maximum 
bet when the truecount equals or exceeds +2. The minimum bet in our experiment was taken to 
be one “chip” while the maximum bet was fixed at 20 “chips.” (The monetary value of one chip is 
immaterial in our statistical analysis.) It is of interest to note that variations of bet size are 
essential for the success of the strategy. If, for example, the same bet is made on every hand, then 
the player’s rate of profit becomes so small compared to his risk, that the game is of no interest. 
By contrast, our 1-20 bet variation gives the player a rate of profit that approaches the maximum 
rate attainable by the strategy. 
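The counting and betting rule described above can be sketched in Python. This is an illustration, not the program used in the study: it assumes cards are point values with an ace stored as 1, and it measures the unused decks as a fraction (cards remaining divided by 52), which the text does not specify. All names are hypothetical.

```python
# Count-values: +1 for 2 through 6, 0 for 7 through 9, -1 for ten-point cards and aces (ace = 1).
COUNT_VALUE = {2: 1, 3: 1, 4: 1, 5: 1, 6: 1, 7: 0, 8: 0, 9: 0, 10: -1, 1: -1}


def true_count(seen_cards, decks_in_shoe):
    """Running total of the cards seen since the shuffle, divided by the decks still unused."""
    running = sum(COUNT_VALUE[card] for card in seen_cards)
    decks_left = decks_in_shoe - len(seen_cards) / 52
    return running / decks_left


def bet_size(tc, min_bet=1, max_bet=20):
    """Minimum bet below a truecount of +2, maximum bet at +2 or above."""
    return max_bet if tc >= 2 else min_bet


seen = [5] * 13 + [9] * 13            # half a deck seen, running total +13
tc = true_count(seen, decks_in_shoe=4)
bet = bet_size(tc)
```

In this example the running total is 13 with 3.5 decks unused, so the truecount is about 3.7 and the bet is the 20-chip maximum.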

The method of playing the hand employed by our hypothetical player is summarized in matrix 
form in Table I. The row labels running down the left side of the table represent all possible 
player hands. The third group of hands, A2 through A10, represents the so-called “soft” hands. 
(Here, as elsewhere in the table, A stands for an ace.) These hands contain at least one ace and 
total eleven or less when the aces are counted as ones. They cannot bust, even if “hit” with a 
10-point card. The number of cards in a hand is immaterial. For example, A8 represents hands 
such as A-8, A-5-3, or even A-2-3-A-2. 

The second group of hands in Table I, 2-2 through A-A, represents two-card hands that are 
pairs. Finally, the first group, 5 through 21, represents hands, having the given totals, which are 
neither “soft” nor pairs. For example, 15 represents hands like 10-5, 7-6-2 or 9-5-A. 
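The three groups of row labels can be made concrete with a short Python sketch. It assumes a hand is a list of point values with each ace stored as 1; the function name is illustrative and the table entries themselves are not reproduced.

```python
def table_row(cards):
    """Row label of Table I for a hand; cards are point values, with each ace given as 1."""
    def name(card):
        return "A" if card == 1 else str(card)

    if len(cards) == 2 and cards[0] == cards[1]:
        return f"{name(cards[0])}-{name(cards[1])}"  # pairs: 2-2 through A-A
    total = sum(cards)                               # aces counted as ones
    if 1 in cards and total <= 11:
        return f"A{total - 1}"                       # "soft" hands: A2 through A10
    return str(total)                                # totals that are neither soft nor pairs


rows = [table_row(hand) for hand in ([1, 8], [1, 5, 3], [1, 2, 3, 1, 2], [10, 5], [9, 5, 1], [7, 7])]
```

The first three hands all map to the row A8 and the next two to the row 15, matching the examples in the text.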

The column labels running across the top of Table I represent all possible dealer “up 
(exposed)-cards.” The matrix entries determine the player’s action for each possible combination 
of player hand and exposed dealer card. Possible player actions are abbreviated as follows: 
H = “hit,” S = “stand,” D = “double,” P = “split a pair.” Thus, for example, the player will 
stand with 5-3-4 if the dealer shows a 6 and split 7-7 if the dealer shows a 2. 

In addition to the strategy already outlined, our player also made the optional “insurance” bet 
(see [2]) when the dealer’s up-card was an ace if the truecount exceeded +3. Finally, if the 
truecount at the conclusion of a hand was negative, our player “left the table” to resume playing 
with freshly-shuffled cards elsewhere in the hypothetical casino. This tactic greatly reduced the 
proportion of hands played at negative expectation and thus increased the player’s net profit 
significantly. It was simulated in the program by having the computer shuffle and begin dealing 
anew whenever the truecount at the conclusion of a hand was negative. 

More sophisticated playing strategies exist than the one employed here. The principal improve- 
ment that can be introduced is to vary the playing decisions, as well as the bet size, as the 
truecount fluctuates. Braun [4] has in fact computed the optimal playing strategy for all practical 
values of truecount by simulation methods. While such improved strategies do result in slightly 
larger profits, the great burden they place on a human player’s memory, concentration and 
accuracy probably renders them uneconomical. We have therefore utilized the simpler strategy 
discussed above. 


Data and Results. The statistical parameters of a “game” consisting of twenty thousand 
consecutive hands of casino blackjack, in which the player employs our strategy, were investigated. 
Data were collected for samples consisting of 600 trials of twenty thousand hands each for 
the Las Vegas case and 400 trials of twenty thousand hands each for the Atlantic City case. In 
total, a random sample of twenty million hands was executed during nearly 21 (no pun) hours of 
computing time. 

The probability that the player’s final capital exceeds his initial capital after a game consisting 
of N hands is an increasing function of N. Hereafter we refer to this probability simply as the 
“win probability,” denoted by $P_w$. The rather large value N = 20,000 was selected for this 
experiment in order to ensure interestingly large values of $P_w$ under both versions of the rules. 
Practically speaking, the actual playing of twenty thousand hands in a casino would consume 
some 300 to 400 human hours. 

Table II gives a summary of the sample data. The first column reports the proportion of trials 
in the sample that resulted in a net win (of any amount) for the player. The second column reports 
the mean number of chips won, averaged over all the trials in the sample and rounded to the 
nearest whole chip. The third column then gives the sample standard deviation of these wins, 
again rounded to an integer. 

We define the player’s “edge” in a single twenty-thousand-hand trial to be the net number of
chips won divided by the total number of chips bet. The “edge” is therefore the player’s rate of
profit on his investment. The fourth column of the table reports the mean edge, averaged over all 
the trials in the sample. 

Finally, the player was allowed unlimited credit in each independent trial and the worst state of 
his funds during each trial was tracked by the computer. At the conclusion of each trial the 
computer updated the largest single-trial deficit yet encountered. The largest single-trial deficit in 
the entire sample is reported in the last column. Thus, for example, our Las Vegas player had 
accumulated a net loss of 4961 chips at some point during his most distressing trial. 


TABLE II. Sample data.


Proportion    Mean Win    Standard Deviation    Mean Player Edge    Largest Player Deficit
of Wins       (chips)     (chips)               (%)                 (chips)


The quantities in each column of Table II serve as estimators for the quantities in the
corresponding columns of Table III. The first three columns of Table III give 95% confidence
intervals for the win probability $P_N$, the expected win $\mu$, and the standard deviation $\sigma$, respectively,
for our twenty-thousand-hand game. All chip-values are again rounded to the nearest integer.

The fourth column of Table III gives a very close estimate of the player’s expected edge. The 
figures in the last column of the table are to be interpreted as rough statistical solutions of one 
aspect of the classical “gambler’s ruin” problem. This problem recognizes the fact that although 
the player may be capable of winning in the long run, he may nevertheless be ruined (lose all his 
capital) in one of the large negative fluctuations that inevitably occur. In order to avoid this 
unhappy possibility, the player needs to know how much initial capital he requires to survive till 
the end of the game. Thus, for example, 4962 chips is the “required initial capital” for the Las
Vegas game in the sense that a player possessing such capital (and employing our strategy) has 
zero experimental probability of going broke*. 


TABLE III. Parameters for the twenty-thousand-hand game.


Win            Expected Win    Standard Deviation    Expected Player    Required Initial
Probability    (chips)         (chips)               Edge (%)           Capital (chips)


*These “required capital” figures are merely rough estimates of the initial capital required to participate in the
game with zero theoretical probability of going broke. There is over a 98% chance, though, that each exceeds the top
1% percentile of the underlying theoretical distribution of maximum deficit per trial. Of more practical value would
have been the experimental distribution of maximum deficit. The top 5% percentile of this distribution, for example,
would have been a rather accurate estimate of the capital necessary to play with a 5% probability of ruin.
Regrettably, these more accurate percentile figures could not be obtained due to severe limitations on the amount of
computer memory (“disk quota”) that was available to the author. Thus, only the inaccurate 0% percentile figures
quoted in Table III could be printed out. This omission might be remedied in a future rerun of the experiment.


Fig. 1 is a histogram representation of the 600-trial Las Vegas experiment. It shows the number
of trials that fell within each profit interval of width 200, from −5000 to +8000 chips. For
example, the histogram shows that 40 of the 600 trials resulted in wins of between 1200 and 1400
chips, while only 5 trials resulted in losses of between 1200 and 1400 chips.

The total shaded area is equal to 120,000 chip-occurrences and 87% of this area lies to the right 
of the origin. A normal curve, normalized to area 120,000, having the same mean and standard 
deviation as the histogram ($\mu = 2084$, $\sigma = 1786$), is also shown in Fig. 1. The resemblance
between histogram and curve supports the hypothesis that the theoretical distribution of player
profits is approximately normal. That is, a $\chi^2$ goodness-of-fit test accepts our sample as a random
one from the normal distribution shown. Approximate normality of the theoretical distribution is 
at first somewhat surprising in view of the player’s varying bet size and the fact that consecutive 
hands of blackjack are not strictly independent events. However, a more detailed analysis does 
lead one to expect approximate normality. 
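
As a rough cross-check of the figures quoted above, the following Python sketch evaluates the normal approximation to the win probability, P(profit > 0), from the sample mean 2084 and sample standard deviation 1786. The function name and the use of the error function are illustrative choices made here; they are not part of the original simulation program.

```python
import math

def normal_win_probability(mean, std):
    """Return P(X > 0) for X ~ Normal(mean, std**2), using the error function."""
    z = mean / std
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# Sample statistics of the 600-trial Las Vegas experiment, as quoted in the text.
p = normal_win_probability(2084, 1786)
print(f"normal approximation to the win probability: {p:.3f}")
```

The value obtained is close to the 87% of the histogram area reported to lie to the right of the origin, which is what one would expect if the normal curve fits well.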


Conclusions. Table III shows that when the same strategy is employed by the player in both 
the 4-deck and 6-deck games, the expected win is about twice as great in the 4-deck case, while the
initial capital required is, roughly, only about one and a half times as great. Furthermore, the 
relative fluctuation of the player’s funds is less in the 4-deck game and the win probability and 
player edge are significantly greater. Thus the Las Vegas game is more desirable from the player’s 
point of view. We may restate the difference between the two games in more practical terms by 
assuming the monetary value of a chip to be five dollars (the minimum bet allowed in most 
casinos). Then the player’s bets would range from $5 to $100 per hand. Table III shows that the 
Atlantic City player would require initial capital of roughly $18,110 to participate in the game and 
that his efforts would be rewarded at an average rate of about 28 cents per hand. By contrast, the 
Las Vegas player would require roughly $24,810 initial capital and would expect to win about 52 
cents per hand. 
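
The dollar figures for the Las Vegas game can be reproduced from the chip figures quoted earlier (a mean win of 2084 chips per twenty-thousand-hand game, and 4962 chips of required capital). The sketch below is only an arithmetic check; the constant and function names are hypothetical, and the sample mean is used as a stand-in for the expected win.

```python
CHIP_VALUE_DOLLARS = 5      # assumed monetary value of one chip, as in the text
HANDS_PER_GAME = 20_000     # length of one simulated game

def dollars_per_hand(expected_win_chips):
    """Convert an expected win per game (in chips) to dollars per hand."""
    return expected_win_chips * CHIP_VALUE_DOLLARS / HANDS_PER_GAME

def capital_in_dollars(required_chips):
    """Convert a required initial capital in chips to dollars."""
    return required_chips * CHIP_VALUE_DOLLARS

las_vegas_rate = dollars_per_hand(2084)
las_vegas_capital = capital_in_dollars(4962)
print(f"{las_vegas_rate:.2f} dollars per hand, {las_vegas_capital} dollars of capital")
```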

Since a human player employing our strategy may expect to receive only about 60 hands per 
hour in an actual casino, the expected profit in the more favorable Las Vegas game is a modest 
$32 per hour on an initial investment of roughly $24,810. Taking into account the (approximately) 
13% chance of actually suffering a loss after twenty-thousand hands of tedious labor, casino 
blackjack would appear to be a rather unattractive business venture. The case against card-count- 
ing is further strengthened by the fact that imperfect shuffling and the presence of other players at 
the table would result in lower profits than those stated in Table III. In addition, the 1-20 betting 
ratio employed in the simulation is probably too extreme to be implemented without detection in 
a casino. The result of detection would likely be immediate shuffle whenever a high bet is made 
(Atlantic City) or outright expulsion from the game (Las Vegas). One therefore wonders why 
casinos continually complain about being fleeced by “professional” players.


CONJECTURE 2.

\[ f(n) < \frac{n}{\log n} \quad \text{for } n \neq 144. \]


Both these conjectures have been verified by computer for n < 10,000.


No, they are partial. They are two of the most famous partial differential equators in the world.
Top: Lars Hörmander of Lund; bottom: Olga Ladyženskaja of Leningrad.


DETERMINING A FAIR BORDER 


THEODORE P. HILL 
Department of Mathematics, University of Leiden, Wassenaarseweg 80, Postbus 9512, 2300 RA Leiden, 
The Netherlands 


Introduction. Suppose n countries border on a region the ownership of which is in dispute (Fig. 
1). Is there a way of partitioning the disputed territory so each country receives a single piece 
adjacent to itself which it considers at least 1/n the total value of the territory, even though 
different parts of the territory may be valued differently by individual countries? The main 
purpose of this paper is to show that such fair borders always exist, under the quite natural 
assumption that each country’s value of the territory is nonatomic (i.e., single points have value 
zero).


FIG. 1. 


This border problem is in some respects closely related to several of the classical fair-division 
problems such as the “Problem of Cutting a Cake Fairly” of Steinhaus [2], [9], the “Problem of 
the Nile” described by Fisher [4], [5], the “Problem of Similar Regions” of Neyman and Pearson 
[8], and the “Ham Sandwich Problem” of Ulam [10]. 

Steinhaus first raised the question of whether an object (such as a cake) can be divided among 
n people, who may value different parts of the cake differently, in such a way that each person 
feels he has received at least 1/n the total worth of the cake according to his own value. For n = 2, 
the well-known “one cuts, the other chooses” method always yields a solution (although in some 
respects not an ideal solution, as the second person has an obvious advantage in general). 
Steinhaus showed that the cake-cutting problem has an affirmative solution for n = 3, and then 
Banach and Knaster solved the problem for general n with the following simple, elegant, and 
practical solution: pass a long knife parallel to itself slowly over the top of the cake until one of 
the participants says “stop,” cut the cake at that point and give the piece to that participant, and 
then continue moving the knife. It is easy to see that this procedure guarantees each person at 
least 1/n of the cake, according to his own value, provided that each participant’s value of every 
piece of zero volume is zero. 
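
The moving-knife procedure just described can be sketched on a discretized cake. In the Python sketch below the cake is a row of cells, `values[i][c]` is the (hypothetical) value player i assigns to cell c, and each player calls "stop" as soon as the piece under the knife is worth at least 1/n of the whole to him. Discretization is an assumption made here for illustration: with coarse cells the guarantee holds only approximately in general, although it holds exactly in the example shown.

```python
from fractions import Fraction

def moving_knife(values):
    """Discretized Banach-Knaster procedure.

    values[i][c] is player i's value for cell c; each row sums to 1.
    Returns a dict mapping each player to a half-open cell range (start, end).
    """
    n = len(values)
    cells = len(values[0])
    share = Fraction(1, n)
    remaining = list(range(n))
    pieces = {}
    start = 0
    acc = {i: Fraction(0) for i in remaining}   # value of the piece under the knife
    for cell in range(cells):
        if len(remaining) == 1:
            break
        for i in remaining:
            acc[i] += values[i][cell]
        # The first remaining player who values the current piece at >= 1/n says "stop".
        caller = next((i for i in remaining if acc[i] >= share), None)
        if caller is not None:
            pieces[caller] = (start, cell + 1)
            remaining.remove(caller)
            start = cell + 1
            acc = {i: Fraction(0) for i in remaining}
    # The last remaining player receives whatever is left.
    pieces[remaining[0]] = (start, cells)
    return pieces

twelfth, sixth = Fraction(1, 12), Fraction(1, 6)
values = [
    [twelfth] * 12,             # player 0 values the cake uniformly
    [0] * 6 + [sixth] * 6,      # player 1 values only the right half
    [sixth] * 6 + [0] * 6,      # player 2 values only the left half
]
pieces = moving_knife(values)
shares = {i: sum(values[i][a:b]) for i, (a, b) in pieces.items()}
print(pieces, shares)
```

In this example every player ends up with a piece worth at least 1/3 by his own valuation.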


The author’s formal education has included a Bachelor’s degree at West Point, a Master’s in operations research 
at Stanford, a year as a Fulbright scholar at Göttingen, and a Ph.D. in mathematics at Berkeley in 1977. He has
taught at Washington University and is currently on leave from Georgia Tech as a NATO postdoctoral fellow at the
University of Leiden. His main mathematical interests center around stochastic processes (especially abstract
gambling theory and optimal stopping theory); this excursion into fair-division problems was largely recreational.

The “Problem of the Nile” [2], [4], [5] involves partitioning a set into k pieces (instead of n, as
in the cake-cutting problem) and then evaluating each of n measures on each piece. “Each year 
the Nile would flood, thereby irrigating or perhaps devastating parts of the agricultural land of a 
predynastic Egyptian village. The value of the different portions of the land would depend on the 
height of the flood. In question was the possibility of giving to each of the k residents a piece of 
land whose value would be 1/k of the total land value, no matter what the height of the flood.” 
Feller [3] showed that if there are an infinite number of flood heights possible, the problem need 
not have a solution, whereas Neyman [7] first proved that a (nonconstructive) solution always 
exists if there are only a finite number of possible flood heights. 

A special case of Neyman and Pearson’s “Problem of Similar Regions” [8] which is actually 
equivalent to the “Problem of the Nile” is the “Bisection Problem”: given a set and n probability
measures on it, does there always exist a single subset having exactly measure one-half with 
respect to each of the n measures? (More generally, the “Problem of Similar Regions” asks for the 
existence, for each a in [0,1], of a single set having exactly measure a with respect to each 
measure.) 

Another classical bisection problem, but one which involves bisection in a particular way 
(namely by a hyperplane), is that of Ulam’s “Ham Sandwich Problem”: can an ordinary ham
sandwich, consisting of bread, ham, and butter be cut by a plane in such a way that each of the 
three ingredients is cut exactly in half? More generally, can n objects in euclidean n-space always 
be simultaneously bisected by a single hyperplane? The answer is affirmative, and the standard 
proof is an application of the Borsuk-Ulam theorem [1], [2]: 


(1) If f is a continuous map of the sphere in n-dimensional space into (n — 1)-dimensional space 
such that f(—x) = —f(x) for every x, then there is some point on the sphere mapped into 
the origin. 


(For an interesting discussion, historical background, and proofs of the interrelationships among 
these classical division problems, the reader is referred to [2].) 


FIG. 2. 


Solution of the Border Problem. Solutions of the fair-division problems above are generally not 
adequate for solution of the border problem for two topological reasons: first, the pieces so 
determined may be badly disconnected (recall we want each country to receive a single piece); and 
second, the pieces may well not be adjacent to the correct countries (Fig. 2). Moreover, it is not 
always possible to connect the Banach-Knaster pieces to the correct countries (by thin,
nonintersecting roads for example) without destroying either the overall fairness of the division, or the
connectivity of the other pieces. However, a (nonconstructive) cake-cutting result of Dubins and
Spanier can be used to help settle the border problem affirmatively. Using Lyapounov’s convexity 
theorem [2], [6], 


(2) The range of every nonatomic, countably additive, finite dimensional, vector-valued measure 
is convex, 


Dubins and Spanier were able to show there is always a (Borel) partitioning of the cake so that 
each person receives strictly more than his share, provided at least two people value some part of 
the cake differently. 


THEOREM 1 (Dubins and Spanier [2], Corollary 1.2). Let $D$ be a Borel subset of $\mathbb{R}^2$, and suppose
$\mu_1, \ldots, \mu_n$ are nonatomic (Borel) probability measures supported on $D$, with $\mu_i \neq \mu_j$ for some $i \neq j$.
Let $p_i > 0$ with $\sum_{i=1}^{n} p_i = 1$. Then there exists a Borel partition $C_1, \ldots, C_n$ of $D$ such that $\mu_i(C_i) > p_i$
for each $i$.


(The numbers $p_i$ refer to the minimum proportion of the territory to be given to the ith
country; often $p_i = 1/n$ for all $i$.)

It should be noted that the partition guaranteed by Theorem 1 may also fail to solve the border 
problem for the same reasons as the Banach-Knaster procedure may; moreover, the elements of 
the Dubins-Spanier partition can be very complicated Borel sets in general. 

Countries (and the disputed territory $D$) will be identified with open connected regions in $\mathbb{R}^2$.
For a set $A \subset \mathbb{R}^2$, $\bar{A}$ denotes the closure of $A$, $A^{\circ}$ the interior of $A$, and $\partial A$ the boundary
($\bar{A} \setminus A^{\circ}$) of $A$. By a nonatomic measure is meant a Borel measure which assigns measure zero to each
singleton set (single point).


Definition. Open connected subsets $A$ and $B$ of $\mathbb{R}^2$ are adjacent if $\partial A \cap \partial B$ contains an open
arc $\alpha$ (homeomorphic image of (0, 1)) such that $A \cup B \cup \alpha$ is open and connected.


In Figure 1, D is adjacent to A,, A, and A;, but not to A). Intuitively, two connected regions 
are adjacent if they “touch” on an interval, that is, if a single country can be formed from the two 
regions by erasure of some small open arc on their common boundary. That the common
boundary merely contains an open arc is not enough to ensure that this merger can always be accomplished
(and hence that a fair division in the “connected-adjacent” sense exists), as can easily be seen by
looking at such borders as the sin(1/x) curve describes. (However, it is easy to check in the above
definition that if $A \cup B \cup \alpha$ is open, it is automatically connected.) We are now ready to state the
main result; recall that $p_i$ represents the minimum proportion of the disputed territory the ith
country is to receive.


THEOREM 2. Suppose $D, A_1, \ldots, A_n$ are open connected regions in $\mathbb{R}^2$ with $A_i$ adjacent to $D$ for
each $i$. If $\mu_1, \ldots, \mu_n$ are nonatomic probability measures on $D$ and $p_i \ge 0$, $\sum_{i=1}^{n} p_i = 1$, then there exist
disjoint open connected subsets $B_1, \ldots, B_n$ of $D$ with $B_i$ adjacent to $A_i$ for all $i$, $\mu_i(B_i) \ge p_i$ for all $i$,
and with $\bigcup_i \bar{B}_i = \bar{D}$.


Proof of Theorem 2. Without loss of generality, $p_i > 0$ for all $i$.


Case 1. $\mu_1 = \cdots = \mu_n$. Expand $A_1$'s territory into $D$ continuously, taking care to avoid the
boundary of $D$ and to pass continuously through any arcs of positive $\mu_1$ measure, until
$\mu_1(B_1) = \mu_1(\bar{B}_1) = p_1$ (Fig. 3). (This is possible since $\mu_1$ is nonatomic and $D$ is connected.) Now
$D \setminus \bar{B}_1$ is open and connected, and since $\mu_1 = \cdots = \mu_n$, $\mu_j(D \setminus \bar{B}_1) = 1 - p_1$ for all $j = 1, \ldots, n$.
Continue similarly to find $B_2, \ldots, B_{n-1}$, and let $B_n = D \setminus (\bar{B}_1 \cup \cdots \cup \bar{B}_{n-1})$. This completes
Case 1.


Case 2. $\mu_i \neq \mu_j$ for some $i \neq j$. By Theorem 1 there exists a Borel partition $C_1, \ldots, C_n$ of $D$
satisfying $\mu_i(C_i) > p_i$ for all $i = 1, \ldots, n$. Since the $\{\mu_i\}$ are Borel, there exist (disjoint) open balls
$E_1, \ldots, E_k$ in $D$ with $\mu_i(\partial E_j) = 0$ for all $i$ and $j$, and

\[ \mu_i\Bigl( \Bigl( \bigcup_{j=1}^{k} E_j \Bigr) \,\triangle\, C_1 \Bigr) < \varepsilon = \min_{1 \le m \le n} \{ \mu_m(C_m) - p_m \} \quad \text{for all } i = 1, \ldots, n. \]


Since $D$ is open and connected, it is path (piecewise linear) connected and it follows easily that
there are (sausage-shaped) open connected subsets of $D$ adjacent to $A_1$, the closures of which have
arbitrarily small $\mu_i$ measure for all $i$, and which contain all the centers of $E_1, \ldots, E_k$. Let $B_1$ be the
union of one of these sets with $\bigcup_{j=1}^{k} E_j$, which satisfies $\mu_i(B_1 \,\triangle\, C_1) < \varepsilon$ and $\mu_i(\partial B_1) = 0$ for all
$i = 1, \ldots, n$. Then $B_1$ is open and connected, adjacent to $A_1$, and satisfies $\mu_1(B_1) > p_1$, and
$\mu_j(C_j \setminus \bar{B}_1) > p_j$ for all $j = 2, \ldots, n$. (See $B_1$ in Fig. 3.) Now $D \setminus \bar{B}_1$ is open and connected, so
continue in this manner finding $B_2, \ldots, B_{n-1}$ and then let $B_n = D \setminus (\bar{B}_1 \cup \cdots \cup \bar{B}_{n-1})$. It is easy
to see that the sets $B_1, \ldots, B_n$ satisfy the conclusion of the theorem. □


FIG. 3. 


If the measures $\mu_1, \ldots, \mu_n$ fail to be nonatomic, the conclusion of the theorem may fail: the
worst case is that for some point $x \in D$, $\mu_i(\{x\}) = 1$ for all $i$.


Theorem 2 may easily be strengthened in two respects. First, the conclusion holds also for $\mathbb{R}^k$
(or k-dimensional manifolds) if one generalizes the definition of adjacency as “touching” on
(k − 1)-dimensional regions (i.e., $\partial A \cap \partial B$ contains a homeomorphic image of $(0, 1)^{k-1}$ ... ). And
second, the conclusion may also obviously be strengthened to guarantee that the new borders
formed are polygonal. If $\mu_i \neq \mu_j$ for some $i \neq j$, the analog of Theorem 2 corresponding to
Theorem 1 is also easy to prove: there is always a border which gives each country strictly more 
than its fair share. As an immediate corollary to Theorem 2, one may drop all adjacency 
requirements and conclude that a connected cake may be divided so that each person receives a 
single (i.e., connected) piece which is a fair share. 


It would perhaps be of interest to find a practical, constructive method for generating the fair 
border guaranteed by Theorem 2; no such solution is known to the author. 


Other Notions of a Fair Share. That each participant receive a piece which he values at least 
1/n of the total value is certainly not the only criterion for what constitutes a “fair share.” 
Suppose one person receives a piece which he values at exactly 1/n, whereas others get strictly 
more than 1/n according to his and their values. Has the first person received a “fair share’? In 
the border problem, since D is a nonempty open set, it is easy to see that the solution guaranteed 
by Theorem 2 is never unique, and the question arises of whether or not borders exist which are in 
some sense optimal, or at least more fair. Dubins and Spanier addressed this question for
cake-cutting, and offered several different criteria for determining whether one partition is better
than another. 


The first criterion suggested is that the partition $\{A_1, A_2, \ldots, A_n\}$ is better than the partition
$\{B_1, \ldots, B_n\}$ if $\sum_i \mu_i(A_i) > \sum_i \mu_i(B_i)$, and for cake-cutting they proved [2, Theorem 2] that optimal
partitions in this sense always exist. On the other hand, for the border problem it may be seen
from Fig. 2, with $\mu_i$ uniformly distributed on $B_i$ for each $i$, that optimal partitions in this first
sense do not always exist (although clearly “$\varepsilon$-optimal” ones always exist, as can be shown by a
slight modification in the proof of Theorem 2 using [2, Theorem 2] in place of Theorem 1).


A second notion of optimality Dubins and Spanier suggested was the following: “Find a 
partition that maximizes the amount received by the person who gets the least, and, among all 
such partitions, find one which maximizes the amount received by the person who gets next to the 
least, etc.” (for a formal definition, see [2, p. 8]). Again, for cake-cutting, optimal partitions in this 
second sense always exist [2, Cor. 6.10] but for the border problem they do not exist in general as 
can be seen from Fig. 2 again with the uniform distributions given above. However, “$\varepsilon$-optimal”
borders in this second sense also always exist; this time use [2, Cor. 6.10] in the modification of 
the proof of Theorem 2. 


A third notion of optimal partition suggested in [2] is one in which $\mu_i(A_j) = p_j$ for all $i$ and all
$j$, that is, all participants agree that each person received exactly the correct amount. Proving an
assertion of Steinhaus [9], Dubins and Spanier showed that for cake-cutting, optimal partitions in
this third sense also always exist [2, Cor. 1.1], and again it is easy to use their result to modify the
proof of Theorem 2 and conclude that $\varepsilon$-optimal (i.e., $|\mu_i(A_j) - p_j| < \varepsilon$ for all $i$ and $j$) solutions to
the border problem always exist. But whether or not optimal borders (in this third sense) always 
exist is not known to the author, even if D is bounded. The techniques above do not seem to 
work; simply “taking limits” in general destroys both the connectivity and the correct adjacency 
of the pieces. 


One last criterion for comparing borders which are already fair in one of the above measure- 
theoretic senses, a criterion suggested by L. Karlowitz, is that of comparing total lengths of 
borders, and looking for one with minimal length (although some countries with large armies, for 
example, may well prefer long borders). Another look at Fig. 2 shows that fair borders of minimal 
length do not exist in general, but if D is bounded, there are always fair borders of nearly minimal 
length.


THE SLOW CONTINUED FRACTION ALGORITHM
VIA 2 × 2 INTEGER MATRICES


HARRY APPELGATE AND HIRONORI ONISHI
Department of Mathematics, City College of New York, 138th Street and Convent Avenue, 
New York, NY 10031 


Introduction. It is well known that any real number $\alpha$ may be expanded as a simple continued
fraction

\[ \alpha = q_0 + \cfrac{1}{q_1 + \cfrac{1}{q_2 + \cdots}} \]

where the numerators are all 1’s and the $q_n$, called the partial quotients, are integers with $q_n > 0$ for
$n > 0$. The expansion is finite for rational $\alpha$ and infinite for irrational $\alpha$. Since the simple
continued fraction is completely determined by the sequence of partial quotients $q_0, q_1, q_2, \ldots$, it
is customary to use the compact notation

\[ \alpha = [q_0, q_1, q_2, \ldots]. \]


Of considerable interest is the case where the sequence of partial quotients is eventually 
periodic, e.g., 


[1,2,3,5,4,5,4,5,4,...], 


which is abbreviated as $[1, 2, 3, \overline{5, 4}]$. In 1770, Lagrange proved that the simple continued fraction
expansion of a real irrational number is eventually periodic if and only if it is a quadratic
irrational, i.e., $a + b\sqrt{A}$, where $a, b \in \mathbb{Q}$ and $A \in \mathbb{N}$ is not a perfect square.

Less well known than Lagrange’s theorem are a number of results concerning the fine structure
of continued fractions. For example, there is the “palindromic” character of the expansion of $\sqrt{A}$.
This is described in (1.8) below, but the idea is well illustrated by

\[ \sqrt{19} = [4, \overline{2, 1, 3, 1, 2, 8}], \]

in which the periodic part contains the palindrome 2, 1, 3, 1, 2 followed by a term 8 which is twice
the initial term 4. Theorem 1.8 asserts that this happens for any $\alpha = \sqrt{A}$, $A \in \mathbb{N}$, $A \neq n^2$.
Another interesting result is Galois’ theorem (1828) characterizing those $\alpha$ that have purely
periodic simple continued fractions (cf. (1.7) below).

There are also striking connections with Pell’s equation

\[ x^2 - Ay^2 = \pm 1, \quad A \neq n^2. \]

The equation with +1 always has solutions, and the continued fraction of $\sqrt{A}$ shows us how to
find all of them. The equation with −1 has solutions if and only if the period of the continued
fraction of $\sqrt{A}$ is odd. This is described in (1.9) below.

The purpose of this paper is to present these and related results within a conceptual framework 
which, we believe, makes the underlying ideas easy to grasp. Our starting point is Richards’ paper
“Continued fractions without tears” [4]. (However, while partially motivated by his paper, our
paper is self-contained.) Our main innovation is the use of 2 × 2 integer matrices to extend
Richards’ observations, and thus obtain the deeper results outlined above (as well as others).

The fact that there is a connection between 2 × 2 integer matrices and continued fractions is
well known. It arises by considering a fraction x/y as a lattice point (x, y) in the plane. Thus the 
point (x, y) is identified with the inverse slope of the line through the origin and (x, y). (In 
technical terms, (x, y) determines a point of the projective line.) 

We now proceed to the detailed developments. While no single ingredient in this approach is 
new, we believe that the “mix” is, which leads to some new observations (cf. (3.13) through 
(3.18)). Furthermore, we believe that this approach makes the subject conceptual and pretty easy. 
See if you agree. 


Outline. This paper consists of three sections. In §1, we summarize those aspects of the theory
of simple continued fractions which we deal with in terms of 2 × 2 matrices. Seven theorems are
stated, whose proofs will come later in §3. In §2 we describe the slowed down version of the
simple continued fraction algorithm. Also we lay down the foundation for the use of 2 × 2 integer
matrices and conclude with a clarification of the relationship between the slow algorithm and the 
“fast algorithm,” i.e., the classical simple continued fraction algorithm. In §3 we state and prove 
theorems which correspond to the classical theorems in §1. 


1. Continued Fractions.
(1) The basic step in the simple continued fraction algorithm is given by the function

\[ f(\alpha) = \frac{1}{\alpha - q} \]

of a real variable $\alpha$, where $q = q(\alpha)$ is the largest integer $< \alpha$. (Cf. (2) in case you notice a
difference.) A real number $\alpha$ determines an infinite sequence $\alpha_0, \alpha_1, \alpha_2, \ldots$ of real numbers such
that

\[ \alpha_0 = \alpha, \qquad \alpha_n = f(\alpha_{n-1}) \quad \text{for } n > 0. \]

$\alpha_n$ is called the nth complete quotient of $\alpha$. The largest integer $q_n < \alpha_n$ is called the nth partial
quotient of $\alpha$.

(2) Remark for those who are already acquainted with the traditional treatment of simple
continued fractions. Traditionally, $q$ in the basic step is taken to be the largest integer $\le \alpha$, i.e.,
$q = [\alpha]$. If $\alpha$ is an integer and $q = [\alpha]$, then $q = \alpha$ and $f(\alpha)$ is not defined. Our modification
avoids this situation, i.e., $f(\alpha)$ is always defined. With this modified definition of partial quotients
$q_n$, $[q_0, \ldots, q_n]$ should be defined by the recursion

\[ [q_0] = q_0 + 1 \quad \text{and} \quad [q_0, \ldots, q_n] = q_0 + \frac{1}{[q_1, \ldots, q_n]}. \]
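
The modified basic step and the modified bracket recursion can be tried out on rational inputs with exact arithmetic. The Python sketch below is an illustration only: the function names are hypothetical, the step is read as f(α) = 1/(α − q) with q the largest integer strictly less than α, and positive rational inputs are assumed.

```python
from fractions import Fraction
import math

def partial_quotients(alpha, count):
    """First `count` partial quotients of a rational alpha, where q is the
    largest integer strictly less than the current complete quotient."""
    alpha = Fraction(alpha)
    quotients = []
    for _ in range(count):
        q = math.ceil(alpha) - 1      # largest integer < alpha
        quotients.append(q)
        alpha = 1 / (alpha - q)       # basic step; alpha - q lies in (0, 1]
    return quotients

def evaluate(quotients):
    """Modified bracket: [q_0] = q_0 + 1, [q_0, ..., q_n] = q_0 + 1/[q_1, ..., q_n]."""
    value = Fraction(quotients[-1] + 1)
    for q in reversed(quotients[:-1]):
        value = q + 1 / value
    return value

print(partial_quotients(Fraction(7, 3), 5))
print(evaluate([2, 2]), evaluate([2, 2, 0]))
```

For 7/3 the complete quotients are 7/3, 3, 1, 1, ..., so the partial quotients settle at 0 once the complete quotient 1 is reached, and appending trailing zeros does not change the value of the bracket.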


(3) The sequence $q_0, q_1, q_2, \ldots$ of partial quotients of a real number $\alpha$ satisfies the following
condition:
(i) $q_n \ge 0$ for all $n > 0$ and if $q_k = 0$ for some $k > 0$, then $q_n = 0$ for all $n > k$.


THEOREM. Given any infinite sequence $q_0, q_1, q_2, \ldots$ of integers satisfying the condition (i), there
is a unique real number $\alpha$ such that $q_n$ is the nth partial quotient of $\alpha$ for every $n \ge 0$.


(4) Real numbers $\alpha$ and $\beta$ are said to be equivalent, written $\alpha \sim \beta$, if there are integers a,
b, c, d such that

\[ \beta = \frac{a\alpha + c}{b\alpha + d} \quad \text{and} \quad ad - bc = \pm 1. \]


THEOREM. Given real numbers $\alpha$ and $\beta$, $\alpha \sim \beta$ if and only if a complete quotient of $\alpha$ is equal to a
complete quotient of $\beta$ (cf. [3, Theorem 175, p. 142]).


(5) The following theorem is almost trivial.


THEOREM. A real number $\alpha$ is rational if and only if 1 is a complete quotient of $\alpha$, or equivalently,
0 is a partial quotient of $\alpha$.


(6) The continued fraction of a real number $\alpha$ is said to be periodic if $\alpha_{n+l} = \alpha_n$ for some $n \ge 0$
and $l > 0$, where $\alpha_n$ are the complete quotients of $\alpha$. If $\alpha_{n+l} = \alpha_n$ then $\alpha_{n+kl} = \alpha_n$ for all $k > 0$.
In terms of the partial quotients $q_n$, (the continued fraction of) $\alpha$ is periodic if and only if the
sequence $q_n, q_{n+1}, \ldots, q_{n+l-1}$ repeats indefinitely for some $n \ge 0$ and $l > 0$. By (5), a rational
number is trivially periodic.


LAGRANGE’S THEOREM. The continued fraction of an irrational real number $\alpha$ is periodic if and
only if $\alpha$ is a quadratic irrational (cf. Theorem 177, p. 144, [3]).


(7) The continued fraction of a real number $\alpha$ is said to be purely periodic if $\alpha_l = \alpha$ for some
$l > 0$, i.e., periodic from the very beginning.


GALOIS’ THEOREM. A quadratic irrational $\alpha$ is purely periodic if and only if $\alpha > 1$ and $-1 < \alpha' < 0$,
where $\alpha'$ is the conjugate of $\alpha$ (cf. p. 102, [2]).


(8) Let $A$ be a positive nonsquare integer. The continued fraction of $\alpha = \sqrt{A}$ is periodic.
$\alpha_1 = (\sqrt{A} - q_0)^{-1}$ is purely periodic by (7). Let $l$ be the least positive integer such that $\alpha_{l+1} = \alpha_1$, so
that the sequence $q_1, \ldots, q_l$ repeats.


THEOREM (Palindrome Effect). Under the situation above for $\alpha = \sqrt{A}$,

\[ q_l = 2q_0 \quad \text{and} \quad q_n = q_{l-n} \quad \text{for } 0 < n < l. \]


(9) Let $A$ be a positive nonsquare integer and consider the Pell equation

\[ x^2 - Ay^2 = \pm 1, \]

where $x$ and $y$ are integers. Let $q_n$ be the partial quotients of $\alpha = \sqrt{A}$ and define sequences
$a_0, a_1, a_2, \ldots$ and $b_0, b_1, b_2, \ldots$ by

\[ a_n = q_n a_{n-1} + a_{n-2} \quad \text{with } a_0 = q_0 \text{ and } a_{-1} = 1, \]
\[ b_n = q_n b_{n-1} + b_{n-2} \quad \text{with } b_1 = q_1 \text{ and } b_0 = 1. \]


THEOREM. Let $l$ be the least positive integer such that $\alpha_{l+1} = \alpha_1$. Then for every $k > 0$,

\[ a_{kl-1}^2 - A\, b_{kl-1}^2 = (-1)^{kl}. \]

Moreover, $(x, y) = (a_{kl-1}, b_{kl-1})$ gives all solutions of the Pell equation such that $x > y > 0$. In
particular, solutions with −1 occur if and only if $l$ is odd.
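
Items (8) and (9) can be checked numerically. The Python sketch below computes the period of the continued fraction of √A with the standard integer recurrence for quadratic surds (a technique swapped in here for convenience, not taken from this paper; for irrational numbers the traditional and modified partial quotients coincide), and then forms the pair (a_{kl−1}, b_{kl−1}) from the recurrences above. The function names are hypothetical, and b_{−1} = 0 is used as a starting value so that b_1 = q_1.

```python
from math import isqrt

def sqrt_period(A):
    """Return (q_0, [q_1, ..., q_l]) for sqrt(A), A a positive nonsquare integer."""
    q0 = isqrt(A)
    if q0 * q0 == A:
        raise ValueError("A must not be a perfect square")
    m, d, q = 0, 1, q0
    period = []
    # Standard recurrence for the complete quotients (sqrt(A) + m) / d;
    # the period ends at the first partial quotient equal to 2 * q_0.
    while q != 2 * q0:
        m = d * q - m
        d = (A - m * m) // d
        q = (q0 + m) // d
        period.append(q)
    return q0, period

def pell_from_period(A, k=1):
    """Return (a, b, a*a - A*b*b) with a = a_{kl-1}, b = b_{kl-1}."""
    q0, period = sqrt_period(A)
    l = len(period)
    a_prev, a = 1, q0       # a_{-1}, a_0
    b_prev, b = 0, 1        # b_{-1}, b_0
    for n in range(1, k * l):
        q = period[(n - 1) % l]
        a_prev, a = a, q * a + a_prev
        b_prev, b = b, q * b + b_prev
    return a, b, a * a - A * b * b

print(sqrt_period(19))
print(pell_from_period(19), pell_from_period(2), pell_from_period(2, k=2))
```

For A = 19 the period 2, 1, 3, 1, 2, 8 has even length and shows the palindrome of (8); for A = 2 the period has length 1, so the first pair solves the equation with −1.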


2. The Slow Algorithm and 2 × 2 Matrices.
(1) The basic step of the slow algorithm is given by the function

\[ g(\alpha) = \begin{cases} \alpha - 1 & \text{if } \alpha > 1, \\ 1 & \text{if } \alpha = 1, \\ \dfrac{\alpha}{1 - \alpha} & \text{if } \alpha < 1 \end{cases} \]

of a positive real variable $\alpha$ just as the basic step of the fast algorithm is given by $f(\alpha)$ of (1.1).
The significance of this will become clear soon (cf. (8) and (18)).
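
A small experiment shows how the slow step behaves on rational inputs. In the Python sketch below the third branch is read as α/(1 − α), which is an assumption about the damaged display; the function names are hypothetical. The orbit of a positive rational is followed until it reaches the fixed point 1, and the lengths of the consecutive runs of "subtract 1" steps and of "α/(1 − α)" steps are recorded.

```python
from fractions import Fraction
from itertools import groupby

def slow_step(alpha):
    """One step of the slow algorithm for a positive rational alpha."""
    if alpha > 1:
        return alpha - 1
    if alpha == 1:
        return Fraction(1)
    return alpha / (1 - alpha)   # assumed form of the branch for alpha < 1

def slow_orbit(alpha, max_steps=1000):
    """Iterate slow_step from alpha until the fixed point 1 is reached."""
    alpha = Fraction(alpha)
    orbit = [alpha]
    while alpha != 1 and len(orbit) <= max_steps:
        alpha = slow_step(alpha)
        orbit.append(alpha)
    return orbit

def run_lengths(alpha):
    """Lengths of consecutive runs of steps of the same kind along the orbit."""
    kinds = [x > 1 for x in slow_orbit(alpha)[:-1]]
    return [len(list(group)) for _, group in groupby(kinds)]

print(slow_orbit(Fraction(3, 10)))
print(run_lengths(Fraction(3, 10)), run_lengths(Fraction(7, 3)))
```

In these two examples the run lengths, 3, 2 for 3/10 and 2, 2 for 7/3, agree with the nonzero modified partial quotients of (1.1) after the leading q_0 = 0 in the first case, which hints at the relationship between the slow and fast algorithms.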

(2) We are going to use matrices as tools to facilitate computation. We give an explanation of this in the next three paragraphs. (If you already know what it means for an element of the projective linear group $PL_2(\mathbb{R})$ to act on the projective line $\mathbb{P}$, you may skip these paragraphs and go to (6).) We assume that the reader is familiar with the action of $2 \times 2$ real matrices on the vectors in $\mathbb{R}^2$:

$$\begin{pmatrix} a & c \\ b & d \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} ax + cy \\ bx + dy \end{pmatrix}.$$

Note that we use column vectors

$$(x, y)^t = \begin{pmatrix} x \\ y \end{pmatrix}.$$

(3) Given a real number $\alpha$, a vector $(x, y)^t$ such that $x/y = \alpha$ is said to represent $\alpha$. For example, $(\alpha, 1)^t$ represents $\alpha$. Every vector $(x, y)^t$ such that $y \neq 0$ represents a real number, namely $x/y$. A vector of the form $(x, 0)^t$ does not represent a real number. To avoid this inconvenience we let it represent "the point at infinity," denoted by $\infty$, provided $x \neq 0$. Now every nonzero vector in $\mathbb{R}^2$ represents either a real number or $\infty$. Given a nonzero vector $u$ in $\mathbb{R}^2$, let $[u]$ denote the real number or $\infty$ represented by $u$. Note that given nonzero vectors $u$ and $v$ in $\mathbb{R}^2$, $[u] = [v]$ if and only if $u = tv$ for some scalar $t \neq 0$. We sometimes write $u \sim v$ to mean $[u] = [v]$. The union of the set $\mathbb{R}$ of real numbers and the set $\{\infty\}$ will be denoted by $\mathbb{P}$ and we may refer to the elements of $\mathbb{P}$ as points. The set $\mathbb{P}$ is in one-to-one correspondence with the set of straight lines through the origin in the plane. (Cf. (5) for a more geometric description of $\mathbb{P}$.)

(4) Let $GL_2(\mathbb{R})$ denote the group of $2 \times 2$ nonsingular real matrices. For any $A \in GL_2(\mathbb{R})$, $u \in \mathbb{R}^2$ and $t \in \mathbb{R}$,

$$A(tu) = t(Au).$$

Thus given nonzero vectors $u$ and $v$ in $\mathbb{R}^2$, if $u \sim v$, then $Au \sim Av$. Thus for any $A \in GL_2(\mathbb{R})$ and $\alpha \in \mathbb{P}$, $A\alpha$ makes sense; if $\alpha$ is represented by $u \in \mathbb{R}^2$, then $A\alpha$ is represented by $Au$, i.e.,

$$A[u] = [Au].$$

For example, with $e = (1, 1)^t$, $A1 = [Ae]$. If $\alpha \neq \infty$, then since $(\alpha, 1)^t$ represents $\alpha$,

$$\begin{pmatrix} a & c \\ b & d \end{pmatrix} \alpha = \frac{a\alpha + c}{b\alpha + d} \quad (= \infty \text{ if } b\alpha + d = 0).$$

On the other hand, since $(1, 0)^t$ represents $\infty$,

$$\begin{pmatrix} a & c \\ b & d \end{pmatrix} \infty = \frac{a}{b} \quad (= \infty \text{ if } b = 0).$$

In this way a matrix $A \in GL_2(\mathbb{R})$ is a map from $\mathbb{P}$ to $\mathbb{P}$. As maps on $\mathbb{P}$, $A$ and $tA$ for any scalar $t \neq 0$ are the same. Since for us the matrices are tools and we use them as maps on $\mathbb{P}$, we shall not make any distinction between $A$ and $tA$ for any scalar $t \neq 0$.
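The action just described is easy to mirror in code for rational points. This is a minimal sketch under stated assumptions: a matrix is stored as a pair of rows `((a, c), (b, d))`, the point at infinity is the string `"inf"`, and the helper names `point`, `vector` and `act` are illustrative, not notation from the text.

```python
from fractions import Fraction

INF = "inf"  # label chosen here for the point at infinity

def point(u):
    """Return [u], the point represented by a nonzero vector u = (x, y)."""
    x, y = u
    if x == 0 and y == 0:
        raise ValueError("the zero vector represents no point")
    return INF if y == 0 else Fraction(x, y)

def vector(alpha):
    """Return an integer vector representing a rational point or INF."""
    if alpha == INF:
        return (1, 0)
    alpha = Fraction(alpha)
    return (alpha.numerator, alpha.denominator)

def act(A, alpha):
    """Apply A = ((a, c), (b, d)) to the point alpha via A[u] = [Au]."""
    (a, c), (b, d) = A
    x, y = vector(alpha)
    return point((a * x + c * y, b * x + d * y))
```

With `A = ((3, -2), (1, -1))`, `act(A, INF)` is 3 and `act(A, 0)` is 2, the images of the two column vectors, and a scalar multiple of `A` gives the same map.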

(5) Since geometric intuition is going to be very helpful in the sequel, let us look at $\mathbb{P}$ more geometrically. Given a nonzero vector $(x, y)^t$, consider the line through the origin and $(x, y)$ and then consider the point

$$(\cos\theta, \sin\theta) = \left( \frac{\pm x}{\sqrt{x^2 + y^2}}, \frac{\pm y}{\sqrt{x^2 + y^2}} \right)$$

on the unit circle, where $0 \le \theta < \pi$ and hence the sign $\pm$ is such that $\pm y > 0$ or $y = 0$ and $\pm x > 0$. (See Fig. 1.) This gives a representation of $\mathbb{P}$ as the semicircle

$$\{(\cos\theta, \sin\theta) \mid 0 \le \theta < \pi\}.$$

Under this representation of $\mathbb{P}$, a real number $\alpha$ is represented by the point

$$(\cos\theta, \sin\theta) = \left( \frac{\alpha}{\sqrt{\alpha^2 + 1}}, \frac{1}{\sqrt{\alpha^2 + 1}} \right)$$

and $\infty$ is represented by $(1, 0)$. This representation of $\mathbb{P}$ as a semicircle is geometrically and intuitively simple but it is not entirely satisfactory because there is an abrupt discontinuity at $(1, 0)$. To remedy this, consider the point

$$(\cos 2\theta, \sin 2\theta)$$

on the unit circle with $\theta$ as before. This closes the gap and we now have a representation of $\mathbb{P}$ as the entire unit circle. Since

$$\cos 2\theta = 1 - 2\sin^2\theta \quad \text{and} \quad \sin 2\theta = 2\cos\theta \sin\theta,$$

$$(\cos 2\theta, \sin 2\theta) = \left( \frac{x^2 - y^2}{x^2 + y^2}, \frac{2xy}{x^2 + y^2} \right).$$

[Fig. 1: the point $(\cos\theta, \sin\theta)$ determined by the vector $(x, y)$. Fig. 2: the point $(\cos 2\theta, \sin 2\theta)$ on the line through $(0, \alpha)$ and $(1, 0)$.]

This representation of $\mathbb{P}$ as the entire unit circle is more wholesome but the intuitive direct relationship between a vector $(x, y)^t$ and the corresponding point is somewhat lost. We shall use this representation in the sequel. The point $\infty$ is represented by $(1, 0)$ and a real number $\alpha$ is represented by

$$\left( \frac{\alpha^2 - 1}{\alpha^2 + 1}, \frac{2\alpha}{\alpha^2 + 1} \right).$$

Since

$$\alpha = \cot\theta = \frac{\cos\theta}{\sin\theta} = \frac{\sin 2\theta}{1 - \cos 2\theta},$$

the point $(\cos 2\theta, \sin 2\theta)$ on the unit circle which represents $\alpha$ is also the intersection of the unit circle with the line through $(0, \alpha)$ and $(1, 0)$ as shown in Fig. 2. We shall identify $\mathbb{P}$ with the unit circle under this correspondence. Thus for example, $\infty = (1, 0)$ and $3 = (4/5, 3/5)$.

(6) Given $A \in GL_2(\mathbb{R})$, let $(A)$ denote the set of $\alpha \in \mathbb{P}$ such that $A^{-1}\alpha$ is a positive real number. For example, for the identity $I$, $(I)$ is the set of positive real numbers in $\mathbb{P}$ or equivalently, $(I)$ is the open arc interval on the unit circle from $(1, 0)$ to $(-1, 0)$ counter-clockwise. (See Fig. 3.) In general,

$$(A) = \{\alpha \in \mathbb{P} \mid A^{-1}\alpha \in (I)\}.$$

Since $A$ is one-to-one on $\mathbb{P}$, we also have that

$$(A) = A(I) = \{A\alpha \mid \alpha \in (I)\}.$$

The set $(A)$ is an open arc interval on the unit circle whose endpoints are represented by the column vectors of $A$. For example, if

$$A = \begin{pmatrix} 3 & -2 \\ 1 & -1 \end{pmatrix} = (a, b), \quad a = \begin{pmatrix} 3 \\ 1 \end{pmatrix}, \quad b = \begin{pmatrix} -2 \\ -1 \end{pmatrix},$$

then $(A)$ is the open interval from

$$[a] = \left( \tfrac{4}{5}, \tfrac{3}{5} \right) = 3 \quad \text{to} \quad [b] = \left( \tfrac{3}{5}, \tfrac{4}{5} \right) = 2$$

clockwise. (See Fig. 4.) Since $A^{-1}\alpha = (\alpha - 2)/(\alpha - 3)$, in $\mathbb{R}$,

$$(A) \cap \mathbb{R} = \{\alpha \in \mathbb{R} \mid \alpha < 2 \text{ or } \alpha > 3\}.$$

In this example, $\det A < 0$. If $\det A > 0$, then the interval is from $[a]$ to $[b]$ counter-clockwise, where $A = (a, b)$. It is clear that for any $A$ and $B \in GL_2(\mathbb{R})$,

$$(BA) = B(A).$$

[Fig. 3: the interval $(I)$. Fig. 4: the interval $(A)$ of the example, with endpoints 2 and 3. Fig. 5: the intervals $(P)$ and $(Q)$.]

We let $[A]$ denote the closure of $(A)$, i.e., $(A)$ together with the endpoints of $(A)$.


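(7) Put

$$P = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \quad \text{and} \quad Q = \begin{pmatrix} 1 & 0 \\ 1 & 1 \end{pmatrix}.$$

The intervals $(P)$ and $(Q)$ are shown in Fig. 5. In particular, $(I)$ is the disjoint union of $(P)$, $(Q)$ and 1. For any $A \in GL_2(\mathbb{R})$, if $A = (a, b)$, then

$$AP = (a, Ae) \quad \text{and} \quad AQ = (Ae, b),$$

where $e = (1, 1)^t$. Thus $(A)$ is the disjoint union of $(AP)$, $(AQ)$ and $A1$.

(8) For a real number $\alpha$,

$$P^{-1}\alpha = \alpha - 1 \quad \text{and} \quad Q^{-1}\alpha = (\alpha^{-1} - 1)^{-1} = \frac{\alpha}{1 - \alpha}.$$

Thus the basic step $g(\alpha)$ of the slow algorithm is given by

$$g(\alpha) = \begin{cases} P^{-1}\alpha & \text{if } \alpha \in (P), \\ 1 & \text{if } \alpha = 1, \\ Q^{-1}\alpha & \text{if } \alpha \in (Q). \end{cases}$$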


(9) We shall refer to the special matrices $P$ and $Q$ as the letters. Products of letters are called words. We include the identity $I$ as the trivial word. Note that the entries of a word are nonnegative integers and its determinant is $+1$.

(10) Consider a word $L_1 \cdots L_n$, where the $L_i$ are letters. The interval $(L_1 \cdots L_n)$ is obtained by successive dichotomy of intervals

$$(I) \supset (L_1) \supset (L_1 L_2) \supset \cdots \supset (L_1 \cdots L_n)$$

starting from $(I)$ (cf. (7)). For example, the sequence leading to $(PQP)$ is shown in Fig. 6. Thus we conclude that every word has a unique spelling. This observation is fundamental and will be used without explicit reference to it.

[Fig. 6: the successive intervals $(P)$, $(PQ)$, $(PQP)$.]
(11) Put

$$S = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.$$

Then $\det S = -1$, $(S) = (I)$, $S^2 = I$, $SPS = Q$ and $SQS = P$ ($SLS = L^t$ for a letter $L$). It follows that if $X$ is an $n$-letter word, then so is $SXS$. ($SL_1L_2S = SL_1S\,SL_2S$, etc.) Given $\alpha$ and $\beta \in \mathbb{P}$, we say that $\alpha$ is congruent to $\beta$, in symbols $\alpha \equiv \beta$, if $\alpha = \beta$ or $S\alpha = 1/\alpha = \beta$. $S\alpha$ is the reflection of $\alpha$ about the line $x = 0$. Given $A$ and $B \in GL_2(\mathbb{R})$, we say that $A$ is congruent to $B$, in symbols $A \equiv B$, if $A = B$ or $AS = B$. Since $(S) = (I)$, if $A \equiv B$, then $(A) = (B)$.

(12) A rational number or $\infty$ in $\mathbb{P}$ is represented by a vector having rational components and is referred to as a rational point of $\mathbb{P}$. A rational point is always represented by a vector with integer components. A vector with integer components is said to be primitive if the components are relatively prime. A rational point is represented by a unique primitive vector $(x, y)^t$ such that $y > 0$ or $y = 0$ and $x > 0$.

(13) Let $GL_2(\mathbb{Z})$ denote the group of $2 \times 2$ integer matrices of determinant $\pm 1$. If $A \in GL_2(\mathbb{Z})$, then the column vectors of $A$ are primitive. Recall that $A = -A$ as maps on $\mathbb{P}$.


LEMMA. Given $A$ and $B \in GL_2(\mathbb{Z})$, $(A) = (B)$ iff $A \equiv B$.


Proof. If $A \equiv B$, then $(A) = (B)$ as noted in (11). Conversely, suppose that $(A) = (B)$. Put $A = (a, b)$ and $B = (c, d)$. If $\det(AB) = +1$, i.e., $\det A$ and $\det B$ have the same sign, then $[a] = [c]$ and $[b] = [d]$ (cf. (6)). If $\det(AB) = -1$, then $[a] = [d]$ and $[b] = [c]$. Since $a, b, c, d$ are primitive, we get that $A = B$ or $A = BS$. (Recall $A = -A$.)


(14) THEOREM 1. Given $A \in GL_2(\mathbb{Z})$, $(A) \subset (I)$ if and only if $A$ is congruent to a word. In particular, if $(A) \subset (I)$ and $\det A = +1$, then $A$ is a word.


Proof. If $A$ is congruent to a word, then the entries of $A$ are $\ge 0$ and since $\infty \notin (A)$, $(A) \subset (I)$. Conversely, suppose that $(A) \subset (I)$. Then the entries of $A$ are $\ge 0$. From the condition that $\det A = \pm 1$, it is easily verified that if $A = (a, b) \neq I$, then there is no letter $L$ such that $[a] \in (L)$ and $[b] \in (L^t)$. Thus if $A \neq I$, then $(A) \subset (L)$, i.e., $(L^{-1}A) \subset (I)$ for some letter $L$. The largest entry of $L^{-1}A$ is smaller than that of $A$. Thus by induction on the size of entries, we conclude that $L^{-1}A$ is congruent to a word. Then $A$ is congruent to a word. The second statement is clear because if $X$ is a word, then $\det X = +1$ and $\det(XS) = -1$.
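The induction in this proof is effectively an algorithm for reading off the spelling of a word. The sketch below assumes matrices stored as pairs of rows `((a, c), (b, d))`; the function names are illustrative. At each step exactly one letter can be stripped from the left: $P$ when the top row dominates the bottom row entrywise, $Q$ otherwise.

```python
P = ((1, 1), (0, 1))
Q = ((1, 0), (1, 1))
I = ((1, 0), (0, 1))

def mul(A, B):
    """Product of two 2 x 2 matrices stored as tuples of rows."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def word(spelling):
    """Multiply out a string such as "PQP" into its matrix."""
    M = I
    for letter in spelling:
        M = mul(M, P if letter == "P" else Q)
    return M

def spell(A):
    """Recover the spelling of a word A (nonnegative entries, determinant +1)."""
    (a, c), (b, d) = A
    if min(a, b, c, d) < 0 or a * d - b * c != 1:
        raise ValueError("not a word")
    letters = []
    while (a, c, b, d) != (1, 0, 0, 1):
        if a >= b and c >= d:
            # (A) lies in (P): replace A by P^{-1}A (row 1 minus row 2)
            letters.append("P")
            a, c = a - b, c - d
        else:
            # (A) lies in (Q): replace A by Q^{-1}A (row 2 minus row 1)
            letters.append("Q")
            b, d = b - a, d - c
    return "".join(letters)
```

Each step lowers the entries, so the loop stops at the identity, and `spell(word(s))` returns `s` for any string of letters.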

(15) Given $\alpha \in \mathbb{P}$, it is easy to find $A \in GL_2(\mathbb{Z})$ such that $\alpha \in (A)$. For example, if $\alpha \in \mathbb{R}$ and $k$ is an integer $< \alpha$, then $\alpha \in (P^k)$. By a pair $(\alpha, A)$, we shall mean a pair consisting of $\alpha \in \mathbb{P}$ and $A \in GL_2(\mathbb{Z})$ such that $\alpha \in (A)$.

(16) A pair $(\alpha, A)$ determines an infinite sequence $\alpha_0, \alpha_1, \alpha_2, \ldots$ of points such that

$$\alpha_0 = A^{-1}\alpha, \qquad \alpha_n = L_n^{-1}\alpha_{n-1} \quad \text{for } n > 0,$$

where $L_n$ is $P$ or $Q$ or $I$ according as $\alpha_{n-1} \in (P)$ or $\alpha_{n-1} \in (Q)$ or $\alpha_{n-1} = 1$. We refer to $\alpha_n$ as the $n$th tail of the pair $(\alpha, A)$. The sequence $L_1, L_2, L_3, \ldots$ will be referred to as the spelling of the pair $(\alpha, A)$. If $X$ is the $n$-letter word such that $\alpha \in (AX)$, then

$$\alpha_n = X^{-1}A^{-1}\alpha.$$
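(17) If 1 is a tail of $(\alpha, A)$, then clearly $\alpha$ is rational. Conversely:

THEOREM 2. If $\alpha$ is rational, then for every $A \in GL_2(\mathbb{Z})$ such that $\alpha \in (A)$, 1 is a tail of $(\alpha, A)$.


Proof. $\alpha_0 = A^{-1}\alpha$ is rational and is represented by a primitive vector $(a, b)^t$ such that $a > 0$ and $b > 0$. Since

$$P^{-1}\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} x - y \\ y \end{pmatrix} \quad \text{and} \quad Q^{-1}\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} x \\ y - x \end{pmatrix},$$

successive subtraction of the smaller component from the larger gives a word $X$ such that

$$X^{-1}\begin{pmatrix} a \\ b \end{pmatrix} = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \quad \text{i.e.,} \quad X^{-1}\alpha_0 = 1.$$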
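The proof is the subtractive form of the Euclidean algorithm, which can be sketched directly. The code assumes a positive rational given as a `Fraction` and the pair $(\alpha, I)$; the function name is illustrative.

```python
from fractions import Fraction

def spelling_and_tails(alpha):
    """Spelling and tails of (alpha, I) for a positive rational alpha,
    stopping when the tail 1 is reached."""
    alpha = Fraction(alpha)
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    # Fraction keeps numerator and denominator coprime: a primitive vector.
    x, y = alpha.numerator, alpha.denominator
    letters, tails = [], [alpha]
    while x != y:
        if x > y:
            letters.append("P")   # P^{-1}(x, y)^t = (x - y, y)^t
            x -= y
        else:
            letters.append("Q")   # Q^{-1}(x, y)^t = (x, y - x)^t
            y -= x
        tails.append(Fraction(x, y))
    return "".join(letters), tails
```

For $\alpha = 5/3$ the spelling begins $P, Q, P$ and the tails are 5/3, 2/3, 2, 1.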

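(18) We now clarify the relationship between the slow algorithm and the fast algorithm. Let $\alpha$ be a real number $> 1$ and let $\beta_n$ be the $n$th complete quotient of $\alpha$, to distinguish it from the tail $\alpha_n$ of $(\alpha, I)$. Since $\alpha > 1$, $\alpha \in (P)$ and $\alpha_1 = P^{-1}\alpha$. Let $q_0$ be the largest integer $< \alpha$. Then $q_0$ is the largest integer such that $\alpha \in (P^{q_0})$ and

$$\alpha_n = P^{-n}\alpha \quad \text{for } 0 \le n \le q_0.$$

Put $A_1 = P^{q_0}S$. Then

$$\beta_1 = \frac{1}{\alpha - q_0} = A_1^{-1}\alpha.$$

Since $\alpha \notin (P^{q_0+1})$, $\alpha_{q_0} \notin (P)$. If $\alpha_{q_0} = 1$, then $\beta_1 = 1$ and $\alpha_n = 1$ for all $n \ge q_0$ and $\beta_n = 1$ for all $n \ge 1$. If $\alpha_{q_0} \neq 1$, then $\alpha_{q_0} \in (Q)$ and $S\alpha_{q_0} \in (P)$ and $\beta_1 > 1$. Let $q_1$ be the largest integer $< \beta_1$. Then $q_1$ is the largest integer such that $S\alpha_{q_0} \in (P^{q_1})$, or equivalently, $\alpha_{q_0} \in (Q^{q_1})$ and

$$\alpha_{q_0+n} = Q^{-n}\alpha_{q_0} \quad \text{for } 0 \le n \le q_1.$$

Put $A_2 = A_1P^{q_1}S = P^{q_0}Q^{q_1}$. Then

$$\frac{1}{\beta_1 - q_1} = \beta_2 = A_2^{-1}\alpha.$$

Continuing this, we arrive at the following conclusion: if

$$\underbrace{P, \ldots, P}_{q_0}, \underbrace{Q, \ldots, Q}_{q_1}, \underbrace{P, \ldots, P}_{q_2}, Q, \ldots$$

is the spelling of $(\alpha, I)$, then $q_0, q_1, q_2, \ldots$ is the sequence of partial quotients of $\alpha$ and with $p_n = q_0 + \cdots + q_{n-1}$,

$$\beta_n = A_n^{-1}\alpha \equiv \alpha_{p_n}.$$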

3. Theorems. 

(1) The spelling of a pair $(\alpha, A)$ satisfies the following condition:

(i) if $L_k = I$ for some $k > 0$, then $L_n = I$ for all $n \ge k$, and for any $k > 0$, not all $L_n$ for $n \ge k$ are the same letter.

This corresponds to the condition (i) of (1.3).


THEOREM 3. Given $A \in GL_2(\mathbb{Z})$ and an infinite sequence $L_1, L_2, L_3, \ldots$, where $L_n = P$, $Q$ or $I$, satisfying the condition (i), there is a unique $\alpha \in (A)$ such that $L_1, L_2, L_3, \ldots$ is the spelling of $(\alpha, A)$.

Proof. Put $X_n = L_1 \cdots L_n$ and consider the sequence of points $AX_n1 \in (AX_n)$. Since $(AX_{n+1}) \subset (AX_n)$,

$$\alpha = \lim_{n \to \infty} AX_n1$$

exists. Clearly, $\alpha \in (A)$ and $\alpha_n = X_n^{-1}A^{-1}\alpha$ is the $n$th tail of $(\alpha, A)$ for every $n \ge 0$ and such a point $\alpha$ is unique.


(2) LEMMA. If $\alpha \in (A) \cap (B)$, where $A$ and $B \in GL_2(\mathbb{Z})$, then there are words $X$ and $Y$ such that $AX \equiv BY$ and $\alpha \in [BY]$.


Proof. First suppose that $B = I$. If $\alpha = A1$, then since $\alpha \in (I)$, $(AL) \subset (I)$ for some letter $L$ and $\alpha$ is an endpoint of $(AL)$. If $\alpha \neq A1$, then $\alpha \in (AL)$ for some letter $L$. Repeating this, we arrive at a word $X$ such that $(AX) \subset (I)$ and $\alpha \in [AX]$. By (2.14), $AX \equiv Y$ for some word $Y$. In case $B \neq I$, consider the point $B^{-1}\alpha \in (B^{-1}A) \cap (I)$. We have words $X$ and $Y$ such that $B^{-1}AX \equiv Y$ and $B^{-1}\alpha \in [Y]$. Multiplying by $B$, we get that $AX \equiv BY$ and $\alpha \in [BY]$.

(3) Points $\alpha$ and $\beta \in \mathbb{P}$ are said to be equivalent, and written $\alpha \sim \beta$, if $\alpha = C\beta$ for some $C \in GL_2(\mathbb{Z})$. For $\alpha$ and $\beta \in \mathbb{R}$, this coincides with the relation $\sim$ introduced in (1.4).

(4) Pairs $(\alpha, A)$ and $(\beta, B)$ are said to be equivalent, and written $(\alpha, A) \sim (\beta, B)$, if a tail of $(\alpha, A)$ is congruent to a tail of $(\beta, B)$. For any pair $(\alpha, A)$ and for any $B \in GL_2(\mathbb{Z})$,

$$(\alpha, A) \sim (B\alpha, BA).$$

In fact, these pairs have identical tails.

LEMMA. The relation $\sim$ on the pairs is an equivalence relation.


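Proof. Only the transitivity needs verification. Suppose that $(\alpha, A) \sim (\beta, B)$ and $(\beta, B) \sim (\gamma, C)$, say $\alpha_k \equiv \beta_l$ and $\beta_m \equiv \gamma_n$. We may assume that $l \le m$. Then $\beta_m = X^{-1}\beta_l$ for some word $X$. $\gamma_n = S^i\beta_m$ and $\beta_l = S^j\alpha_k$, where $i$ and $j$ are 0 or 1. Thus

$$\gamma_n = S^iX^{-1}S^j\alpha_k.$$

If $j = 0$, then $X^{-1}\alpha_k = \alpha_r$ and $\gamma_n \equiv \alpha_r$ for some $r$. If $j = 1$, then $SXS$ is a word and $SX^{-1}S\alpha_k = \alpha_r$ and $\gamma_n \equiv \alpha_r$ for some $r$. Thus $(\alpha, A) \sim (\gamma, C)$.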
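(5) THEOREM 4. If $\alpha \in (A) \cap (B)$, where $A$ and $B \in GL_2(\mathbb{Z})$, then $(\alpha, A) \sim (\alpha, B)$.


Proof. If $\alpha$ is rational, then 1 is a tail of both $(\alpha, A)$ and $(\alpha, B)$ by (2.17). Assume that $\alpha$ is irrational. By (2), $\alpha \in (AX) = (BY)$ for some words $X$ and $Y$. $X^{-1}A^{-1}\alpha$ is a tail of $(\alpha, A)$ and $Y^{-1}B^{-1}\alpha$ is a tail of $(\alpha, B)$. Since $AX \equiv BY$, $X^{-1}A^{-1}\alpha \equiv Y^{-1}B^{-1}\alpha$ and $(\alpha, A) \sim (\alpha, B)$.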

(6) Given $\alpha$ and $\beta \in \mathbb{P}$, if $(\alpha, A) \sim (\beta, B)$ for some $A$ and $B \in GL_2(\mathbb{Z})$ such that $\alpha \in (A)$ and $\beta \in (B)$, then clearly $\alpha \sim \beta$. Conversely, if $\alpha \sim \beta$, then $(\alpha, A) \sim (\beta, B)$ for every $A$ and $B \in GL_2(\mathbb{Z})$ such that $\alpha \in (A)$ and $\beta \in (B)$. In fact, if $\beta = C\alpha$, $C \in GL_2(\mathbb{Z})$, then by (5),

$$(\alpha, A) \sim (C\alpha, CA) \sim (\beta, B).$$

This result implies the theorem of (1.4).

(7) The theorem of (1.5) follows from (2.17).

(8) A pair $(\alpha, A)$ is said to be periodic if $\alpha_{n+l} = \alpha_n$ for some $n \ge 0$ and $l > 0$, where $\alpha_n$ are the tails of $(\alpha, A)$. In view of (2.18), it is clear that for $\alpha \in \mathbb{R}$, $(\alpha, A)$ is periodic if and only if the continued fraction of $\alpha$ is periodic.


LEMMA. If $\alpha_{n+l} = S\alpha_n$, then $\alpha_{n+2l} = \alpha_n$.


Proof. Let $X$ be the $l$-letter word such that $\alpha_{n+l} = X^{-1}\alpha_n$. Then $Y = SXS$ is an $l$-letter word and $\alpha_{n+l} = S\alpha_n \in (SX) = (Y)$. Thus

$$\alpha_{n+2l} = Y^{-1}\alpha_{n+l} = Y^{-1}S\alpha_n = SX^{-1}\alpha_n = \alpha_n.$$


(9) If $\alpha$ is a rational point, then $(\alpha, A)$ is trivially periodic for any $A \in GL_2(\mathbb{Z})$ such that $\alpha \in (A)$ (cf. (2.17)). In the rest, we assume that $\alpha$ is irrational; in particular, $\alpha \in \mathbb{R}$. If $(\alpha, A)$ is periodic, then $\alpha$ is a quadratic irrational. In fact, suppose that $\alpha_{n+l} = \alpha_n$, $l > 0$. If $X$ is the word such that $\alpha_{n+l} = X^{-1}\alpha_n$, then $X\alpha_{n+l} = \alpha_{n+l}$ and $\alpha_{n+l}$ is quadratic. Since $\alpha \sim \alpha_{n+l}$, $\alpha$ is quadratic also.


(10) LEMMA. If $\alpha$ is quadratic, then there is $C \in GL_2(\mathbb{Z})$ such that $C\alpha = \alpha$, $C \neq I$ and $\det C = +1$.


Proof. A self-contained proof of this lemma will be given later (cf. (20)). For now we give a proof which makes use of a result from algebraic number theory and may be skipped. Let $U$ be the $\mathbb{Z}$-module generated by $\alpha$ and 1 and $K = \mathbb{Q}(\alpha)$ and consider the coefficient ring

$$O_U = \{x \in K \mid xU \subset U\}.$$

This ring contains a positive unit $\lambda \neq 1$ (cf. Theorem 5, p. 112, [1]). Put

$$\lambda\alpha = a\alpha + c, \qquad \lambda = b\alpha + d,$$

where $a, b, c, d \in \mathbb{Z}$. Then

$$C = \begin{pmatrix} a & c \\ b & d \end{pmatrix} \in GL_2(\mathbb{Z})$$

and $C\alpha = \alpha$. Since $\lambda \neq 1$, $C \neq I$. If $\det C = -1$, use $C^2$; since $\lambda^2 \neq 1$, $C^2 \neq I$.


(11) THEOREM 5. If $\alpha$ is quadratic, then $(\alpha, A)$ is periodic for any $A \in GL_2(\mathbb{Z})$ such that $\alpha \in (A)$ (cf. (1.6)).


Proof. Considering $\alpha_0 = A^{-1}\alpha$, we may assume that $A = I$. By (10), take $C \in GL_2(\mathbb{Z})$ such that $C\alpha = \alpha$, $C \neq I$ and $\det C = +1$. Since $\alpha = C\alpha \in (C)$, $\alpha \in (I) \cap (C)$. Thus by (2), there are words $X$ and $Y$ such that $\alpha \in (X) = (CY)$. Since $\det C = +1$, $X = CY$. Since $\alpha \in (X)$, $X^{-1}\alpha$ is a tail of $(\alpha, I)$. Since

$$Y^{-1}\alpha = Y^{-1}C^{-1}\alpha = X^{-1}\alpha \in (I),$$

$Y^{-1}\alpha$ is a tail of $(\alpha, I)$ also. If $X$ and $Y$ have the same length, then $X = Y$ and $C = I$. Thus $X$ and $Y$ have different lengths, say one has $n$ letters and the other has $n + l$ letters, $l > 0$. Then $\alpha_{n+l} = \alpha_n$. Thus $(\alpha, I)$ is periodic.


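(12) A positive quadratic irrational $\alpha$ is said to be slowly purely periodic if $(\alpha, I)$ is purely periodic, i.e., $\alpha_l = \alpha$ for some $l > 0$. For example, for $\alpha = \sqrt{2}$, we have

$$\begin{pmatrix} \sqrt{2} \\ 1 \end{pmatrix} \xrightarrow{P} \begin{pmatrix} \sqrt{2} - 1 \\ 1 \end{pmatrix} \xrightarrow{Q} \begin{pmatrix} \sqrt{2} - 1 \\ 2 - \sqrt{2} \end{pmatrix} \sim \begin{pmatrix} 1 \\ \sqrt{2} \end{pmatrix} = S\begin{pmatrix} \sqrt{2} \\ 1 \end{pmatrix},$$

where $u \xrightarrow{L} v$ means that $L^{-1}u = v$ for $u$ and $v \in \mathbb{R}^2$. Thus $\alpha_2 = S\alpha$ and $\alpha_4 = \alpha$ and hence $\sqrt{2}$ is slowly purely periodic. If $\alpha$ is slowly purely periodic, then any nontrivial word $A$ such that $A\alpha = \alpha$ is referred to as a word for $\alpha$. For example $PQQP$ is the shortest word for $\sqrt{2}$. A word for $\alpha$ contains both $P$ and $Q$ (cf. (1)). If $\alpha$ is purely periodic (in the fast algorithm), then $\alpha$ is slowly purely periodic. But the converse is false. For example, $\sqrt{2}$ is not purely periodic.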


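(13) THEOREM 6. An irrational $\alpha$ is slowly purely periodic if and only if $\alpha$ is a quadratic irrational such that $\alpha > 0$ and $\alpha' < 0$, where $\alpha'$ is the conjugate of $\alpha$. If $\alpha$ is slowly purely periodic and $A = L_1 \cdots L_l$ is a word for $\alpha$, then $A^{op} = L_l \cdots L_1$ is a word for $-\alpha'$.


Proof. We may assume that $\alpha > 0$. Let $L_n$ be the letter such that $\alpha_n = L_n^{-1}\alpha_{n-1}$. First suppose that $\alpha$ is slowly purely periodic, so that $\alpha_l = \alpha$ for some $l > 0$. Put

$$A = L_1 \cdots L_l = \begin{pmatrix} a & c \\ b & d \end{pmatrix}.$$

Then $A\alpha = \alpha$ and we get that

$$b\alpha^2 - (a - d)\alpha - c = 0.$$

Thus $\alpha$ is a quadratic irrational and $\alpha\alpha' = -c/b$. Since $-c/b < 0$, $\alpha' < 0$.

Conversely, suppose that $\alpha$ is a quadratic irrational and $\alpha' < 0$. Let

$$h(t) = at^2 - bt - c$$

be a quadratic polynomial such that $h(\alpha) = 0$, where $a, b, c \in \mathbb{Z}$ and $a > 0$. Since $-c/a = \alpha\alpha' < 0$, $c > 0$. Put

$$C = \begin{pmatrix} b & 2c \\ 2a & -b \end{pmatrix} \quad \text{and} \quad \Delta = b^2 + 4ac.$$

Then $\det C = -\Delta$ and $C\alpha = \alpha$. Put $C_1 = L_1^{-1}CL_1$. Then $\det C_1 = -\Delta$ and $C_1\alpha_1 = \alpha_1$. We compute that

$$P^{-1}CP = \begin{pmatrix} b - 2a & -2h(1) \\ 2a & -b + 2a \end{pmatrix} \quad \text{and} \quad Q^{-1}CQ = \begin{pmatrix} b + 2c & 2c \\ 2h(1) & -b - 2c \end{pmatrix}.$$

If $L_1 = P$, then $\alpha > 1$ and $h(1) < 0$. If $L_1 = Q$, then $\alpha < 1$ and $h(1) > 0$. Thus the off-diagonal entries of $C_1$ are $> 0$ and hence $\alpha_1' < 0$ also. Repeating the same, we get that if

$$C_n = L_n^{-1}C_{n-1}L_n = \begin{pmatrix} b_n & 2c_n \\ 2a_n & -b_n \end{pmatrix},$$

then $C_n\alpha_n = \alpha_n$ and

(1) $a_n > 0$, $c_n > 0$ and $\det C_n = -\Delta$.

In particular, $\alpha_n' < 0$. Since there are only a finite number of integer matrices $C_n$ satisfying the condition (1), $C_{n+l} = C_n$ for some $n \ge 0$ and $l > 0$ ($C_0 = C$). Then $\alpha_{n+l}$ and $\alpha_n$ are the positive root of the same quadratic equation and hence $\alpha_{n+l} = \alpha_n$. Thus $\alpha_{n+l} = \alpha_n$ for some $n \ge 0$ and $l > 0$. (If we use (11), this part of the proof is unnecessary. But we wanted a proof which does not depend on the lemma in (10).)

Since $\alpha_n = L_n^{-1}\alpha_{n-1}$, $\alpha_n' = L_n^{-1}\alpha_{n-1}'$. Put

$$J = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix} \quad \text{and} \quad \beta_n = J\alpha_n'.$$

Since $\alpha_n' < 0$, $\beta_n > 0$. Since $L_n^tJL_n = J$,

$$(L_n^t)^{-1}\beta_n = (L_n^t)^{-1}J\alpha_n' = (L_n^t)^{-1}JL_n^{-1}\alpha_{n-1}' = J\alpha_{n-1}' = \beta_{n-1}.$$

This says that $\beta_n \in (L_n^t)$ and $\beta_{n-1}$ is the first tail of $(\beta_n, I)$. Since $\alpha_{n+l} = \alpha_n$, $\alpha_{n+l}' = \alpha_n'$ and $\beta_{n+l} = \beta_n$. Thus $L_{n+l}^t = L_n^t$ and $L_{n+l} = L_n$, and $\beta_{n+l-1} = \beta_{n-1}$ and hence $\alpha_{n+l-1} = \alpha_{n-1}$. Repeating this, we get that $\alpha_l = \alpha$. Thus $\alpha$ is slowly purely periodic.

We have shown that $L_l^t \cdots L_1^t$ is a word for $\beta = \beta_0$. Thus $L_l \cdots L_1 = SL_l^t \cdots L_1^tS$ is a word for $S\beta = SJ\alpha' = -\alpha'$.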

(14) Theorem 6 corresponds to Galois’ Theorem of (1.7). The idea of the proof is the same (cf. 
p. 102, [2]); a slight simplification occurs because the basic step in the slow algorithm is simpler. 
In any case, the proof does not depend on the lemma in (10). Theorem 6 is the first of the new 
observations mentioned in the introduction. It brings out a neat, finer structure. Other new 
observations are consequences of this and are noted in (16), (17) and (18) below. 

(15) Let $A \in GL_2(\mathbb{Z})$ and $\det A = +1$. $A$ is a word if and only if $A$ has nonnegative entries (cf. (2.14)). $A$ is a word containing both $P$ and $Q$ if and only if $A$ has positive entries. Given $A \in GL_2(\mathbb{Z})$, put

$$A^* = SAS.$$

If $A = L_1 \cdots L_l$ is a word, then since $L^* = L^t$,

$$A^* = L_1^t \cdots L_l^t = (L_l \cdots L_1)^t = (A^{op})^t,$$

and hence

$$A^{op} = A^{*t} = A^{t*}.$$

$A^{*t} = A$ if and only if $AS$ is symmetric and $A^t = A$ if and only if $A$ is symmetric. Thus for a word $A$, we have

(i) $A^{op} = A$ if and only if $AS$ is symmetric,

(ii) $A^{op} = A^*$ if and only if $A$ is symmetric.

Note that if $A^{op} = A^*$, then $A$ has an even length. (If $A$ has an odd length, look at the letter in the middle.)


(16) COROLLARY 1. Let $\alpha$ be a quadratic irrational such that $\alpha > 0$ and $\alpha' < 0$ and let $A$ be a word for $\alpha$. The word $A$ is a palindrome, i.e., $A^{op} = A$, if and only if $\alpha' = -\alpha$, i.e., $\alpha = \sqrt{\Delta}$ for some positive nonsquare rational number $\Delta$. If so, then

$$A = \begin{pmatrix} a & b\Delta \\ b & a \end{pmatrix}$$

for some integers $a > b > 0$.

This is an immediate consequence of Theorem 6. Here are some examples:


    Δ      The shortest word for √Δ
    2      PQQP
    3      PQP
    5      PPQQQQPP
    3/5    QPPPQ


Corollary 1 gives the theorem of (1.8).
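The words in this table can be recomputed with integer arithmetic only, following the conjugation step from the proof of Theorem 6. This is a sketch under stated assumptions: the input is a triple $(a, b, c)$ with $a, c > 0$ and nonsquare discriminant, standing for the positive root of $h(t) = at^2 - bt - c$, and the function name `word_for` is an illustrative choice. The sign of $h(1)$ decides the letter, and the new triple is read off from $P^{-1}CP$ or $Q^{-1}CQ$.

```python
from math import isqrt

def word_for(a, b, c):
    """Shortest word for the positive root of a*t^2 - b*t - c, assuming
    a > 0, c > 0 (so the conjugate root is negative) and an irrational root."""
    disc = b * b + 4 * a * c
    if a <= 0 or c <= 0 or isqrt(disc) ** 2 == disc:
        raise ValueError("need a > 0, c > 0 and a nonsquare discriminant")
    start = (a, b, c)
    letters = []
    while True:
        h1 = a - b - c                 # h(1); its sign compares the root with 1
        if h1 < 0:                     # root > 1: letter P, root -> root - 1
            letters.append("P")
            a, b, c = a, b - 2 * a, -h1
        else:                          # root < 1: letter Q, root -> root/(1 - root)
            letters.append("Q")
            a, b, c = h1, b + 2 * c, c
        if (a, b, c) == start:         # first return to the starting root
            return "".join(letters)
```

Here $\sqrt{2}$ is entered as `word_for(1, 0, 2)` and $\sqrt{3/5}$ as `word_for(5, 0, 3)`. By Theorem 6 the loop returns to the starting triple, so it terminates for every admissible input.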

(17) Other interesting conclusions can be drawn from Theorem 6. Let $\alpha$ be a quadratic irrational such that $\alpha > 0$ and $\alpha' < 0$ and let $A$ be a word for $\alpha$. If $-\alpha' = B^{-1}\alpha$ for some word $B$ shorter than $A$, then $A = BC$ for some word $C$ and $CB$ is a word for $-\alpha'$ and hence

$$CB = A^{op} = C^{op}B^{op}$$

and both $B$ and $C$ are palindromes. As a special case, if $k = \alpha + \alpha'$ is a positive integer, then $B = P^k$ and $A = P^kC$ and $C$ is a palindrome. For example,

$$A = PPPQQP \quad \text{for } \alpha = 1 + \sqrt{6}.$$


(18) COROLLARY 2. Let $A$ be a nontrivial word. $A^{op} = A^*$ if and only if there is a positive nonsquare rational number $\Delta$ such that $AS\sqrt{\Delta} = \sqrt{\Delta}$. If so, then

$$A = \begin{pmatrix} b\Delta & a \\ a & b \end{pmatrix}$$

for some integers $a > b > 0$.


Proof. Put $A = L_1 \cdots L_l$. Given $\Delta$, let $\alpha = \sqrt{\Delta}$ and suppose that $AS\alpha = \alpha$. Then $\alpha_l = A^{-1}\alpha = S\alpha$ and $\alpha_{2l} = \alpha$. The spelling of $(S\alpha, I)$ is $L_1^t, \ldots, L_l^t, \ldots$. Thus $AA^*$ is a word for $\alpha$ and

$$(AA^*)^{op} = A^{*op}A^{op} = AA^*$$

by (16) and hence $A^{op} = A^*$. Conversely, suppose that $A^{op} = A^*$. Then $AA^*$ is a palindrome containing both $P$ and $Q$. Thus $AA^*$ is a word for $\sqrt{\Delta}$ for some $\Delta$ by (16) and $L_1^t, \ldots, L_l^t, \ldots$ is the spelling of $(\alpha_l, I)$. Thus $\alpha_l = A^{-1}\alpha = S\alpha$. The statement about the form of $A$ is clear.

(19) Let $\Delta$ be a positive nonsquare rational number and $\alpha = \sqrt{\Delta}$. Let $C$ be the shortest word such that $C^{-1}\alpha \equiv \alpha$, i.e., $CS^i\alpha = \alpha$, $i = 0$ or 1. If $C\alpha = \alpha$, then there is no word $A$ such that $AS\alpha = \alpha$ and every word for $\alpha$ is a power of $C$. Suppose that $CS\alpha = \alpha$. Then $CC^*$ is the shortest word for $\alpha$ and every word for $\alpha$ is a power of $CC^*$ and every word $A$ such that $AS\alpha = \alpha$ is of the form $(CC^*)^kC$ for some $k \ge 0$.

If $A$ is a nontrivial word such that $AS^i\alpha = \alpha$, then by (16) and (18),

$$AS^i = \begin{pmatrix} x & \Delta y \\ y & x \end{pmatrix}$$

for some integers $x > y > 0$ and

$$x^2 - \Delta y^2 = \det(AS^i) = (-1)^i.$$

Conversely, let $x > y > 0$ be integers such that

$$x^2 - \Delta y^2 = (-1)^i.$$

Put

$$A = \begin{pmatrix} x & \Delta y \\ y & x \end{pmatrix} S^i.$$

Then $AS^i\alpha = \alpha$ and $\det A = +1$. Thus by (16) and (18),

$$A = C^k \text{ or } (CC^*)^k \text{ or } (CC^*)^kC.$$

This tells us how to find all solutions of the Pell equation (cf. (1.9)).
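As an illustration of this recipe, the sketch below multiplies out the shortest word for $\sqrt{D}$, for a positive nonsquare integer $D$, and reads a solution of $x^2 - Dy^2 = 1$ off the resulting matrix. The function names are assumptions made here, and the spelling is generated from the triple $(a, b, c)$ of $h(t) = at^2 - bt - c$ as in the proof of Theorem 6.

```python
from math import isqrt

def word_for_sqrt(D):
    """Shortest word for sqrt(D), D a positive nonsquare integer."""
    if D <= 0 or isqrt(D) ** 2 == D:
        raise ValueError("D must be a positive nonsquare integer")
    a, b, c = 1, 0, D                  # sqrt(D) is the positive root of t^2 - D
    start = (a, b, c)
    letters = []
    while True:
        h1 = a - b - c
        if h1 < 0:                     # current tail > 1: letter P
            letters.append("P")
            b, c = b - 2 * a, -h1
        else:                          # current tail < 1: letter Q
            letters.append("Q")
            a, b = h1, b + 2 * c
        if (a, b, c) == start:
            return "".join(letters)

def pell(D):
    """Positive solution (x, y) of x^2 - D*y^2 = 1 read off the word for sqrt(D)."""
    x, Dy, y, x2 = 1, 0, 0, 1          # running product, rows (x, Dy) and (y, x2)
    for letter in word_for_sqrt(D):
        if letter == "P":              # right-multiply by P: column 2 += column 1
            Dy, x2 = Dy + x, x2 + y
        else:                          # right-multiply by Q: column 1 += column 2
            x, y = x + Dy, y + x2
    return x, y
```

For $D = 2$ the word $PQQP$ multiplies out to the matrix with rows $(3, 4)$ and $(2, 3)$, giving $3^2 - 2 \cdot 2^2 = 1$.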
(20) Finally, we present a self-contained proof of the lemma in (10). Let $\alpha$ be a quadratic irrational and

$$h(t) = at^2 - bt - c$$

be a quadratic polynomial such that $h(\alpha) = 0$, where $a, b, c \in \mathbb{Z}$ and $a > 0$. Its discriminant $\Delta = b^2 + 4ac$ is a positive nonsquare integer. Put

$$C = \begin{pmatrix} b & 2c \\ 2a & -b \end{pmatrix}.$$

Then $C\alpha = \alpha$. Take positive integers $x$ and $y$ such that $x^2 - \Delta y^2 = 1$ by (19) and put

$$A = xI + yC.$$

Then $A\alpha = \alpha$ and $\det A = x^2 - \Delta y^2 = 1$. Clearly $A \neq I$. Thus $A$ is a desired matrix in $GL_2(\mathbb{Z})$.


References 


1. Z. I. Borevich and I. R. Shafarevich, Number Theory, Academic Press, New York, 1966. 

2. Harold Davenport, The Higher Arithmetic, Harper Torchbooks, Harper and Row, New York, 1960. 

3. G. H. Hardy and E. M. Wright, An Introduction to the Theory of Numbers, 3rd ed., Oxford Univ. Press, New 
York, 1954. 

4. Ian Richards, Continued fractions without tears, Math. Mag., 54 (4) (1981) 163-171. 


MISCELLANEA 
109. . The Good Old Days— Half a Century Ago 


The preparation of the manuscript has been carried on by correspondence without expense to 
the National Research Council and without special grant for relief from teaching from any of the 
institutions represented. It is hoped that the time saved to readers by the exposition and references 
here contained may justify the time that was sacrificed to the preparation of this report. 


THE COMMITTEE 


—Albert A. Bennett, William E. Milne, and Harry Bateman, Numerical integration of differential 
equations, Bulletin of the National Research Council, 92 (1933). 


AN INTERESTING CANTOR SET 


W. A. COPPEL 
Department of Mathematics, IAS, Australian National University, G.P.O. Box 4, Canberra, A.C.T. 2601, 
Australia 


1. Introduction. Cantor sets are all the same topologically, since any two are homeomorphic. 
But the classical Cantor set also enjoys a number of properties which are not preserved under 
homeomorphism. These properties extend in part to the Cantor set described in this paper. I was 
led to its definition by a study of recent work of Feigenbaum [2], [3]. However, it is of interest in
its own right and will be presented from this point of view here. The discussion illustrates a variety 
of significant topics, such as the notion of dimension, Fourier—Stieltjes transforms, 2-adic integers, 
and ergodic theory. 


2. The Two-Ratio Cantor Set. We define a Cantor set C to be a compact metric space which 
has no isolated point and which has the property that for any two distinct points a, b, there exist 
disjoint closed sets A, B with union C containing a, b respectively. A compact subset of the real 
line is a Cantor set if and only if it has no isolated point and contains no interval. 

Let $r_1, r_2$ be positive numbers with sum less than 1. We construct a Cantor set $C = C(r_1, r_2)$ in the following way. Put

$$E_0 = [0, 1],$$
$$E_1 = [0, r_1] \cup [1 - r_2, 1],$$
$$E_2 = [0, r_1^2] \cup [r_1(1 - r_2), r_1] \cup [1 - r_2, 1 - r_2(1 - r_2)] \cup [1 - r_1r_2, 1].$$

In general, $E_k$ is a union of $2^k$ disjoint compact intervals. If $[a, b]$ is a typical interval in $E_k$, the intervals in $E_{k+1}$ are given by $[r_1a, r_1b]$ and $[1 - r_2b, 1 - r_2a]$. It should be observed that if we choose $c$ so that $r_1 < c < 1 - r_2$, then all intervals $[r_1a, r_1b]$ lie to the left of $c$ and all intervals $[1 - r_2b, 1 - r_2a]$ lie to the right of $c$. By induction we easily see that $E_{k+1} \subset E_k$ and that $E_k$ has Lebesgue measure $(r_1 + r_2)^k$. More precisely, if the lengths of the $m = 2^k$ intervals in $E_k$ are, from left to right, $d_1, \ldots, d_m$, then the lengths of the $2m$ intervals in $E_{k+1}$, also from left to right, are

$$r_1d_1, r_2d_1, r_2d_2, r_1d_2, \ldots, r_1d_{m-1}, r_2d_{m-1}, r_2d_m, r_1d_m.$$

Thus if we set

$$C = \bigcap_{k=0}^{\infty} E_k,$$

then $C$ is a nonempty compact set of Lebesgue measure zero. A point belongs to $C$ if and only if it is either an endpoint of an interval of some set $E_k$ or a limit of such endpoints. It follows readily that $C$ is a Cantor set.
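The construction can be reproduced exactly with rational ratios. The following sketch assumes `Fraction` values for $r_1, r_2$ and uses illustrative function names; it builds the intervals of $E_k$ from left to right by applying the two rules $[a, b] \mapsto [r_1a, r_1b]$ and $[a, b] \mapsto [1 - r_2b, 1 - r_2a]$.

```python
from fractions import Fraction

def next_level(intervals, r1, r2):
    """Intervals of E_{k+1}, left to right, from those of E_k."""
    left = [(r1 * a, r1 * b) for a, b in intervals]
    # The second rule reverses order, so traverse E_k from right to left.
    right = [(1 - r2 * b, 1 - r2 * a) for a, b in reversed(intervals)]
    return left + right

def level(k, r1, r2):
    """Intervals of E_k as (left endpoint, right endpoint) pairs."""
    intervals = [(Fraction(0), Fraction(1))]
    for _ in range(k):
        intervals = next_level(intervals, r1, r2)
    return intervals

def measure(intervals):
    """Total length of a list of disjoint intervals."""
    return sum(b - a for a, b in intervals)
```

With $r_1 = r_2 = 1/3$, `level(2, ...)` returns the four intervals of the second stage of the classical Cantor set, and in general `measure(level(k, r1, r2))` equals $(r_1 + r_2)^k$.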

For $r_1 = r_2 = \frac{1}{3}$ we obtain the original set of Cantor himself. The more general case $r_1 = r_2 = r$ is of some interest in harmonic analysis and is discussed, for example, in Kahane and Salem [6].

We note first that the set $C = C(r_1, r_2)$ has Hausdorff dimension $\alpha$, where $\alpha$ is the unique root between 0 and 1 of the equation

(1) $\quad r_1^{\alpha} + r_2^{\alpha} = 1.$


The author is a Professorial Fellow at the Institute of Advanced Studies, Australian National University, 
Canberra. His only degree, B.A., was obtained from Melbourne University in 1950. He has published research on 
various aspects of the theory of ordinary differential equations, iteration, and mathematical control theory. His 
article on J. B. Fourier in this MONTHLY, 76 (1969) 468-483, received a 1970 Lester R. Ford award. 
The definition of Hausdorff measure is given by Kahane and Salem, and a proof may be modelled on one given there for the case $r_1 = r_2 = r$. In that case $\alpha = \log 2/\log r^{-1}$ can be expressed in closed form. The result is also contained in Theorem II of Moran [7].
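When $r_1 \neq r_2$ the root of (1) has no closed form in general, but it is easy to approximate. A minimal sketch using bisection follows; the function name and tolerance are illustrative assumptions. The left side of (1) decreases from 2 at exponent 0 to $r_1 + r_2 < 1$ at exponent 1, so the root is bracketed by $[0, 1]$.

```python
from math import log

def hausdorff_dimension(r1, r2, tol=1e-12):
    """Approximate the root in (0, 1) of r1**x + r2**x = 1 by bisection."""
    if not (r1 > 0 and r2 > 0 and r1 + r2 < 1):
        raise ValueError("need r1, r2 > 0 with r1 + r2 < 1")
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if r1 ** mid + r2 ** mid > 1:   # still above 1: the root is larger
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2
```

For $r_1 = r_2 = 1/3$ the result agrees with $\log 2/\log 3$ to within the tolerance.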

There are other reasonable definitions of dimension besides that of Hausdorff. We show next 
that one proposed by Besicovitch and Taylor [1] gives the same value in the present case, although 
in general the two definitions do not agree. 

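In forming $E_1$ we exclude from $E_0$ an open interval of length $\tau$. In forming $E_2$ we exclude from $E_1$ two open intervals of lengths $r_1\tau$ and $r_2\tau$. In general, in forming $E_{k+1}$ we exclude from $E_k$ $2^k$ open intervals of which one has length $r_1^k\tau$, a further $k$ have length $r_1^{k-1}r_2\tau$, another $k(k-1)/2$ have length $r_1^{k-2}r_2^2\tau$, ..., and one has length $r_2^k\tau$. For any $\beta$ such that $0 < \beta \le 1$ the sum of the $\beta$th powers of the lengths of these intervals is

(2) $\quad \tau^{\beta} \sum_{j=0}^{k} \binom{k}{j} r_1^{(k-j)\beta} r_2^{j\beta} = \tau^{\beta}\eta^k,$

where we have put $\eta = r_1^{\beta} + r_2^{\beta}$. Thus the sum of the $\beta$th powers of the lengths of all excluded intervals is

$$\sigma_{\beta} = \tau^{\beta}(1 + \eta + \eta^2 + \cdots).$$

Hence

(3) $\quad \sigma_{\beta} = \tau^{\beta}(1 - \eta)^{-1}$ if $\eta < 1$, $\quad \sigma_{\beta} = \infty$ if $\eta \ge 1$.

The Besicovitch-Taylor dimension is the infimum $\alpha$ of all $\beta$ for which $\sigma_{\beta}$ is finite and thus satisfies

$$r_1^{\alpha} + r_2^{\alpha} = 1.$$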

Since the Hausdorff and Besicovitch-Taylor dimensions are equal, it follows by a result of Hawkes [4] that the entropy dimension of $C$ also exists and has the same value $\alpha$. The entropy dimension is defined by dividing $[0, 1]$ into $n$ equal subintervals, counting the number $c_n$ of subintervals which contain points of $C$, and taking the limit of $\log c_n/\log n$ as $n \to \infty$.


3. The Corresponding Cantor Function. We are now going to construct a distribution function $\psi$ with support on the Cantor set $C$. Let $I_{jk}$ ($j = 1, \ldots, 2^k - 1$) denote the open intervals complementary to $E_k$, numbered from left to right. Let $\psi_k$ be the unique continuous function on $E_0 = [0, 1]$ such that $\psi_k(0) = 0$, $\psi_k(1) = 1$, $\psi_k$ is linear on each interval of $E_k$, and

$$\psi_k(x) = j2^{-k} \quad \text{if } x \in I_{jk} \quad (j = 1, \ldots, 2^k - 1).$$

Then $\psi_k$ is nondecreasing, $\psi_{k+1} = \psi_k$ on $I_{jk}$ ($j = 1, \ldots, 2^k - 1$), and

$$|\psi_{k+1}(x) - \psi_k(x)| < 2^{-k}$$

for all $x \in E_0$. Hence the sequence $\{\psi_k\}$ converges uniformly on $E_0$. Its limit $\psi$ is a continuous, nondecreasing function such that $\psi(0) = 0$, $\psi(1) = 1$, and $\psi$ is constant on any open interval complementary to any set $E_k$.

This construction can be applied to an arbitrary Cantor set on the real line. However, the special structure of the set $C$ makes it possible to say more. In fact $\psi$ satisfies the functional equations

(4) $\quad \psi(r_1x) = \tfrac{1}{2}\psi(x), \qquad \psi(1 - r_2x) = 1 - \tfrac{1}{2}\psi(x), \qquad \psi[(1 - x)r_1 + x(1 - r_2)] = \tfrac{1}{2} \qquad (0 \le x \le 1).$

The first two relations are easily verified if $x$ is an endpoint of an interval $I_{jk}$, and extend at once to arbitrary $x \in E_0$.
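The equations (4) also give a practical way to evaluate $\psi$: a point in $[0, r_1]$ or $[1 - r_2, 1]$ is pulled back to $[0, 1]$ at the cost of one binary digit, and a point in the middle gap has value $\frac{1}{2}$. The sketch below is an illustration under stated assumptions; the function name and the fixed iteration depth are choices made here, and the result is a floating-point approximation with error at most $2^{-\text{depth}-1}$.

```python
from fractions import Fraction

def psi(x, r1, r2, depth=60):
    """Approximate psi(x) for 0 <= x <= 1 using the functional equations (4).

    Invariant: psi(original x) = value + scale * psi(current x).
    """
    value, scale = 0.0, 1.0
    for _ in range(depth):
        if x <= r1:                    # psi(x) = psi(x / r1) / 2
            x = x / r1
            scale *= 0.5
        elif x >= 1 - r2:              # psi(x) = 1 - psi((1 - x) / r2) / 2
            value += scale
            scale *= -0.5
            x = (1 - x) / r2
        else:                          # x lies in the middle gap
            return value + scale * 0.5
    # psi takes values in [0, 1]; use the midpoint of the remaining range.
    return value + scale * 0.5
```

Passing `Fraction` arguments keeps the point $x$ exact during the pull-backs. For the classical case $r_1 = r_2 = 1/3$ this reproduces the familiar values $\psi(1/3) = 1/2$ and $\psi(1/4) = 1/3$.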

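The functional equations (4) completely characterize $\psi$. In fact if $\psi^*$ is any bounded function which satisfies (4), then the difference $\varphi = \psi - \psi^*$ satisfies

$$\varphi(x) = \begin{cases} \tfrac{1}{2}\varphi(x/r_1) & (0 \le x \le r_1), \\ -\tfrac{1}{2}\varphi[(1 - x)/r_2] & (1 - r_2 \le x \le 1), \\ 0 & (r_1 < x < 1 - r_2). \end{cases}$$

If we denote by $\mu, \mu_1, \mu_2$ the supremum of $|\varphi(x)|$ over the intervals $[0, 1]$, $[0, r_1]$, $[1 - r_2, 1]$ respectively, then $\mu = \max(\mu_1, \mu_2)$. On the other hand $\mu_1 = \tfrac{1}{2}\mu$, $\mu_2 = \tfrac{1}{2}\mu$. It follows that $\mu = 0$.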
There is no difficulty in principle in determining the moments of $\psi$. It is readily shown that

$$\int_0^1 x\,d\psi_k(x) = \tfrac{1}{2}(1 + h + \cdots + h^k),$$

where $h = (r_1 - r_2)/2$. Letting $k \to \infty$ we obtain

(5) $\quad \int_0^1 x\,d\psi(x) = \tfrac{1}{2}(1 - h)^{-1}.$

It may further be shown that

(6) $\quad (2 - r_1^2 - r_2^2)\int_0^1 x^2\,d\psi(x) = 1 - r_2(1 - h)^{-1}.$
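It is also of interest to consider the Fourier-Stieltjes transform

$$\hat{\psi}(\lambda) = \int_0^1 e^{-i\lambda x}\,d\psi(x) \qquad (-\infty < \lambda < \infty).$$

We have

$$\hat{\psi}(r_1\lambda) = \int_0^1 e^{-i\lambda r_1x}\,d\psi(x) = 2\int_0^1 e^{-i\lambda r_1x}\,d\psi(r_1x) = 2\int_0^{r_1} e^{-i\lambda x}\,d\psi(x)$$

and similarly

$$e^{-i\lambda}\hat{\psi}(-r_2\lambda) = 2\int_{1-r_2}^1 e^{-i\lambda x}\,d\psi(x).$$

It follows that $\hat{\psi}$ satisfies the functional equation

(7) $\quad \hat{\psi}(r_1\lambda) + e^{-i\lambda}\hat{\psi}(-r_2\lambda) = 2\hat{\psi}(\lambda).$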


Harmonic analysts may be interested to determine for what values of r, and r, the set C is a set of 
uniqueness. 


4. Connection with the 2-adic Integers. We define a 2-adic integer to be an infinite sequence $\alpha = (a_0, a_1, a_2, \ldots)$, where $a_i = 0$ or 1 for all $i$. If $\beta = (b_0, b_1, b_2, \ldots)$ is another such sequence the sum

$$\alpha + \beta = (c_0, c_1, c_2, \ldots)$$

is defined in the following way. If $a_0 + b_0 < 2$, then $c_0 = a_0 + b_0$, but if $a_0 + b_0 \ge 2$, then $c_0 = a_0 + b_0 - 2$ and we carry 1 to the next position. The terms $c_1, c_2, \ldots$ are successively determined in the same fashion. With this definition of addition the set $J$ of all 2-adic integers is an abelian group.
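Addition with carry is easy to sketch on finite prefixes of the digit sequences. The code below is illustrative only: a 2-adic integer is truncated to a list of its first $n$ digits, least significant first, and the carry out of the last position is dropped, which corresponds to working modulo $2^n$.

```python
def add_2adic(alpha, beta):
    """Add two 2-adic integers given by equally long lists of leading digits."""
    if len(alpha) != len(beta):
        raise ValueError("prefixes must have the same length")
    digits, carry = [], 0
    for a, b in zip(alpha, beta):
        total = a + b + carry
        digits.append(total % 2)       # digit c_i
        carry = total // 2             # carried to the next position
    return digits
```

For example, the prefix `[1, 1, 1, 1]` behaves like $-1$: adding `[1, 0, 0, 0]` to it gives `[0, 0, 0, 0]`.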

We can also define a metric on $J$ by setting $d(\alpha, \alpha) = 0$ and $d(\alpha, \beta) = 2^{-k}$ if $\alpha \neq \beta$ and $k$ is the least integer such that $a_k \neq b_k$. This metric is invariant and nonarchimedean, i.e., for all $\alpha, \beta, \gamma \in J$

$$d(\alpha + \gamma, \beta + \gamma) = d(\alpha, \beta),$$
$$d(\alpha + \beta, 0) \le \max[d(\alpha, 0), d(\beta, 0)].$$

Moreover $J$ is now a compact topological group.

If $\delta = (1, 0, 0, \ldots)$, then the multiples $n\delta$ ($n = 0, 1, 2, \ldots$) consist precisely of all $\alpha = (a_0, a_1, a_2, \ldots)$ with $a_i = 0$ for all large $i$. Hence the semigroup $J_0$ formed by these multiples is dense in $J$.

Now let $u$ and $v$ denote the maps of the unit interval into itself defined by

$$u(x) = r_1 x, \qquad v(x) = 1 - r_2 x \qquad (0 \le x \le 1).$$

Then every endpoint, other than 0 and 1, of an interval of the set $E_k$ can be uniquely represented
in the form

$$w_m \circ \cdots \circ w_1(1),$$

where $w_i = u$ or $v$ for each $i$ and $1 \le m \le k$. For example, $v \circ v \circ u(1)$ represents the endpoint
$1 - r_2(1 - r_1 r_2)$ of $E_3$. To such an endpoint we make correspond the 2-adic integer

$$\alpha = (a_0, a_1, a_2, \ldots),$$

where $a_i = 0$ for $i > m$, $a_m = 1$, and $a_i = 0$ or $1$ according as $w_{m-i} = u$ or $v$ for $0 \le i < m$. To
the endpoints 1 and 0 we make correspond the 2-adic integers $\delta = (1, 0, 0, \ldots)$ and $0 = (0, 0, 0, \ldots)$.
In this way we define a 1-1 map $\omega$ of the set $C_0$ of all endpoints of intervals of the sets $E_k$ onto the
set $J_0$ of all 2-adic integers $\alpha = (a_0, a_1, a_2, \ldots)$ with $a_i = 0$ for all large $i$.
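
The correspondence between endpoints and digit sequences can be sketched as follows. The function name and the encoding of a composite $w_m \circ \cdots \circ w_1$ as a string of the letters `u` and `v`, outermost map first, are assumptions of this illustration; exact rational arithmetic is used so that the endpoint can be compared with the closed form quoted in the text.

```python
from fractions import Fraction


def endpoint_and_digits(word, r1, r2):
    """word lists w_m, ..., w_1 as 'u'/'v' characters, outermost map first.

    Returns the endpoint w_m o ... o w_1 (1) together with the nonzero
    initial digits (a_0, ..., a_m) of the associated 2-adic integer.
    """
    x = Fraction(1)
    for w in reversed(word):  # apply w_1 first and w_m last
        x = r1 * x if w == "u" else 1 - r2 * x
    # a_i is 0 or 1 according as w_{m-i} is u or v, and a_m = 1
    digits = [0 if w == "u" else 1 for w in word] + [1]
    return x, digits


r1, r2 = Fraction(1, 4), Fraction(1, 2)
print(endpoint_and_digits("vvu", r1, r2))
```

For `"vvu"` the computed endpoint agrees with $1 - r_2(1 - r_1 r_2)$, and the empty word returns the endpoint 1 with digits `[1]`, matching the convention that 1 corresponds to $(1, 0, 0, \ldots)$.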

We will show that this map $\omega$ is uniformly continuous. If $D$ is an interval of the set $E_k$, then it
has the form

$$D = w_k \circ \cdots \circ w_1 E_0,$$

where $w_1, \ldots, w_k$ are uniquely determined $u$'s or $v$'s. One endpoint of $D$ is $\xi = w_k \circ \cdots \circ w_1(1)$.
If $w_i = u$ for $1 \le i \le k$, then the other endpoint of $D$ is $\bar\xi = 0$. Otherwise there exists an $h$
$(1 \le h \le k)$ such that $w_h = v$ and $w_i = u$ for all $i < h$, and the other endpoint of $D$ is then

$$\bar\xi = w_k \circ \cdots \circ w_{h+1}(1).$$

If $\alpha = \omega(\xi)$ and $\bar\alpha = \omega(\bar\xi)$, then in any event we have $d(\alpha, \bar\alpha) \le 2^{-k}$. Any point $\xi'$ of $C_0$ in the
interior of $D$ has the form

$$\xi' = w'_l \circ \cdots \circ w'_1(1),$$

where $w'_1, \ldots, w'_l$ are uniquely determined $u$'s or $v$'s and $l > k$. Moreover

$$D' = w'_l \circ \cdots \circ w'_1 E_0 \subset D = w_k \circ \cdots \circ w_1 E_0.$$

Since also

$$D' \subset w'_l \circ \cdots \circ w'_{l-k+1} E_0,$$

we must have $w'_{l-i} = w_{k-i}$ for $0 \le i < k$. If $\alpha' = \omega(\xi')$, it follows that $d(\alpha, \alpha') \le 2^{-k}$.
Put

$$r_0 = \min(r_1, r_2, 1 - r_1 - r_2).$$

Then the distance between any two distinct endpoints of intervals of $E_k$ is at least $r_0^k$. Thus if two
distinct points $\xi_1, \xi_2$ of $C_0$ are distant less than $r_0^k$, then they lie in the same interval $D$ of $E_k$.
Hence, by what we have just shown, the corresponding 2-adic integers $\alpha_1, \alpha_2$ satisfy $d(\alpha_1, \alpha_2) \le
2^{-k}$.

This proves that $\omega$ is uniformly continuous. The inverse map $\omega^{-1}$ is also uniformly continuous.
For if $d(\alpha_1, \alpha_2) < 2^{-k}$, then $\xi_1 = \omega^{-1}(\alpha_1)$ and $\xi_2 = \omega^{-1}(\alpha_2)$ lie in a common interval $D =
w_k \circ \cdots \circ w_1 E_0$ and hence are distant at most $r^k$, where $r = \max(r_1, r_2)$.

It follows that the map $\omega$ admits a unique continuous extension, which we will still denote by
$\omega$, mapping the whole of $C$ into $J$. Moreover $\omega$ must map $C$ onto $J$, since its range is compact and
contains a dense subset of $J$. Since the inverse map also admits a unique continuous extension, $\omega$
is actually a homeomorphism of $C$ onto $J$. If we now define the sum $\xi = \xi_1 \oplus \xi_2$ of two elements
of $C$ by $\omega(\xi) = \omega(\xi_1) + \omega(\xi_2)$, then $C$ acquires the structure of a compact topological abelian
group. Moreover this structure is naturally connected to the definition of the set $C$.

Conversely, the measure $\nu$ on $C$ determined by the distribution function $\psi$ can be transferred to
a measure $\mu$ on $J$. In fact this is precisely the Haar measure on $J$. This may be shown without
difficulty from the explicit form for Haar measure on $J$, given in Hewitt and Ross [5, p. 202].
Returning to $C$, we see that the measure $\nu$ is invariant under the group action just defined. If we
set $T\xi = \xi \oplus 1$, for any $\xi \in C$, then it follows from the ergodic theory of group rotations,
described in Walters [9, pp. 160-162], that for any continuous function $f: C \to \mathbb{R}$

$$\frac{1}{n}\sum_{k=0}^{n-1} f(T^k \xi) \to \int_C f(x)\,d\nu(x) \quad \text{uniformly as } n \to \infty.$$
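
The convergence of these averages can be observed numerically along the orbit of $\xi = 0$, whose $k$-th point corresponds to the 2-adic integer $k\delta$. The sketch below rests on assumptions made for this illustration only: the point of $C$ attached to digits $(a_0, \ldots, a_{K-1}, 0, 0, \ldots)$ is computed as $W_{a_0} \circ \cdots \circ W_{a_{K-1}}(0)$ with $W_0 = u$, $W_1 = v$, and the comparison value $\int x\,d\nu = 1/(2 - r_1 + r_2)$ presumes that $\nu$ is the self-similar measure giving mass $1/2$ to each of the two first-level intervals. Floating-point arithmetic is used, so the agreement is approximate.

```python
def point_of_multiple(n, r1, r2):
    """Point of C corresponding to n * delta, for an integer n >= 0."""
    digits = []
    while n:
        digits.append(n & 1)  # a_0 first
        n >>= 1
    x = 0.0                   # trailing zero digits are u's, and u fixes 0
    for a in reversed(digits):
        x = r1 * x if a == 0 else 1.0 - r2 * x
    return x


def orbit_average(f, r1, r2, n_terms):
    """(1/n) * sum of f over the first n_terms points of the orbit of 0."""
    total = sum(f(point_of_multiple(k, r1, r2)) for k in range(n_terms))
    return total / n_terms


r1, r2 = 0.2, 0.5
print(orbit_average(lambda x: x, r1, r2, 2 ** 12), 1.0 / (2.0 - r1 + r2))
```

For the classical middle-third case $r_1 = r_2 = 1/3$ the same average should come out close to $1/2$, as symmetry suggests.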


It may be noted that the measure $\nu$ is also invariant under the piecewise linear transformation
$S$ of the unit interval defined by

$$Sx = \begin{cases} r_1^{-1}x & \text{for } 0 \le x \le r_1, \\ (1 - r_1 - r_2)^{-1}(x - r_1) & \text{for } r_1 < x < 1 - r_2, \\ r_2^{-1}(1 - x) & \text{for } 1 - r_2 \le x \le 1. \end{cases}$$

In fact the inverse image of $(0, x)$ is the union of the intervals

$$(0, r_1 x), \qquad (r_1, r_1 + (1 - r_1 - r_2)x), \qquad (1 - r_2 x, 1),$$

whose total $\nu$-measure is

$$\psi(r_1 x) + 1 - \psi(1 - r_2 x) = \psi(x).$$

Transformations of this type have been extensively studied in ergodic theory; see, e.g., Parry [8]
and Wilkinson [10].
In this department the MONTHLY presents easily stated unsolved problems dealing with notions ordinarily
encountered in undergraduate mathematics. Each problem should be accompanied by relevant references (if any 
are known to the author) and by a brief description of known partial results. Manuscripts should be sent to 
HOW FEW n-PERMUTATIONS CONTAIN ALL POSSIBLE k-PERMUTATIONS?


PETER J. SLATER 
Mathematics Department, University of Alabama, Huntsville, AL 35899 


If $S_n = \{1, 2, \ldots, n\}$ and $\sigma: S_n \to S_n$ is a permutation, then we can represent $\sigma$ by $(s_1, s_2, \ldots, s_n)$
where $s_i = \sigma(i)$ for $1 \le i \le n$. The collection

$$C = \{(1,2,3,4,5,6),\ (6,5,4,3,2,1),\ (3,6,1,5,4,2),\ (5,4,6,2,1,3),\ (2,1,5,6,3,4),\ (4,3,2,1,6,5)\}$$

of permutations of $S_6$ has the interesting property that for any 2-element subset $\{a, b\}$ of $S_6$ there
are permutations $\tau'$ and $\tau''$ in $C$ with $\tau'(a) = b$ and $\tau'(b) = a$ and with $\tau''(a) = a$ and $\tau''(b) = b$.
In other words, any permutation of a 2-element subset of $S_6$ is "contained in" a member of $C$.
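
The stated property of the collection $C$ is easy to confirm by machine. The sketch below is an illustration only; the function name is hypothetical, and permutations are stored in the one-line form $(s_1, \ldots, s_n)$ used above.

```python
from itertools import combinations

C = [
    (1, 2, 3, 4, 5, 6), (6, 5, 4, 3, 2, 1), (3, 6, 1, 5, 4, 2),
    (5, 4, 6, 2, 1, 3), (2, 1, 5, 6, 3, 4), (4, 3, 2, 1, 6, 5),
]


def contains_all_2_permutations(perms, n):
    """True if every pair {a, b} is both swapped and fixed by some member."""
    for a, b in combinations(range(1, n + 1), 2):
        swapped = any(p[a - 1] == b and p[b - 1] == a for p in perms)
        fixed = any(p[a - 1] == a and p[b - 1] == b for p in perms)
        if not (swapped and fixed):
            return False
    return True


print(contains_all_2_permutations(C, 6))
```

The identity takes care of every "fixed" requirement, and each of the other five members swaps three disjoint pairs, which together account for all 15 pairs.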


Problem. Determine $M(n, k)$, the minimum number $m$ of permutations $\sigma_1, \sigma_2, \ldots, \sigma_m$ of $S_n$
such that, for any $k$-element subset $K$ of $S_n$ and any permutation $\tau$ of $K$, at least one $\sigma_i$ satisfies
$\sigma_i(j) = \tau(j)$ for all $j \in K$.


Trivially, $M(n, n) = n!$ and $M(n, 1) = 1$, and it is easy to see that $M(2p - 1, 2) = M(2p, 2)
= 2p$. However, the computation of $M(n, 3)$ is already quite complex. The values for $n = 3, 4, 5$
and $6$ are 6, 15, 30 and 35.

Consider $M(n, n - 1)$. If $K = S_n - \{h\}$ and $\tau$ is a permutation of $K$, and if $\sigma_i(j) = \tau(j)$ for
$j \in K$, then $\sigma_i(h) = h$ and $\sigma_i$ has a fixed point. In short, a permutation $\sigma$ of $S_n$ is the unique
extension of at least one permutation on $n - 1$ elements of $S_n$ if and only if $\sigma$ has at least one
fixed point. As in Brualdi [1, page 79] a permutation with no fixed points is called a derangement,
and $d_n$ denotes the number of derangements of $S_n$. Then

$$M(n, n - 1) = n! - d_n$$

and the resulting sequence [2, #1423]

1, 1, 4, 15, 76, 455, 3186, 25487, ...

first appeared in a study of the game of Mousetrap [3].
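
The listed terms can be reproduced from the formula $n! - d_n$. The sketch below uses the standard recurrence $d_n = (n - 1)(d_{n-1} + d_{n-2})$ for derangement numbers, which is a well-known fact rather than something stated in the note; the function names are hypothetical.

```python
from math import factorial


def derangements(n):
    """Number of permutations of n points with no fixed point."""
    if n == 0:
        return 1
    prev, cur = 1, 0  # d_0 and d_1
    for m in range(2, n + 1):
        prev, cur = cur, (m - 1) * (cur + prev)
    return cur


def with_fixed_point(n):
    """n! - d_n: permutations of n points having at least one fixed point."""
    return factorial(n) - derangements(n)


print([with_fixed_point(n) for n in range(1, 9)])
```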
It is not true in general that $M(n, n - h)$ is the number of permutations of $S_n$ with at least $h$
points fixed. For example, there are six permutations of $S_4$ with exactly two fixed points and one
with exactly four, but $M(4, 2) = 4$.
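
For very small $n$ and $k$ the quantity $M(n, k)$ can be found by exhaustive search, which gives an independent check of values such as $M(4, 2) = 4$. The sketch below is a naive brute force written for this note (the function name and the 0-based labelling of the points are assumptions); its running time grows far too quickly to attack the open cases.

```python
from itertools import combinations, permutations


def min_cover(n, k):
    """Smallest number of permutations of n points containing every
    permutation of every k-element subset (points are labelled 0..n-1)."""
    perms = list(permutations(range(n)))
    # A target is a k-subset K with a permutation tau of K, stored as the
    # tuple of pairs (j, tau(j)) for j in K.
    targets = set()
    for K in combinations(range(n), k):
        for image in permutations(K):
            targets.add(tuple(zip(K, image)))

    def covered(p):
        return {t for t in targets if all(p[j] == tj for j, tj in t)}

    # Permutations that contain no target can never help, so drop them.
    useful = [c for c in (covered(p) for p in perms) if c]
    for m in range(1, len(useful) + 1):
        for choice in combinations(useful, m):
            if set().union(*choice) == targets:
                return m
    return None


print(min_cover(4, 2), min_cover(3, 2))
```

Both calls are consistent with the formula $M(2p - 1, 2) = M(2p, 2) = 2p$ for $p = 2$.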
SYMMETRY FACTORS FOR DIFFERENTIAL EQUATIONS


LANCE L. LITTLEJOHN 
Department of Mathematics, Computer Science and Systems Design, The University of Texas at San 
Antonio, TX 78285 


1. Introduction. It is well known (e.g., [2, p. 66]) that a second order linear differential equation

(1) $a_2(x)y''(x) + a_1(x)y'(x) + a_0(x)y(x) = 0$

can be made formally self adjoint when it is multiplied by the function

(2) $f(x) = \dfrac{\exp\left(\int^x (a_1(t)/a_2(t))\,dt\right)}{a_2(x)}.$

Indeed, one probably remembers that $f(x)$ is found by solving the first order equation

(3) $a_2(x)f'(x) + (a_2'(x) - a_1(x))f(x) = 0.$

Likewise, it is a familiar fact ([2, p. 45], [4]) that an odd order differential equation with real
coefficients cannot be formally self adjoint. However, the answer to the following is less well
known: can a differential equation of order $2n$ ($n \ge 2$), with real coefficients, be made formally
self adjoint by multiplying it by a suitable function $f(x)$? If so, how does one find this $f(x)$? It is
the purpose of this article to answer these questions.


2. The Main Results. From here on, let $L_{2n}(y) = \sum_{k=0}^{2n} a_k(x)y^{(k)}(x)$ where $a_k$ is real-valued,
$a_k \in C^k(I)$, $a_{2n}(x) \ne 0$ for $x \in I$, where $I$ is some compact interval of the real line $\mathbb{R}$ and $n$ is
some natural number. The Lagrange adjoint of $L_{2n}(y)$, denoted by $L_{2n}^{+}(y)$, is defined by

$$L_{2n}^{+}(y) = \sum_{k=0}^{2n} (-1)^k \left(a_k(x)y(x)\right)^{(k)}.$$

$L_{2n}(y)$ is said to be formally self adjoint or symmetric when $L_{2n}(y) = L_{2n}^{+}(y)$. The function $f(x)$
is a symmetry factor for the differential expression $L_{2n}(y)$ if $f(x)L_{2n}(y)$ is formally self adjoint.
(According to this definition, (2) is a symmetry factor for (1). In this case, (2) is usually called an
integrating factor, but since an integrating factor is usually reserved for a function which makes a
differential expression exact, we coin the term "symmetry factor.")

There are many expressions available for the most general formally self adjoint differential
expression of order $2n$. For example, the reader is referred to [1, p. 204] and [3, p. 125] for two
such representations. However, the one that we use here is a remarkable formula due to H. L.
Krall [4].


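THEOREM 1 (Krall). The most general formally self adjoint differential expression of order 2n with
real coefficients is given by:

$$\sum_{k=0}^{n} b_k(x)\,y^{(2k)}(x) + \sum_{k=0}^{n-1}\sum_{s=k+1}^{n} \binom{2s}{2k+1}\frac{2^{2s-2k} - 1}{s - k}\,B_{2s-2k}\,b_s^{(2s-2k-1)}(x)\,y^{(2k+1)}(x)$$

where $B_{2k}$ is the Bernoulli number defined by:

$$\frac{x}{e^x - 1} = 1 - \frac{x}{2} + \sum_{k=1}^{\infty} \frac{B_{2k}\,x^{2k}}{(2k)!}.$$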

If we replace $b_k$ by $b_{2k}$, switch the order of summation in the second term and then replace $k$ by
$k - 1$ in the second term, we see that the most general formally self adjoint differential expression
of order $2n$ can be written as:

(4) $\displaystyle\sum_{k=0}^{n} b_{2k}(x)\,y^{(2k)}(x) + \sum_{s=1}^{n}\sum_{k=1}^{s} \binom{2s}{2k-1}\frac{2^{2s-2k+2} - 1}{s - k + 1}\,B_{2s-2k+2}\,b_{2s}^{(2s-2k+1)}(x)\,y^{(2k-1)}(x).$


We shall prove the following theorem: 


THEOREM 2. $f(x)L_{2n}(y)$ is formally self adjoint if and only if $f(x)$ simultaneously satisfies the $n$
homogeneous differential equations:

(5) $\displaystyle\sum_{s=k}^{n}\sum_{j=0}^{2s-2k+1} \binom{2s}{2k-1}\binom{2s-2k+1}{j}\frac{2^{2s-2k+2} - 1}{s - k + 1}\,B_{2s-2k+2}\,a_{2s}^{(2s-2k+1-j)}(x)\,f^{(j)}(x) = a_{2k-1}(x)f(x), \qquad k = 1, 2, \ldots, n.$


Observe that when $k = n$, (5) reduces to $na_{2n}(x)f'(x) + (na_{2n}'(x) - a_{2n-1}(x))f(x) = 0$. In particular,
when $n = 1$, we get (3). From this equation, we can determine $f(x)$ up to a nonzero
constant multiple. In view of this, we restate Theorem 2.


THEOREM 3. $f(x)$ is a symmetry factor for $L_{2n}(y)$ if and only if:

(i) $f(x) = \dfrac{\exp\left(\frac{1}{n}\int^x (a_{2n-1}(t)/a_{2n}(t))\,dt\right)}{a_{2n}(x)}$

and
(ii) for $n \ge 2$, $f(x)$ simultaneously satisfies the $(n - 1)$ homogeneous differential equations

$$\sum_{s=k}^{n}\sum_{j=0}^{2s-2k+1} \binom{2s}{2k-1}\binom{2s-2k+1}{j}\frac{2^{2s-2k+2} - 1}{s - k + 1}\,B_{2s-2k+2}\,a_{2s}^{(2s-2k+1-j)}(x)\,f^{(j)}(x) = a_{2k-1}(x)f(x), \qquad \text{for } k = 1, 2, \ldots, (n - 1).$$
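
As a worked step, the closed form in (i) can be recovered from the first order equation $na_{2n}f' + (na_{2n}' - a_{2n-1})f = 0$ obtained from the case $k = n$. The derivation below is a sketch that assumes $a_{2n}(x) \ne 0$ and $f(x) \ne 0$ on $I$; the constant $c$ is an arbitrary nonzero constant introduced here.

```latex
\begin{align*}
n\,a_{2n} f' + \left(n\,a_{2n}' - a_{2n-1}\right) f = 0
  &\iff \frac{f'}{f} = \frac{1}{n}\,\frac{a_{2n-1}}{a_{2n}} - \frac{a_{2n}'}{a_{2n}} \\
  &\iff \log|f(x)| = \frac{1}{n}\int^{x} \frac{a_{2n-1}(t)}{a_{2n}(t)}\,dt - \log|a_{2n}(x)| + \text{const} \\
  &\iff f(x) = c\,\frac{\exp\left(\frac{1}{n}\int^{x} \frac{a_{2n-1}(t)}{a_{2n}(t)}\,dt\right)}{a_{2n}(x)} .
\end{align*}
```

Taking $n = 1$ gives back the classical factor (2) for the second order equation (1).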


Proof of Theorem 2. Suppose $f(x)L_{2n}(y)$ is formally self adjoint. Using (4), the functions
$b_{2k}(x) = f(x)a_{2k}(x)$, $k = 0, 1, \ldots, n$ satisfy

$$f(x)a_{2k-1}(x) = \sum_{s=k}^{n} \binom{2s}{2k-1}\frac{2^{2s-2k+2} - 1}{s - k + 1}\,B_{2s-2k+2}\,b_{2s}^{(2s-2k+1)}(x), \qquad k = 1, 2, \ldots, n,$$

i.e.,

$$a_{2k-1}(x)f(x) = \sum_{s=k}^{n} \binom{2s}{2k-1}\frac{2^{2s-2k+2} - 1}{s - k + 1}\,B_{2s-2k+2}\,\left(f(x)a_{2s}(x)\right)^{(2s-2k+1)}$$
$$= \sum_{s=k}^{n}\sum_{j=0}^{2s-2k+1} \binom{2s}{2k-1}\binom{2s-2k+1}{j}\frac{2^{2s-2k+2} - 1}{s - k + 1}\,B_{2s-2k+2}\,a_{2s}^{(2s-2k+1-j)}(x)\,f^{(j)}(x)$$

where we have used Leibniz's formula. Thus condition (5) holds. To prove the converse, retrace
these steps.
In the case of constant coefficients, (5) simplifies:

$$\sum_{j=0}^{n-k} \binom{2k+2j}{2k-1}\frac{2^{2j+2} - 1}{j + 1}\,B_{2j+2}\,a_{2k+2j}\,f^{(2j+1)}(x) = a_{2k-1}f(x), \qquad k = 1, 2, \ldots, n.$$

From Theorem 3, it is necessary that the symmetry factor be $f(x) = e^{a_{2n-1}x/(na_{2n})}$. Since

$$\frac{d^k f(x)}{dx^k} = \frac{a_{2n-1}^k\,f(x)}{n^k a_{2n}^k},$$

the following theorem is immediate.


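THEOREM 4. Let $L_{2n}(y) = \sum_{k=0}^{2n} a_k y^{(k)}(x)$ where $a_k \in \mathbb{R}$, $k = 0, 1, \ldots, 2n$ and $a_{2n} \ne 0$. Then
$f(x)L_{2n}(y)$ is formally self adjoint if and only if:

(i) $f(x) = e^{a_{2n-1}x/(na_{2n})}$

and
(ii) for $n \ge 2$,

$$\sum_{j=0}^{n-k} \binom{2k+2j}{2k-1}\frac{(2^{2j+2} - 1)\,B_{2j+2}\,a_{2k+2j}\,a_{2n-1}^{2j+1}}{(j + 1)\,n^{2j+1}\,a_{2n}^{2j+1}} = a_{2k-1}, \qquad k = 1, 2, \ldots, (n - 1).$$
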
3. Examples. 
(1) $L(y) = y^{(4)} - 5y^{(3)} + 5y'' + 5y' - 6y$.

By Theorem 4, the symmetry factor is necessarily $f(x) = e^{-5x/2}$. However, condition (ii) of
Theorem 4 is not satisfied so $L(y)$ cannot be made formally self adjoint.
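
Condition (ii) of Theorem 4 is a finite computation with Bernoulli numbers, so it can be checked mechanically. The sketch below is an illustration, not the author's procedure: the function names are hypothetical, the Bernoulli numbers are generated from the standard recurrence $\sum_{j=0}^{m} \binom{m+1}{j} B_j = 0$, and the coefficients of Example (1) are read as $y^{(4)} - 5y^{(3)} + 5y'' + 5y' - 6y$ (the constant term does not enter the condition).

```python
from fractions import Fraction
from math import comb


def bernoulli(m):
    """Bernoulli number B_m (with B_1 = -1/2), computed exactly."""
    B = [Fraction(1)]
    for i in range(1, m + 1):
        B.append(-sum(comb(i + 1, j) * B[j] for j in range(i)) / (i + 1))
    return B[m]


def theorem4_conditions(a):
    """a = [a_0, ..., a_{2n}], constant coefficients with a_{2n} != 0.

    Returns (k, left side of (ii), a_{2k-1}) for k = 1, ..., n - 1.
    """
    n = (len(a) - 1) // 2
    c = Fraction(a[2 * n - 1]) / (n * a[2 * n])  # f(x) = exp(c * x)
    rows = []
    for k in range(1, n):
        lhs = sum(
            comb(2 * k + 2 * j, 2 * k - 1)
            * Fraction(2 ** (2 * j + 2) - 1, j + 1)
            * bernoulli(2 * j + 2)
            * a[2 * k + 2 * j]
            * c ** (2 * j + 1)
            for j in range(n - k + 1)
        )
        rows.append((k, lhs, Fraction(a[2 * k - 1])))
    return rows


# Example (1): the two sides differ, so no symmetry factor exists.
print(theorem4_conditions([-6, 5, 5, -5, 1]))
```

For a fourth order expression the single condition works out to $a_1 = a_2a_3/(2a_4) - a_3^3/(8a_4^2)$, which gives $25/8$ rather than $5$ for Example (1).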


(2) $L(y) = y^{(4)} - 2y^{(3)} + y' + xy$.

According to Theorem 3, the symmetry factor is necessarily $f(x) = e^{-x}$. It is easy to check
that $e^{-x}$ also satisfies condition (ii) of Theorem 3 so $e^{-x}L(y)$ is formally self adjoint. Note that

$$e^{-x}L(y) = (e^{-x}y'')'' - (e^{-x}y')' + xe^{-x}y.$$

(3) $L(y) = x^2y^{(4)} - (2x^2 - 4x)y^{(3)} + (x^2 - (2R + 6)x)y'' + ((2R + 2)x - 2R)y'$.

Using Theorem 3, it follows that $e^{-x}$ is a symmetry factor. Again, observe that

$$e^{-x}L(y) = (x^2e^{-x}y'')'' - (([2R + 2]x + 2)e^{-x}y')'.$$
(4) L(y) = y© — Sy + 2y — y” + By! + By. 
The symmetry factor must necessarily be $f(x) = e^{-5x/3}$. However, condition (ii) of Theorem 4 is
not satisfied so $L(y)$ cannot be made formally self adjoint.
The aim of this note is to show an efficient method of evaluating the definite integral


(1) f(a) =f 


1 log(x* — 2xcosa + 1) 
+ 248 pp, $6.50 (P). [ISBN: 0-486-61471-9] Reprint of 1958 edition with Davis' 1973 Monthly article
on Hilbert's tenth problem as an appendix. Introduction to recursion theory using Turing machines.
Covers basic theory of recursive sets and functions, logical and combinatorial decision problems, 
Kleene hierarchy, computable functionals. No exercises. No mathematical prerequisites except matu- 
rity. KS 


Foundations, P, L. Rigorous Proof in Mathematics Education. Gila Hanna. OISE Pr, 1983, viii + 97 
pp, $16.50 (P). [ISBN: 0-7744-0251-2] Full scale theoretical assault on two basic assumptions underlying
the curricular reforms of the 1950's and 60's which laid new emphasis on formal rigor in the
secondary curriculum. By examining the conflicting tenets of logicism, formalism and intuitionism 
and by looking at the social and psychological realities of actual mathematical practice, the author 
hopes to refute the beliefs that (a) "rigorous proof is the most important characteristic of modern 
mathematical practice," and (b) "in modern mathematical theory there are generally accepted criteria 
for the validity of a proof." GHM


Graph Theory, P. Graph Theory. Ed: Béla Bollobás. Math. Stud., No. 62. Elsevier North-Holland,
1982, viii + 201 pp, $46.50 (P). [ISBN: 0-444-86449-0] Proceedings of conference held at Cambridge, 
England, March 11-13, 1981. Contains 19 research articles dealing with many aspects of graph 
theory. KS 


Discrete Mathematics, T(14-15: 1, 2), S, L. Concepts in Discrete Mathematics. Sartaj Sahni. 
Camelot Pub, 1981, x + 437 pp, $26.95. [ISBN: 0-942450-00-0] Readable introduction to topics in 
discrete mathematics most relevant to computer science: logic, sets, relations, functions, complex- 
ity, recurrence relations, probability, graphs. No mention of Turing machines. Numerous examples 
and exercises; many deal with design or analysis of algorithms. KS 


Number Theory, T(17: 1), S, P, L*. Sequences. H. Halberstam, K.F. Roth. Springer-Verlag, 1983, 
xviii + 290 pp, $28. [ISBN: 0-387-90801-3] This edition is essentially a reprint, with corrections,
of the 1966 classic which gave the first systematic account of that part of number theory which 
deals with specific sequences of integers. (First Edition, TR, August-September 1967.) CEC 


Number Theory, P. Séminaire de Théorie des Nombres, Paris 1980-81: Séminaire Delange-Pisot-Poitou. 
Ed: Marie-José Bertin. Progress in Math., V. 22. Birkhäuser Boston, 1982, vii + 362 pp, $22.50.
[ISBN: 3-7643-3066-X] 


Algebra, T(15-16: 1), S, L. Introduction to Field Theory, Second Edition. Iain T. Adamson. Cam- 
bridge U Pr, 1982, viii + 181 pp, $9.95 (P); $19.95. [ISBN: 0-521-28658-1; 0-521-24388-2] Good expo- 
sition of Galois theory following Artin’s approach. Includes background material on field exten- 
sions and applications to finite fields, ruler-and-compass constructions, and solution by radicals. 
Few but generally interesting exercises. Minor changes from First Edition. KS 


Algebra, P. Lecture Notes in Mathematics-966 & 967: Algebraic K-Theory. Ed: R. Keith Dennis.
Springer-Verlag, 1982. Part I, viii + 407 pp, $20 (P) [ISBN: 0-387-11965-5]; Part II, vii + 409 pp,
$20 (P). [ISBN: 0-387-11966-3] Proceedings of a conference held at Oberwolfach in June 1980. JAS


Finite Mathematics, T(13). College Mathematics for the Managerial and Social Sciences. Soo Tang 
Tan. Prindle, Weber & Schmidt, 1983, viii + 788 pp. [ISBN: 0-87150-354-9] Standard topics in finite 
mathematics and calculus normally covered in a one-year course. Two years of high school algebra is 
the assumed background. AWR 


Finite Mathematics, T(13). Finite Mathematics with BASIC: A Liberal Arts Approach. Irving Allen 
Dodes. Krieger Pub, 1981, xii + 359 pp, $21. [ISBN: 0-88275-862-4] A text that emphasizes linear 
programming, Markov chains, game theory, computation, and applications to industry--at the cost of 
de-emphasizing the usual work with probability, statistics, truth tables, and switching circuits.
AWR


Calculus, T(13). Techniques of Calculus. Robert E. Dressler, Karl Stromberg. Amsco Coll Pub, 
1983, x + 627 pp, $14 (P); $20. [ISBN: 0-87720-978-2] The standard fare for a one-year single variable
calculus course intended "to teach the student how to produce correct solutions...not how to
give logically pristine proofs." Is perhaps aimed at the high school calculus course. AWR 


Calculus, S(13-14). Microcomputer Applications for Calculus. Gary G. Bitter. Prindle, Weber & 
Schmidt, 1983, xi + 241 pp, (P). [ISBN: 0-87150-378-6] A collection of very simple programs, each 
presented in Basic, Pascal and Fortran, illustrating most of the standard calculus topics. They do 
not illustrate good programming style, however. GHM 


Calculus, T*(13: 1, 2). Applied Mathematics for Business and Economics, Life Sciences, and Social 
Sciences. Raymond A. Barnett, Charles J. Burke, Michael R. Ziegler. Dellen Pub, 1983, xvi + 856 
pp, $26.95. [ISBN: 0-89517-049-3] Algebra review, finite mathematics and calculus in one and 
several variables. Survey-selected topics, applications and emphasis for students with one and a 
half to two years of high school algebra. Stress is on computational skills, ideas and problem- 
solving rather than on theory. No specialized experience is required for the applications. 
Instructor aids, including quizzes and chapter tests are available from the publisher. JK 


Calculus, T(13-14: 2). Introductory Mathematical Analysis for Students of Business and Economics,
Fourth Edition. Ernest F. Haeussler, Jr., Richard S. Paul. Reston Pub, 1983, xv + 904 pp. [ISBN:
0-8359-3274-5] Well-done, not overpowering treatment of calculus of one and several variables,
matrix algebra, linear programming, and, in this latest edition, a chapter on basic probability
theory. Self-contained with respect to applications in business and economics. Attractive format.
Lots of exercises. (First Edition, TR, May 1974; Second Edition, TR, December 1976; Third Edition,
TR, December 1980.) JK


Complex Analysis, S(18), P. Characteristic Properties of Quasidisks. Frederick W. Gehring. Pr U 
Montreal, 1982, 97 pp, $11 (P). [ISBN: 2-7606-0601-5] Quasidisks are images of the unit disk under 
quasiconformal mappings of the Riemann sphere. Based on lectures at the August 1982 Montreal NATO 
Advanced Study Institute on function theory, the book presents various characterizations of quasi- 
disks by geometric, function-theoretic, and other properties. Some proofs and an application are 
given. PZ 


Complex Analysis, S(18), P. Vector Valued Nevanlinna Theory. H.J.W. Ziegler. Research Notes in 
Math., No. 73. Pitman Pub, 1982, xiii + 201 pp, $19.95 (P). [ISBN: 0-273-08530-1] Extends the 
value distribution theory of F. and R. Nevanlinna to the context of simultaneous solutions of no 
meromorphic functions of one variable. The two main theorems of Nevanlinna theory are generalized 
and studied from the viewpoint of Hermitian differential geometry. With an appendix on the rudi- 
ments of complex manifolds. PZ 


Complex Analysis, T(16-17: 1). Funktionentheorie. Rudolf J. Taschner. Manzsche Verlag, 1983, 241 
pp, DM41.50 (P). [ISBN: 3-214-00015-2] An introduction to complex analysis written for engineers and 
other users of mathematics. JD-B 


Differential Equations, P. Lecture Notes in Mathematics-954: Differential Systems Involving 
Impulses. Sudhakar G. Pandit, Sadashiv G. Deo. Springer-Verlag, 1982, vii + 102 pp, $8 (P). [ISBN: 
0-387-11606-0] The derivative involved in these equations is the distributional derivative, allowing 
treatment of perturbed ordinary differential equations in which the perturbations are of impulsive 
type. This monograph attempts to unify research results of the past fifteen years. Five chapters 
deal with problems of existence, uniqueness, stability, boundedness, and asymptotic equivalence. 
AWR 


Differential Equations, P. Irregular Singularities in Several Variables. A.R.P. van den Essen,
A.H.M. Levelt. Memoirs No. 270. AMS, 1982, iv + 43 pp, $4 (P). [ISBN: 0-8218-2270-5] Generalizes
to the setting of first order linear partial differential equations a classical theorem of first
order linear ordinary differential equations, to the effect that linear systems with analytic
coefficients are triangulable by taking a suitable root of the independent variable, and a linear
transformation of the unknown functions. PZ


Differential Equations, P. Lecture Notes in Mathematics-964: Ordinary and Partial Differential 
Equations. Ed: W.N. Everitt, B.D. Sleeman. Springer-Verlag, 1982, xviii + 726 pp, $32 (P). [ISBN: 
0-387-11968-X] Proceedings of a conference at Dundee, Scotland, March 29-April 2, 1982 on the occa- 
sion of its centenary: 60 papers on a wide variety of research in differential equations. LAS 


Differential Equations, T(15-16: 1). Elementary Applied Partial Differential Equations with Fourier 
Series and Boundary Value Problems. Richard Haberman. Prentice-Hall, 1983, x + 533 pp, $34.95. 
[ISBN: 0-13-252833-9] An elementary introduction to partial differential equations emphasizing 
engineering and physical science applications. The topics covered include separation of variables, 
Fourier series, Sturm-Liouville theory, Green’s functions, Fourier and Laplace transforms, and the
method of characteristics. AO 


Numerical Analysis, P. The Method of Discretization in Time and Partial Differential Equations. 
Karel Rektorys. Math. & Its Appl., V. 4. D Reidel Pub, 1982, xviii + 451 pp, $69. [ISBN: 90-277- 
1342-1] Generalization of the method of lines applied to elliptic, parabolic, hyperbolic, nonhomo- 
geneous, nonlinear, and integrodifferential problems. Examples. Includes the author’s results on 
existence, convergence, error estimates and regularity. RWN 


Numerical Analysis, P. Sparse Matrices and their Uses. Ed: Iain S. Duff. Inst. of Math. & its 
Appl. Academic Pr, 1981, xii + 387 pp, $43.50. [ISBN: 0-12-223280-1] Proceedings of a conference 
held July 9-11, 1980 at the University of Reading (England). Most of the papers are accessible to 
the nonspecialist. AO 


Functional Analysis, P. Topics in Locally Convex Spaces. Manuel Valdivia. Math. Stud., V. 6/7. 
Elsevier North-Holland, 1982, xiii + 510 pp, $69.75 (P). [ISBN: 0-444-86418-0] Aimed at people who 
are already familiar with the general theory of locally convex spaces, the book collects recent
results from the periodical literature and sets forth some new results. AWR 


Functional Analysis, P. Lecture Notes in Mathematics-955: Bundles of Topological Vector Spaces and
Their Duality. Gerhard Gierz. Springer-Verlag, 1982, iv + 296 pp, $13.50 (P). [ISBN: 0-387-11610-9]


Functional Analysis, T(18: 1), S, P. Linear Equations in Banach Spaces. S.G. Krein. Transl: A.
Iacob. Birkhäuser Boston, 1982, xii + 102 pp, $14.95. [ISBN: 3-7643-3101-1] A brief but comprehensive
treatment of linear equations in Banach spaces. Covers, e.g., solvability, stability, special
classes of equations, the adjoint equation, regularity. Concludes with applications to integral and
differential operators. Appendix on elementary functional analysis; no exercises. Russian edition
appeared in 1971. PZ


Functional Analysis, P. Spectral Theory. Ed: Wieslaw Zelazko. Banach Center Pub, V. 8. PWN, 
1982, 603 pp. [ISBN: 83-01-01495-4] Survey papers, summaries of lectures, and research contributions 
from the 1977 spectral theory semester at the Banach Center in Warsaw. Concludes with a collection 
of problems from the Center’s problem book. LAS 


Functional Analysis, T(18: 1), S, P. The Convolution Product and Some Applications. Wilhelm Kecs.
Transl: Victor Giurgiutiu. D Reidel Pub, 1982, xvii + 332 pp, $69.50. [ISBN: 90-277-1409-6] A
treatment of the basic properties and applications of the convolution product for functions and
distributions. After a general introduction to topological vector spaces and convolutions,
applications are made to integral transforms, differential equations, electrical engineering, and
viscoelastic solids. References, index. JS


Functional Analysis, P. Der Graphensatz in lokalkonvexen topologischen Vektorräumen. Michael
Oberguggenberger. Teubner-Texte zur Math., B. 44. B.G. Teubner, 1982, 148 pp, 15,50M (P). On Banach’s
closed graph theorem and some of its generalizations. JD-B


Analysis, P. Nonlinear Analysis, Function Spaces and Applications, Volume 2. Ed: Oldrich John, 
Alois Kufner. Teubner-Texte, B. 49. B.G. Teubner, 1982, 268 pp, 27M (P). Proceedings of the Spring 
School held in Pisek, Czechoslovakia, May 24-28, 1982. JAS 


Algebraic Geometry, S(18), P. Commutative Algebra: Proceedings of the Trento Conference. Ed: Sil- 
vio Greco, Giuseppe Valla. Lect. Notes in Pure & Appl. Math., V. 84. Dekker, 1983, viii + 351 pp, 
$47.50 (P). [ISBN: 0-8347-1899-2] Collection of research papers representing an overview of the most 
recent developments in commutative algebra with special reference to algebraic geometry. LCL


Differential Geometry, T(18: 1), P. Total Curvature in Riemannian Geometry. T.J. Willmore. Ser. 
in Math. & Its Applic. Halsted Pr, 1982, 168 pp, $39.95. [ISBN: 0-470-27354-2] An exposition of
"the invariants which arise from integrating various curvature measures over manifolds." Very 
definitely a sophisticated presentation of the material. No exercises, good bibliography, a short 
and rather incomplete index. JAS 


Differential Geometry, P. Differential Geometry and Mathematical Physics. Ed: M. Cahen, et al. 
Math. Physics Stud., No. 3. D Reidel Pub, 1983, vii + 188 pp, $28.50 (P). [ISBN: 90-277-1508-4] 
Lectures given at the meetings of the Belgian Contact Group on Differential Geometry held at Liége, 
May 2-3, 1980 and at Leuven, February 6-8, 1981. JAS 


Geometry, T(16-17), P. Combinatorial Integral Geometry With Applications to Mathematical Stereol- 
ogy. R.V. Ambartzumian. Wiley, 1982, xvii + 221 pp, $45. [ISBN: 0-471-27977-3] Could as easily be 
described as a book on probability or combinatorics. A fresh development built upon the idea of 
introducing measures into the space of lines in the plane to solve problems in geometrical probabil- 
ity. AWR 


Geometry, T(17: 1, 2), S, P. An Introduction to Convex Polytopes. Arne Brøndsted. Grad. Texts in
Math., No. 90. Springer-Verlag, 1983, viii + 160 pp, $28. [ISBN: 0-387-90722-X] Assumes only
elementary linear algebra and some point set topology; develops basic material on d-dimensional convex
sets and, particularly, convex polytopes with the goal of studying the combinatorial theory of
convex polytopes. Highlights are the Dehn-Sommerville relations, upper bound theorem and lower bound
theorem, the latter two dating from the early 1970's. SS


Geometry, P. Lecture Notes in Mathematics-970: Twistor Geometry and Non-Linear Systems. Ed: H.D.
Doebner, T.D. Palev. Springer-Verlag, 1982, v + 216 pp, $11.50 (P). [ISBN: 0-387-11972-8] Survey
lectures from the Fourth Bulgarian Summer School on Mathematical Problems of Quantum Field Theory
held at Primorsko, Bulgaria in September 1980. JAS


Operations Research, T(14: 1). Introduction to Mathematical Programming: Quantitative Tools for 
Decision Making. Benjamin Lev, Howard J. Weiss. Elsevier North-Holland, 1982, xii + 289 pp,
$27.50. [ISBN: 0-444-00591-9] Intended for graduate or undergraduate business students. Few 
mathematics prerequisites, but does require some sophistication. Topics include linear programming, 
application to the classical transportation and assignment problems, integer and dynamic program- 
ming, search techniques in non-linear programming. JRG 


Operations Research, P. Mathematical Programming with Data Perturbations II. Ed: Anthony V. 
Fiacco. Lect. Notes in Pure & Appl. Math., V. 85. Dekker, 1983, vi + 155 pp, $34.50 (P). [ISBN: 
0-8247-1789-9] Papers selected from a May 1980 symposium at George Washington University covering
recent results in programming sensitivity and stability analysis methodology. LAS 


Game Theory. The Mathematics of Games of Strategy: Theory and Applications. Melvin Dresher. Dover 
Pub, 1981, 184 pp, $4 (P). [ISBN: 0-486-64216-X] An unabridged republication of the author’s book 
Games of Strategy: Theory and Applications, Prentice-Hall, 1961. Written to an audience that has 
had a year of calculus but no linear algebra, this is a self-contained exposition that starts with 
parlor games, proves the minimax theorem, and discusses a variety of ways to find optimal
strategies. A bit more than a third of the book treats infinite games. Explanation is largely couched
in military jargon. Attractive as a text except for lack of exercises. AWR


Optimization, S(16-17), P, L. Nonlinear Mathematics. Thomas L. Saaty, Joseph Bram. Dover Pub, 
1981, xv + 381 pp, $6.50 (P). [ISBN: 0-486-64233-X] Reproduction of this same title published in 
1964 by McGraw-Hill. The goal is to provide some unified approaches to nonlinear mathematics. 
Attention is given to nonlinear problems in optimization, programming, ordinary differential equa- 
tions, automatic control, and prediction theory. AWR 


Optimization, T(17-18: 2), P. Optimization--Theory and Applications: Problems with Ordinary Dif- 
ferential Equations. Lamberto Cesari. Appl. in Math., No. 17. Springer-Verlag, 1983, xiv + 542 
pp, $68. [ISBN: 0-387-90676-2] Focused on nonparametric problems of the calculus of variations in 
one independent variable and on problems of optimal control modelled by ordinary differential equa- 
tions, this book emphasizes the connections of present work to classical theory. Numerous concrete 
problems appear in the text before existence theorems are taken up. Intended for graduate students 
in mathematics or engineering. AWR 


Optimization, T(16-17). Practical Optimization. Philip E. Gill, Walter Murray, Margaret H. Wright. 
Academic Pr, 1981, xvi + 401 pp, $46.50 (P). [ISBN: 0-12-283952-8] Emphasis is on implementation of 
optimization methods with particular attention to the effects of finite-precision computation. To 
make the book self-contained and more suitable as a text, chapters are included on numerical linear 
algebra and on a treatment of optimality conditions. Authors have all been active in development of 
software for the solution of optimization problems. AWR


Optimization, S(17-18), P. Grundprinzipien der Theorie der Extremalaufgaben. Vladimir M.
Tichomirov. Teubner-Texte zur Math., B. 30. B.G. Teubner, 1982, 152 pp, 16M (P). A brief and 
modern introduction to the general theory of extremal problems. JD-B 


Optimization, P. Constrained Optimization and Lagrange Multiplier Methods. Dimitri P. Bertsekas. 
Computer Sci. & Appl. Math. Academic Pr, 1982, xiii + 395 pp. [ISBN: 0-12-093480-9] This monograph 
presents the method of multipliers and many of its variations that have been developed in the past 
fifteen years. Several previously unpublished results and algorithms are given. AO 


Optimization, T(16-18: 1). Matrices and Simplex Algorithms. A.R.G. Heesterman. D Reidel Pub, 
1983, x + 790 pp, $96. [ISBN: 90-277-1514-9] After an elementary introduction to matrices and deter- 
minants, the emphasis is on programming algorithms. Aimed largely toward applications in economics, 
it deals with linear, quadratic, and integer programming. Numerous examples, exercises, and
programs (in Algol). References, index. Inordinately priced. JS


Statistics, P. Lecture Notes in Statistics-12: Statistical Analysis of Counting Processes. Martin
Jacobsen. Springer-Verlag, 1982, vii + 226 pp, $14.80 (P). [ISBN: 0-387-90769-6] A counting process
is a stochastic process that counts the number of events that have occurred before time t. This
monograph treats multiplicative intensity models for counting processes first introduced by Odd
Aalen in 1975. LAS


Statistics, S(16-18), P. Lecture Notes in Medical Informatics-19: Discovery and Representation of
Causal Relationships from a Large Time-Oriented Clinical Database: The RX Project. Robert L. Blum.
Springer-Verlag, 1982, xix + 242 pp, $18 (P). [ISBN: 0-387-11962-0] RX is a computer program that
derives from a clinical database a set of causal conjectures. The principles behind RX can be used
in any area of knowledge to draw inferences (rather than merely retrieve information) from a
database. LAS


Statistics, S(15-18), P. Lecture Notes in Statistics-15: Sampling With Unequal Probabilities. 
K.R.W. Brewer, Muhammad Hanif. Springer-Verlag, 1983, ix + 164 pp, $12.80 (P). [ISBN: 0-387-90807-2]


Statistics, S(16-18), P. Regression Analysis of Survival Data in Cancer Chemotherapy. Walter H. 
Carter, Jr., Galen L. Wampler, Donald M. Stablein. Statistics, V. 44. Dekker, 1983, x + 209 pp, 
$35. [ISBN: 0-8247-1736-8] An exhortation, supported by extensive preclinical evidence, to the use 
of regression models in clinical trials of cancer drugs, especially in predicting optimal doses when 
several drugs are used at once. LAS 


Statistics, P. COMPSTAT 1982, Part 1: Proceedings in Computational Statistics. Ed: H. Caussinus, P. 
Ettinger, R. Tomassone. Physica-Verlag, 1982, 466 pp, (P). [ISBN: 3-7051-0002-5] 15 invited and 56 
contributed papers from a 1982 Toulouse symposium on computational statistics providing a diverse, 
international perspective on this rapidly growing field. LAS 


Statistics, S(15-16). Initiation à l'Analyse des Données. Jean de Lagarde. Dunod (US Distr: SMPF,
485 5th Ave., Suite 1042, NY 10017), 1983, ix + 157 pp, 68F (P). [ISBN: 2-04-105493-0] An
introduction to statistics, intended for users and without proofs. Several programs in Basic. JD-B


Computer Literacy, S. What’s So Funny About Computers? S. Harris. William Kaufmann, 1982, vi +
122 pp, $6.95 (P). [ISBN: 0-86576-049-7] 120 antidotes to "terminal shock," all with some byte to
them. "The central processing unit is on strike. How does one negotiate with a central processing 
unit?" LAS 


Computer Programming, S. Fast BASIC: Beyond TRS-80 BASIC. George A. and Thomas G. Gratzer. Wiley,
1982, ix + 278 pp, $14.95 (P). [ISBN: 0-471-09849-3] Describes how to increase speed of Basic pro- 
grams by using peek and poke commands and machine language routines from TRS-80 ROM. Contains back- 
ground information on representation of numbers, memory organization and Z-80 microprocessor. Exer- 
cises with complete solutions. KS 


Computer Programming, T(13-14: 1), S, L. Programming Concepts and Problem Solving: An Introduction 
to Computer Science Using Pascal. Peter Linz. Ser. in Comput. & Inform. Sci. Benjamin/Cummings, 
1983, xxii + 388 pp, $21.95. [ISBN: 0-8053-5710-6] Introduction to programming using Pascal. Suit- 
able for a first course, possibly a year course if supplemented. RM 


Computer Programming, T(13: 1). Introduction to Programming BASIC: A Structured Approach, Second 
Edition. Chris R. Siragusa. Prindle, Weber & Schmidt, 1983, xiii + 317 pp (P). [ISBN: 0-87150- 
386-7] Structured approach to programming introduced early and used throughout the text. Examples 
and exercises require minimal degree of mathematical sophistication. Changes from First Edition
include changes in the order in which Basic instructions are introduced, new techniques for data
retrieval, more emphasis on programming style. JRG 


Computer Programming, T(13: 1). Introduction to BASIC Programming. Gary B. Shelly, Thomas J. 
Cashman. Anaheim Pub, 1982, xiii + 404 pp, (P) [ISBN: 0-88236-118-X]; Test Bank, 153 pp;
Transparency Masters, 219 pp. Detailed introduction, with stress on the importance of good program 
design, proper documentation, and readable code. Abundant advice, coding tips, examples, review 
questions, exercises, summaries. Unusually fine color graphics on high enamel stock. The Test
Bank contains nearly 100 true-false questions and 500 multiple-choice questions. All the drawings 
in the text are included in Transparency Masters. LCL 


Computer Programming, S*(13). Introduction to Structured Programming with Pascal. Milton M. Under- 
koffler. Prindle, Weber & Schmidt, 1983, ix + 279 pp, (P). [ISBN: 0-87150-394-8] An introduction to 
Pascal which emphasizes top-down design, structured programming techniques, and program documenta- 
tion. Thirty-three completely solved problems illustrate the main points made in the text. AO 


Computer Programming, S(15-17), P, L. Studies in Ada Style, Second Edition (January 1983). Peter 
Hibbard, et al. Springer-Verlag, 1983, 111 pp, $12 (P). [ISBN: 0-387-90816-1] The first part of 
this monograph presents some of the ideas that motivated the development of Ada. The second part 
presents five Ada programs together with discussion of their design. AO 


Computer Programming, S(15-17), P. ADA, An Introduction, Second Edition--January 1983. Henry Ledgard.
Springer-Verlag, 1983, viii + 135 pp, $12.80 (P). [ISBN: 0-387-90814-5] The text has been
slightly revised to bring it into conformity with the July 1982 Ada standard. (First Edition, TR, 
February 1982.) AO 


Computer Programming, S(13). Programming the IBM Personal Computer: FORTRAN 77. Robert A. Rouse,
Thomas L. Bugnitz. Holt, Rinehart & Winston, 1983, xiv + 240 pp, $16.95 (P). [ISBN: 0-03-062042-2] 
An elementary introduction to Fortran 77 emphasizing structured programming techniques. The text is 
geared specifically to the Fortran 77 available on the IBM PC. AO 


Computer Programming, T(13: 1), S. FORTRAN Programming: A Spiral Approach, Second Edition. Charles 
B. Kreitzberg, Ben Shneiderman. HarBrace J, 1982, x + 437 pp, $17.95 (P). [ISBN: 0-15-528015-5] An 
elementary introduction to Fortran (including some of the features of Fortran 77) compatible with 
WATFOR/WATFIV. Each chapter includes a reading on programming style and discusses debugging tech- 
niques. Structured flowcharts are used to present algorithms. AO 


Data Structures, S(13-14). Elementary Computer Graphics. Aftab A. Mufti. Reston Pub, 1983, xiv +
210 pp, $19.95. [ISBN: 0-8359-1654-5] A very sketchy survey of hardware, data structures, and matrix 
operations associated with two and three dimensional computer graphics, illustrated primarily by 
wire-frame drawings and associated Fortran programs. More of an outline than a monograph, it just 
skims the surface of a fascinating subject. LAS 


Computer Science, T(15-17: 1), L. Data Models. Dionysios C. Tsichritzis, Frederick H. Lochovsky. 
Prentice-Hall, 1982, xiv + 381 pp, $24.95. [ISBN: 0-13-196428-3] Data models are used in the design
of database management systems to organize and represent information in a form amenable to computer 
manipulation. This introductory text describes a number of data models and illustrates their use. 
AO 


Computer Science, T(15-17: 1), S, L. Abstract Machines and Grammars. Walter J. Savitch. Little, 
Brown & Co, 1982, xiv + 215 pp, $24.95. [ISBN: 0-316-77161-9] Concise introduction to computability,
automata and formal language theory. Good motivation of topics and discussion of relations between 
topics. Assumes familiarity with high-level language like Pascal. KS 


Computer Science, T(14-18: 1, 2), S, L. Fundamentals of Programming Languages. Ellis Horowitz. 
Computer Sci Pr, 1983, xiv + 450 pp, $23.95. [ISBN: 0-914894-37-4] Focuses on topics and concepts 
found in most programming languages: syntax, variables, expressions, statements, typing, scope,
procedures, data types, data abstraction, exception handling, concurrency, I/O. Contains chapters on
functional programming, data flow programming, and object oriented programming languages. Each 
chapter ends with a list of concepts discussed in the chapter and a set of chapter exercises. 
References, index. RJA 


Computer Science, S(15-18), P, L. History of Programming Languages. Ed: Richard L. Wexelblat.
Academic Pr, 1981, xxiii + 758 pp, $45. [ISBN: 0-12-745040-8] Proceedings of the ACM SIGPLAN History 
of Programming Languages Conference, June 1-3, 1978. Contains an editor’s introduction, an after- 


Computer Science, L. International Microcomputer Dictionary. Sybex, 1981, x + 121 pp, $3.95 (P).
[ISBN: 0-89588-067-9] It really is! Sixty-one pages of dictionary are followed by a list of common
chip numbers and nearly forty pages of Danish, Dutch, French, German, Hungarian, Italian, Norwegian,


1982, x + 220 pp, $25 (P). [ISBN: 0-942344-01-4] First of a four-part series consisting of captioned


value problems, complex analysis, and numerical methods. AWR


Applications (Engineering), S(17-18), P. Robot Vision. Ed: Alan Pugh. Springer-Verlag, 1983, xi + 
356 pp, $47.50. [ISBN: 0-387-12073-4] The first in a series concerning manufacturing applications of


tion. Kenneth H. Huebner, Earl A. Thornton. Wiley, 1982, xxii + 623 pp, $37.50. [ISBN: 0-471-


Section Reports


An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
the author. Papers presented under special sponsorship as part of joint meetings are so noted in 
parentheses. 


Oklahoma-Arkansas Section 


The fall meeting of the Oklahoma-Arkansas Section was held at the University of Oklahoma on 
March 18-19, 1983. The attendance was approximately 100 people with 23 papers given by faculty and 
students. 


Invited Address: 


"Some Mathematical Surprises," by Victor Klee, University of Washington. (N.A. Court Lecture) 

"Report on the National Science Board Commission and Precollege Education in Mathematics," by 
Katherine Layton, National Science Board Commission on Precollege Mathematics, Science, and 
Technology. 


Panel Discussion: 


"Precollege Mathematics Education in the United States," Katherine Layton (Presider); R.D. Ander- 
son, Past President of MAA; Dale Franks, Superintendent of the Hope Public School System, Hope, 
Arkansas; Dave McCurdy, United States Congressman, 4th District Representative, Oklahoma; and 
Charles Watson, Mathematics Specialist, Department of Education, Arkansas.


Short Presentations: 


"Continued Square Roots," by Bianca M. Hearn, Hendrix College. 

"p-adic Numbers," by Daryl Ezzo, Oral Roberts University. 

"A Child’s Garden of Quaternions," by Bobby Winters, East Central University. 

"Elementary Transfer Orbits," by Tim Koster, Oklahoma State University. 

"Log-Convexity and the Gamma Function," by Karen Anderson, Hendrix College. 

"Functional Equations Arising from Notational Ambiguities in Calculus," by Karen Shirley, Hendrix 
College. 

"Two Ancient Greek Construction Problems in Euclidean and Hyperbolic Geometry," by Jack M. Rau, 
Oklahoma State University.

"The Diffusion Index as an Indicator of Stock Market Trends," by Dennis Bertholf, Harry Comeskey,
and Wayne B. Powell, Oklahoma State University.

"Robust Autoregressive Time Series Parameter Estimation," by John Fingerlin, G.0. Scan-AMCO.

"Digital Image Processing," by Frank Chimenti, Southwestern State University.

"Some Closed Graph Theorems," by Ray Hamlett, East Central University.

"Problem Solving," by Doug Foster, Oral Roberts University.

"A one-hour course on Hofstadter’s Gödel, Escher, Bach," by Dale Alspach, Michael Folk, and Joel
Haack, Oklahoma State University.

"A Mathematical System of Single Fingerprint Classification," by William R. Orton, University of
Arkansas at Fayetteville.

"An Applied Linear Algebra Course," by Tom Cairns and Bill Coberly, University of Tulsa.

"Functions of Mathematics Augmented With a Micro," by Kelvin Casebeer, Southwestern State Univer- 
sity. 

"Survey of the Mathematics Shortage in Oklahoma," by Harold Huneke, University of Oklahoma. 

"Problem Solving for Gifted Junior High Students," by Paul Duvall, Oklahoma State University. 

"Apportionment: A Model with Means," by Donald L. Pattern, University of Oklahoma. 

"Joint Graphs and the Traversability," by Mahesh M. Hiremath, University of Arkansas at Fayette- 
ville. 

"Supercyclic Hilbert Space Operators," by Bill Stockwell, Central State University. 

"Constructing Amicable Pairs From Known Amicable Pairs: An Alternative Approach," by Dale Woods 
and Joe D. Flowers, Central State University. 

"Maximum Likelihood Estimation of Linear Sufficient Statistics," by Bill Coberly, University of 
Tulsa. 


Nebraska Section 


The fall meeting of the Nebraska Section was held at the University of Nebraska, Omaha on March 
25-26, 1983. 


Invited Lecture:


"MAA's New Mathematical Science Curriculum Recommendations," by Alan Tucker, SUNY at Stony Brook.


Contributed Papers:


"Numerical Estimate of Shock Region of a Certain Hyperbolic 2-Conservation Law," by Peter H. 
Chang, University of Nebraska, Omaha. 

"Modular Arithmetic, Second Differences and Symmetry," by Mel Thornton, University of Nebraska,
Lincoln.

* "Objective Testing with Subjective Probability," by Nelson C. Fong, Kearney State College.

"Good Principles Gone Awry: How Not to Teach Programming," by Albert F. Shpuntoff, University of 
Nebraska, Omaha.

"Exceptional t-Designs," by Spyros S. Magliveras and David W. Leavitt, University of Nebraska,
Lincoln. 

* "High School Math Contest," by Richard Barlow, Kearney State College. 

"Paradoxical Coverings of the Real Line," by Ivan Niven, President, Mathematical Association of 
America. 

"Group Codes for the Gaussian Channel: A Survey of Current Research," by John Karlof, University 
of Nebraska, Omaha. 

"On the Number of Combinations Without r-Separation," by John Konvalina, University of Nebraska, 
Omaha.

"Conditional Yeh-Wiener Integrals," by Kun Soo Chang, University of Nebraska, Lincoln.

"How to Solve the n × n × n Cube," by J.A. Eidswick, University of Nebraska, Lincoln.

"To Compute or Not to Compute: The KSW Computer Science Placement Test," by Stanley Wileman, 
University of Nebraska, Omaha. 

"A Basic Maximum Flow Algorithm for Network Modules of Building Evacuation," by John T. Hsieh, 
University of Nebraska, Omaha. 


Illinois Section 


The sixty-second annual spring meeting of the Illinois Section convened at Rockford College on 
April 29-30, 1983 with approximately seventy members in attendance. 


Invited Addresses: 


"Problems of Intuitive Geometry," by Kenneth Stolarsky, University of Illinois. 

"The Logic of Differential Equations," by Lee Rubel, University of Illinois. 

"Modern Applications of Second Grade Mathematics," by Charles Vanden Eynden, Illinois State 
University. 

"Root Systems and Arrangements of Hyperplanes," by Louis Solomon, University of Wisconsin. 

"What Are the Waves of the Future?" by Carole Bauer, Triton College. 

"Mathematics and Axiomatic Social Choice Theory: The Arrow (Theorem) Has Left Its Mark," by Ed 
Packel, Lake Forest College. 

"A Mathematical Principle for the Circumvention of Multiple-Choice Examinations," by Graham Evan, 
University of Illinois. 

"CUPM’s New Recommendations for a General Mathematical Sciences Program," by Alan Tucker, SUNY at 
Stony Brook. 

"A Random Talk," by Robert V. Hogg, University of Iowa. 


Professor Howard C. Saar, Secretary-Treasurer of ISMAA for the past seventeen years, was 
presented the Distinguished Service Award of the Section. 


Seaway Section 


The Seaway Section held its spring meeting jointly with the New York State Mathematics Associa- 
tion of Two Year Colleges at Utica, New York on April 22-24, 1983. Over 200 mathematicians were in 
attendance. 


Invited Addresses:


"The Mathematics Teacher of the 1980's," by Stanley Bezuszka, Boston College.

"Some Recent Applications of Functional Equations to the Behavioral Sciences" (The Gehman Lecture),
by János D. Aczél, University of Waterloo.


Short Papers:


"Fagin's Theorem on Domain-Key Normal Form for Relations," by Douglas Cashing, St. Bonaventure
University.

"On the Use of Multivariable Polynomials for Integrating Ordinary Differential Equations," by Mike 
Mikalajanus, Montreal, Quebec. 

"Writing About Mathematics," by Dorothy Buerk, Ithaca College. 

"Simple Sum-Preserving Rearrangements of Series: A Discouraging Theorem," by Paul Schaefer, SUNY 
College at Geneseo. 

"Developments from Normal Surface Theory which Apply to Knot Spaces," by Richard Gustafson, SUNY 
College at Oneonta. 

"Business Simulation Games: A Successful Interim Course," by Patti Frazer Lock, St. Lawrence 
University. 


Student Papers:


"Rubik Type Puzzles and Their Groups," by Karen F. Gold, SUNY at Albany. 

"Computer Graphics Generation of Wallpaper Groups," by John Gennavi, Nancy Goering, and Keith 
Swingruber, Colgate University. 

"The Solution to Bulgarian Solitaire," by Emilio Mastrandrea, Rochester Institute of Technology. 


It was announced that, for the second year, David Ash, University of Waterloo, is the Putnam 
Prize winner from the Seaway Section. 


Rocky Mountain Section 


The sixty-sixth annual spring meeting of the Rocky Mountain Section was held on April 29-30, 
1983 on the campus of Colorado State University in Ft. Collins, Colorado. There were 125 MAA 
members in attendance. 


Invited Lecture:


"Mathematics in 1983: Our Problems, Our Prospects, and our Constituency," by Gail Young, University
of Wyoming.


Panel Discussion: 


"Implications of MuMath on the Mathematics Curriculum," by A.R. Brown, Colorado School of Mines; 
William Dorn, University of Denver; Darel Hardy, Colorado State University; Gail Young, Univer- 
sity of Wyoming. 


Short Presentations: 


"Legal Protection of Computer Software: A Glitch in the System," by Antonette Logar, National Col- 
lege. 

"A Two-Year Curriculum Integrating Discrete and Continuous Mathematics," by Ronald E. Prather, 
University of Denver.

"Edge-labelled Trees," by Julie Yancey, Fort Lewis College.

"Splittings of a Definite Integral," by Hung Li, University of Southern Colorado.

"A Senior Design Course in Computer Science From the Students' Point of View," by Colleen Borstad,
South Dakota School of Mines and Technology.

"Finite Math for the Freshman Computer Science Student," by Karen Whitehead, South Dakota School
of Mines and Technology.

"Observable Differences Between Male and Female Computer Science Students," by James Sandau and
Sheri Kirley, South Dakota School of Mines and Technology.

"Demonstration of MuMath," by Aaron Meyerowitz, Colorado State University.

"Seven Notable Women Mathematicians," by Julia Ann Walker, Boulder, Colorado.

"The Jordan Curve Theorem," by David Lear, University of Colorado.

"A Mathematical Model Predicting Ultimate Crude Oil Production for the United States," by William 
Manzer, Western Wyoming College. 

"What are the Effects (If Any) of a Mathematics Placement Exam?" by Duane Porter, University of 
Wyoming. 

"A Moment on Moments," by Aubrey Owen, Community College of Denver. 

"A Technique for Directly Evaluating the Mean Time to First Failure for Nonreparable Electronic 
Systems with Active Redundancy," by John Garstka, Air Force Academy.

"Mathematical Learning Theory," by Edward De Francia, Fort Lewis College.

"Money-Math," by Ben Manvel, Colorado State University. 

"Extrema in Polar Coordinates," by Aubrey Owen, Community College of Denver. 

"Tackling a Ticklish Type of Tic-Tac-Toe (Or the Case of the [Almost] Total Tactics)," by Ira 
Rosenholtz, University of Wyoming. 

"Computer-Generated Insights into Number Theory Results," by Robert Fisk, Colorado School of Mines 
and Technology. 

"Integration: Why You Can and Why You Can’t," by Rick Miranda, Colorado State University. 

"Factoring Messy Trinomials," by Carl Kerns, Mesa College. 

"New-Wave Cryptography," by Richard Games, Colorado State University. 

"Service Courses: How the Engineers View What Mathematics Departments Provide," by David Ballew,
South Dakota School of Mines and Technology. 

"Zeros and Factors of Polynomials with Positive Coefficients," by William Briggs, University of 
Colorado. 

"The Van Meegeren Art Forgeries," by James Coler, University of Colorado. 


Kentucky Section 


The spring meeting of the Kentucky Section was held at Bellarmine College on April 8-9, 1983. 
Eighty-five people were in attendance. 


Invited Addresses:


"Why Mathematics is Useful," by Maynard Thompson, Indiana University.

"Applications of Mathematics in Bio-Medicine," by Maynard Thompson, Indiana University.

"Factorings in Graphs," by Gary Chartrand, Western Michigan University.


Invited Lectures:


"The Reconstruction Problem," by Don Greenwell, Eastern Kentucky University.

"Ramsey Theory," by Glenn Powers, Western Kentucky University.

"Directed Graphs," by Michael Jacobson, University of Louisville.

"Functional Obscurities of a Familiar Formula," by J.B. Barksdale, Western Kentucky University.

"Polya's Picture of Complex Functions," by Bart Braden, Northern Kentucky University.

"Subjective Probability and the Concept of Coherence," by Walter Ferbes, Bellarmine College.

"Microcomputer Courseware for Beginning Calculus," by Bill Shoalf, Murray State University.

"Number Theory and Public Key Cryptography," by Patrick Costello, Eastern Kentucky University.


Allegheny Mountain Section 


The Allegheny Mountain Section held its annual spring meeting at Indiana University of Pennsyl- 
vania in Indiana on April 29-30, 1983. 


Invited Lectures: 


"Curves of the Calculus: History and Application," by V.F. Rickey, Bowling Green University. 

"CUPM’s New Mathematical Sciences Curriculum Recommendations," by Alan Tucker, SUNY at Stony 
Brook. 

"Mathematics for a Computer Science Curriculum," by William Scherlis, Carnegie-Mellon University. 

"Coalitional Games and Various Applications," by William Lucas, Cornell University. 

"The Mathematical Sciences Curriculum K-12: What Is Still Fundamental and What Is Not," by Marcia 
Sward, Associate Director, MAA. 


Panel Discussion: 


"Putnam Exam: Ideas on Offering It and Taking It," Ron Harrell (Moderator), Allegheny College. 


Short Presentations:


"The Next Computer Science Course After Programming," by David Brown, Bethany College.

"Fibonacci Numbers and Finding Extrema," by John Atkins, West Virginia University. 

"Applications of Computer Graphics in Mathematics Instruction," by Roy Myers, Penn State Univer- 
sity at New Kensington. 


Student Presentations: 


"Variations on Conway’s Game of Life," by Tim Price, Allegheny College. 

"What Can We Say About Duffian Numbers?" by Rosemary Zbiek, Indiana University of Pennsylvania. 
"Tiling the Plane with Convex Polygons," by Kim Kratz, Allegheny College. 

"Amicable Numbers," by Mark Woodard, Indiana University of Pennsylvania.


Maryland-District of Columbia-Virginia Section


The spring meeting of the Maryland-District of Columbia-Virginia Section was held April 16, 
1983 at Thomas Nelson Community College in Hampton, Virginia. 


Invited Address:


"Hardware, Software, and Mathematics," by Jack Beidler, University of Scranton.


Short Presentations:


"Almost Isosceles Pythagorean Triples," by William Wardlaw, U.S. Naval Academy. 

"Environmental Mathematics," by Ben Fusaro, Salisbury State College. 

"Constructions in Non-Euclidean Geometry," by George Ivey, Gallaudet College. 

"What is a Sociology of Mathematics?" by J. Fang, Old Dominion University. 

"Wedderburn’s Finite Division Algebra Theorem: An Historical Perspective," by Karen Parshall, 
Sweet Briar College. 

"Rudiments of Computational Support Distribution Test Space Topology," by Steven Mallis, Eastern 
Shore Community College. 

"Adjacency Games/Squares," by Craig Bailey, U.S. Naval Academy. 

"Adjacency Games/Hexagons," by Mark Kidwell, U.S. Naval Academy. 

"License Plates and Phone Numbers--Statistics Classroom Learning Tools," by Edwin Landauer, Gen- 
eral Physics Corporation. 

"Tracking Algorithms Used with Search Radars," by Paul Ilacqua, J.S. Lee Associates. 

"Computing Solutions to the Billiard Ball Problem," by Brian Shelburne, Sweet Briar College. 


the following theorem is immediate.


THEOREM 4. Let $L_{2n}(y) = \sum_{k=0}^{2n} a_k y^{(k)}(x)$ where $a_k \in \mathbb{R}$, $k = 0, 1, \ldots, 2n$, and $a_{2n} \neq 0$. Then
$f(x) L_{2n}(y)$ is formally self adjoint if and only if:


(i) $f(x) = e^{a_{2n-1} x/(n a_{2n})}$,
and
(ii) for $n \geq 2$,


| 


(2 27) COD Baseadan tin 


a _ = 
2k—-1 du Ik- ] (j + l)n7Jtlazst} 


, k=1,2,...,(n—1). 


3. Examples. 
(1) $L(y) = y^{(4)} - 5y^{(3)} + 5y'' + 5y' - 6y$.


By Theorem 4, the symmetry factor is necessarily $f(x) = e^{-5x/2}$. However, condition (ii) of
Theorem 4 is not satisfied so $L(y)$ cannot be made formally self adjoint.


(2) $L(y) = y^{(4)} - 2y^{(3)} + y' + xy$.
According to Theorem 3, the symmetry factor is necessarily $f(x) = e^{-x}$. It is easy to check
that $e^{-x}$ also satisfies condition (ii) of Theorem 3 so $e^{-x}L(y)$ is formally self adjoint. Note that


$e^{-x}L(y) = (e^{-x}y'')'' - (e^{-x}y')' + xe^{-x}y$.
(3) $L(y) = x^2 y^{(4)} - (2x^2 - 4x) y^{(3)} + (x^2 - (2R + 6)x) y'' + ((2R + 2)x - 2R) y'$.
Using Theorem 3, it follows that $e^{-x}$ is a symmetry factor. Again, observe that
$e^{-x}L(y) = (x^2 e^{-x} y'')'' - (((2R + 2)x + 2) e^{-x} y')'$.

(4) L(y) = y® — Sy + 2y — y" + by’ + By. 
The symmetry factor must necessarily be $f(x) = e^{-5x/3}$. However, condition (ii) of Theorem 4 is
not satisfied so $L(y)$ cannot be made formally self adjoint.

Acknowledgements. The author would like to thank Professors H. L. Krall and A. M. Krall for their helpful 
comments and suggestions. 


The aim of this note is to show an efficient method of evaluating the definite integral


(1) $f(a) = \int_0^1 \frac{\log(x^2 - 2x\cos a + 1)}{x}\,dx$, $(0 \le a \le 2\pi)$.


Then the following three well-known (e.g., [1]) integrals of Euler can be evaluated immediately by
applying our result:


(2) $\int_0^1 \frac{\log(1 - x)}{x}\,dx = -\frac{\pi^2}{6}$,

(3) $\int_0^1 \frac{\log(1 + x)}{x}\,dx = \frac{\pi^2}{12}$,

(4) $\int_0^1 \frac{\log(1 + x^2)}{x}\,dx = \frac{\pi^2}{24}$.


Our method is rather unusual. We evaluate (1) by using solutions of a certain functional equation 
for a single variable and without using any of (2), (3), and (4). 


THEOREM. If f is defined by (1), then
(5) $f(a) = -(a - \pi)^2/2 + \pi^2/6$.


Proof. Apply the identity $(x^2 - 2x\cos a + 1)(x^2 + 2x\cos a + 1) = x^4 - 2x^2\cos 2a + 1$ to
obtain


$f(a/2) + f(\pi - a/2)$
$= \int_0^1 \{[\log(x^2 - 2x\cos(a/2) + 1)(x^2 + 2x\cos(a/2) + 1)]/x\}\,dx$

$= \int_0^1 \{[\log(x^4 - 2x^2\cos a + 1)]/x\}\,dx$,

which, with the transformation $x = \sqrt{t}$, implies

$(1/2)\int_0^1 \{[\log(t^2 - 2t\cos a + 1)]/t\}\,dt = (1/2) f(a)$.

Hence f defined by (1) satisfies the functional equation


(6) $f(a/2) + f(\pi - a/2) = f(a)/2$
for all a in the closed interval $[0, 2\pi]$.
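
The functional equation can be checked numerically before any closed form is known. The sketch below is only an illustration, not part of the original argument: it approximates the integral in (1) by composite Simpson quadrature and compares the two sides of the equation at a few sample values of a. The names `integrand` and `f_numeric`, the step count, and the sample points are assumptions chosen for the example; the endpoint values a = 0 and a = 2π are avoided because the integrand then has a logarithmic singularity at x = 1.

```python
import math

def integrand(x, a):
    """Integrand of (1); its limit as x -> 0 is -2*cos(a)."""
    if x == 0.0:
        return -2.0 * math.cos(a)
    # log1p keeps accuracy when x is small
    return math.log1p(x * x - 2.0 * x * math.cos(a)) / x

def f_numeric(a, n=4000):
    """Composite Simpson approximation of f(a) on [0, 1]; n must be even."""
    h = 1.0 / n
    total = integrand(0.0, a) + integrand(1.0, a)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * integrand(i * h, a)
    return total * h / 3.0

# Compare the two sides of f(a/2) + f(pi - a/2) = f(a)/2.
for a in (0.8, 1.5, 2.5, 4.0):
    lhs = f_numeric(a / 2) + f_numeric(math.pi - a / 2)
    rhs = f_numeric(a) / 2
    print(f"a = {a}: lhs = {lhs:.10f}, rhs = {rhs:.10f}")
```

The two printed columns should agree to many decimal places, which is consistent with the substitution argument given in the proof.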

Since, by a basic theorem on differentiation under the integral sign, f is twice continuously
differentiable, we differentiate (6) twice with respect to a to obtain


(7) $f''(a/2) + f''(\pi - a/2) = 2f''(a)$.


Further, $f''$ is a continuous function in the closed interval $[0, 2\pi]$. Hence $f''(a)$ has a maximum
value M in $[0, 2\pi]$ and a minimum value m in $[0, 2\pi]$. Suppose that $f''(a_0) = M$ for a fixed
$a_0 \in [0, 2\pi]$. Then by setting $a = a_0$ in (7) we have


(8) $f''(a_0/2) + f''(\pi - a_0/2) = 2f''(a_0) = 2M$.
However, both $a_0/2$ and $\pi - a_0/2$ lie in $[0, 2\pi]$, and

(9) $f''(a_0/2) \le M$, $f''(\pi - a_0/2) \le M$.
Hence we have

(10) $f''(a_0/2) + f''(\pi - a_0/2) \le 2M$.


So, by (8), (9), and (10) we must have $f''(a_0/2) = M$. By iteration $f''(a_0) = f''(a_0/2^n) = M$ and
by continuity it follows that


$\lim_{n \to \infty} f''(a_0/2^n) = f''(0) = M$.
Similarly, we see that $f''(0) = m$. Thus $M = m$. Therefore, $f''(a)$ is a constant function in $[0, 2\pi]$.
Let $f''(a) = \alpha$. Then f is given by the quadratic polynomial


(11) $f(a) = \alpha a^2/2 + \beta a + \gamma$.
Our final step is to determine the constants $\alpha$, $\beta$, and $\gamma$ explicitly. Substitute (11) in (6) and then
equate the coefficients of the term $a$ and the constant term to obtain


(12) $-\pi\alpha/2 = \beta/2$, $\pi^2\alpha/2 + \pi\beta + 2\gamma = \gamma/2$.


Notice that the terms in $a^2$ cancel, since both sides of (6) contribute $\alpha a^2/4$. By (1) and by a basic theorem on
differentiation under the integral sign we obtain $f'(\pi/2) = \pi/2$, which, with (11), implies

(13) $\pi\alpha/2 + \beta = \pi/2$.

It follows from (12) and (13) that $\alpha = -1$, $\beta = \pi$, and $\gamma = -\pi^2/3$. Thus, (11) yields (5). This
completes the proof of the Theorem.


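The values of the Euler integrals (2), (3) and (4) now follow from (1) and (5) by setting a equal
to 0, $\pi$, and $\pi/2$ respectively (for $a = 0$ and $a = \pi$ the integrand in (1) is $2\log(1 \mp x)/x$, so
$f(0)$ and $f(\pi)$ are twice the integrals (2) and (3)).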
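
As a sanity check on the closed form, the following sketch compares a numerical approximation of the integral (1) with the quadratic $-(a - \pi)^2/2 + \pi^2/6$, and then recovers the values $\pi^2/12$ and $\pi^2/24$ of the second and third Euler integrals. It is an illustration under stated assumptions, not the authors' method: the quadrature rule (composite Simpson), the function names, and the sample points are all choices made for this example, and a = 0 is left out because the integrand is then singular at x = 1.

```python
import math

def integrand(x, a):
    """Integrand of (1); its limit as x -> 0 is -2*cos(a)."""
    if x == 0.0:
        return -2.0 * math.cos(a)
    return math.log1p(x * x - 2.0 * x * math.cos(a)) / x

def f_numeric(a, n=4000):
    """Composite Simpson approximation of f(a) on [0, 1]; n must be even."""
    h = 1.0 / n
    total = integrand(0.0, a) + integrand(1.0, a)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * integrand(i * h, a)
    return total * h / 3.0

def f_closed(a):
    """Closed form (5) claimed by the Theorem."""
    return -(a - math.pi) ** 2 / 2.0 + math.pi ** 2 / 6.0

for a in (1.0, math.pi / 2, 2.5, math.pi, 5.0):
    print(f"a = {a:.4f}: numeric = {f_numeric(a):.10f}, closed = {f_closed(a):.10f}")

# Euler integrals (3) and (4): f(pi) is twice (3), and f(pi/2) is (4) itself.
euler_3 = f_numeric(math.pi) / 2.0
euler_4 = f_numeric(math.pi / 2)
print(euler_3, math.pi ** 2 / 12)
print(euler_4, math.pi ** 2 / 24)
```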


This work was supported in part by the Natural Sciences and Engineering Research Council of Canada Grant 
Nr. A-4012. 


A HOMOLOGY VERSION OF THE BORSUK-ULAM THEOREM


JAMES W. WALKER 
Department of Mathematics, California Institute of Technology, Pasadena, CA 91125 


An involution on a topological space X is a continuous map from X to X which is its own 
inverse. For example the antipodal map, which maps a point to the opposite end of the diameter 
on which it lies, is an involution on the n-sphere $S^n$.

Suppose X and Y are spaces equipped with involutions a and b, respectively. A map f from X 
to Y is equivariant if it respects the involutions, i.e., $b \circ f = f \circ a$.

One formulation of the Borsuk-Ulam theorem is that if m is greater than n, then there is no 
map from $S^m$ to $S^n$ which is equivariant with respect to the antipodal map. Many sources, for
example [1, § 7.2], include proofs of the Borsuk-Ulam theorem, as well as applications such as the 
“ham sandwich theorem.” We will use singular homology theory to prove a somewhat stronger 
theorem. 

Our stronger theorem shows that the existence of any equivariant map to $S^n$ from any space X
with an involution forces the existence of very special homology classes for X, so special that X 
could not be a sphere of dimension greater than n. 

A few words about terminology: An elementary 0-chain is a singular 0-simplex with coefficient 
1; loosely speaking, it’s just a single point. We will use reduced homology, which essentially means 
that we consider the empty set to be a singular simplex of dimension $-1$, which is the boundary
of every 0-simplex. It follows that $H_{-1}(X)$ vanishes unless X is empty, and $H_0(X)$ vanishes if X is
path connected. Recall that each continuous map f induces a chain map $f_\#$, defined by composing
f with singular simplices. In turn, such a chain map $f_\#$ induces a homology homomorphism $f_*$.


THEOREM. Suppose X is a space with involution v, and $g: X \to S^n$ is an equivariant map. Then
there exists an integer $j \le n$, and a homology class $\beta$ of $H_j(X; \mathbb{Z}/2)$ such that $\beta$ is nonzero and
$v_*(\beta) = \beta$. Furthermore, if no such $\beta$ exists for j less than n, then $\beta$ can be chosen such that $g_*(\beta)$ is
the nonzero element of $H_n(S^n; \mathbb{Z}/2)$.


homologous to either zero or $\theta h_n$. In either case, when we apply $\theta$, we find that $\theta h_n - g_\#\theta c_n$ is
homologous to zero. That is, $\theta h_n$ and $g_\#\theta c_n$ belong to the same homology class. Note that $\theta c_n$ is a
cycle, because $\partial\theta c_n = \theta\partial c_n = \theta\theta c_{n-1} = 0$. Therefore, if $\beta$ is the homology class of $\theta c_n$, then
$g_*(\beta)$ is the nonzero element of $H_n(S^n; \mathbb{Z}/2)$. It follows that $\beta$ is nonzero. Finally, the fact that
$\theta\theta c_n = 0$ means that $v_\#\theta c_n = \theta c_n$, so $v_*(\beta) = \beta$.


ON THE NUMBER OF MULTIPLICATIVE PARTITIONS


JOHN F. HUGHES 
Department of Mathematics, Bryn Mawr College, Bryn Mawr, PA 19010 


J. O. SHALLIT 
Department of Mathematics, University of California, Berkeley, CA 94720 


I. A Number-Theoretic Function. In this note we show that if $f(n)$ is the number of essentially
different factorizations of n, then


$f(n) \le 2n^{\sqrt{2}}$.


In considering numbers that have exactly k divisors, one is led to examine this function $f(n)$,
the number of ways to write n as the product of integers $\ge 2$, where we consider factorizations
that differ only in the order of the factors to be the same. We call these representations of n
multiplicative partitions. For example, $f(12) = 4$, since

$12 = 6 \cdot 2 = 4 \cdot 3 = 3 \cdot 2 \cdot 2$
are the four multiplicative partitions of 12. From these four representations, we can conclude that
a number has exactly 12 divisors if and only if its prime factorization is one of the following:


$p^{11}$, $p^5 q$, $p^3 q^2$, $p^2 q r$.
This follows from the expression for $\tau(n)$, the number of divisors of $n = p_1^{a_1} p_2^{a_2} \cdots p_k^{a_k}$:


$\tau(n) = \prod_{1 \le i \le k} (1 + a_i)$.


For example, see [1]. 

The behavior of f(n) is quite erratic, and apparently has not been previously studied in this 
form. We observe that if q is prime, then $f(q^k) = p(k)$, the number of additive partitions of k. 
Also, if $q_1, q_2, \ldots, q_k$ are distinct primes, then $f(q_1 q_2 \cdots q_k) = B(k)$, the kth Bell number. See 
[2]. 
More generally, $f(q_1^{a_1} \cdots q_k^{a_k})$ is the number of additive partitions of the “multi-partite 
number” $(a_1, a_2, \ldots, a_k)$, where addition is defined component-wise. See [3] for further details. 

We will show that 


$$f(n) \le 2n^{\sqrt{2}}. \tag{1}$$

For a table of f(n) for $1 \le n \le 100$, see the Appendix. 
II. Proof of the Main Result. To prove (1) we first define an auxiliary function: 


g(m,n) = the number of multiplicative partitions of n with all elements $\le m$. 


Clearly f(n) = g(n, n). We have the following 
THEOREM 1. 

$$g(m,n) = \sum_{\substack{d \mid n \\ d \le m}} g(d, n/d). \tag{2}$$


Proof. We define g(m, 1) = 1 and g(1, n) = 0 for $n \ne 1$. Let $n = a_1 a_2 \cdots a_k$ be a multiplicative 
partition of n with all factors $\le m$. Then we may assume the factors are arranged in 
decreasing order, so $a_1$ is the largest factor in the product. The number of ways to choose 
$a_2 \cdots a_k$ is therefore $g(a_1, n/a_1)$. But $a_1$ was unspecified, and therefore could be any divisor d of 
n such that $d \le m$. Summing over all such d gives the result. □ 
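
The recursion of Theorem 1 can be tried out directly. The following Python sketch is not part of the original note; the names `divisors`, `g`, and `f` are illustrative, and the sum is restricted to divisors of at least 2, which matches the conventions g(m, 1) = 1 and g(1, n) = 0 for n ≠ 1.

```python
from functools import lru_cache
from math import isqrt


def divisors(n):
    """Return all positive divisors of n in increasing order."""
    small = [d for d in range(1, isqrt(n) + 1) if n % d == 0]
    return small + [n // d for d in reversed(small) if d * d != n]


@lru_cache(maxsize=None)
def g(m, n):
    """Number of multiplicative partitions of n with every factor <= m."""
    if n == 1:
        return 1  # the empty product
    # Pick the largest factor d (d divides n, 2 <= d <= m); the remaining
    # factors form a multiplicative partition of n/d with factors <= d.
    return sum(g(d, n // d) for d in divisors(n) if 2 <= d <= m)


def f(n):
    """Number of multiplicative partitions of n."""
    return g(n, n)


print(f(12))            # the four partitions 12, 6*2, 4*3, 3*2*2
print(f(2 ** 5))        # compare with the number of additive partitions of 5
print(f(2 * 3 * 5 * 7)) # compare with the 4th Bell number
```

Memoizing `g` keeps the computation fast, since the same pairs (d, n/d) recur many times.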

From Theorem 1 we can obtain a simple estimate for g(m, n). 


THEOREM 2. 


$$g(m,n) \le mn.$$


Proof. The theorem is clearly true for m = 1 or n = 1. We will show it is true by induction on 
the product mn. Assume true for all m, n such that mn < MN, where $M \ge 2$. Then from Theorem 
1 we have 


$$g(M,N) = \sum_{\substack{d \mid N \\ d \le M}} g(d, N/d).$$


Since $d \cdot N/d = N < MN$, we may apply the induction hypothesis to the terms inside the 
summation. We find 


$$g(M,N) \le \sum_{\substack{d \mid N \\ d \le M}} d \cdot N/d = \sum_{\substack{d \mid N \\ d \le M}} N \le MN,$$


since the sum has at most M terms, and the theorem is true by induction. □ 

Theorem 2 gives the estimate $f(n) = g(n,n) \le n^2$. It is possible to improve this estimate, 
which we do in a moment. First we need three easy lemmas. 


LEMMA 3. 

$$g(a,b) \le g(b,b).$$


Proof. This follows immediately, since if $a \ge b$, we have equality, while if a < b, we are 
summing over fewer terms of equation (2). □ 

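LEMMA 4. Let 0 < c < 1. Then 


$$f(n) \le g(n^c, n) + \sum_{d=1}^{n^{1-c}} f(d).$$


Proof. 

$$\begin{aligned}
f(n) = g(n,n) &= \sum_{d \mid n} g(d, n/d) \\
&= \sum_{\substack{d \mid n \\ d \le n^c}} g(d, n/d) + \sum_{\substack{d \mid n \\ d > n^c}} g(d, n/d) \\
&\le g(n^c, n) + \sum_{\substack{d \mid n \\ d > n^c}} g(n/d, n/d) \quad \text{(by Theorem 1 and Lemma 3)} \\
&= g(n^c, n) + \sum_{\substack{d \mid n \\ d < n^{1-c}}} g(d,d) \\
&\le g(n^c, n) + \sum_{d=1}^{n^{1-c}} f(d),
\end{aligned}$$


which is the desired result. □ 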


LEMMA 5. Let a > 0. Then 


$$\sum_{i=1}^{k} i^a \le \frac{k^{a+1}}{a+1} + k^a.$$


Proof. This is easily proved by comparison with the integral $\int_0^k t^a \, dt$. 
We are now in a position to prove our main result. 


THEOREM 6. 

$$f(n) \le 2n^{\sqrt{2}}.$$


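Proof. The table in the Appendix shows the theorem is true for $n \le 69$. We will prove the 
theorem by induction on n. Assume $f(d) \le k d^{c+1}$ for d < n, where $n \ge 70$ and c and k are 
constants to be specified later. Then from Lemma 4 we have 


$$\begin{aligned}
f(n) &\le g(n^c, n) + \sum_{d=1}^{n^{1-c}} f(d) \\
&\le n^{c+1} + \sum_{d=1}^{n^{1-c}} f(d) \quad \text{(by Theorem 2)} \\
&\le n^{c+1} + k \sum_{d=1}^{n^{1-c}} d^{c+1} \quad \text{(by induction)} \\
&\le n^{c+1} + k \left( \frac{(n^{1-c})^{c+2}}{c+2} + (n^{1-c})^{c+1} \right) \quad \text{(by Lemma 5)}.
\end{aligned}$$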


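Now put k = 2 and $c = \sqrt{2} - 1$ to get 


$$\begin{aligned}
f(n) &\le n^{\sqrt{2}} + \frac{2}{\sqrt{2}+1} n^{\sqrt{2}} + 2n^{2(\sqrt{2}-1)} \\
&\le 2n^{\sqrt{2}},
\end{aligned}$$


since $2/(\sqrt{2}+1) < 5/6$ and $2n^{2(\sqrt{2}-1)} < \frac{1}{6} n^{\sqrt{2}}$ for $n \ge 70$. 
Our theorem is now proved by induction. □ 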
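
The arithmetic behind the choice k = 2, c = √2 − 1 can be checked numerically. The short Python sketch below is an illustration added here, not part of the original proof; the variable names are arbitrary. It confirms that the exponents collapse to √2 and 2(√2 − 1), that 2/(√2 + 1) < 5/6, and that the last inequality first holds at n = 70.

```python
import math

r2 = math.sqrt(2)
c, k = r2 - 1, 2

# Exponents produced by Lemma 5 with a = c + 1 and upper limit n**(1 - c).
main_exponent = (1 - c) * (c + 2)   # expected to equal sqrt(2)
tail_exponent = (1 - c) * (c + 1)   # expected to equal 2*(sqrt(2) - 1)
coefficient = k / (c + 2)           # equals 2/(sqrt(2) + 1)


def tail_is_small(n):
    """True when 2*n**(2*(sqrt(2)-1)) < n**sqrt(2) / 6."""
    return 2 * n ** (2 * (r2 - 1)) < n ** r2 / 6


print(main_exponent, tail_exponent, coefficient)
print(tail_is_small(69), tail_is_small(70))
```

With both estimates in hand, the three terms add up to less than (1 + 5/6 + 1/6) n^{√2} = 2n^{√2}, which is why the induction needs the table only up to n = 69.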


III. Two Conjectures. Numerical evidence seems to indicate that the exponent $\sqrt{2}$ in Theorem 
6 is too large. We make two conjectures; the second is more doubtful. 


CONJECTURE 1. 


$$f(n) \le n.$$


CONJECTURE 2. 


$$f(n) \le \frac{n}{\log n} \quad \text{for } n \ne 144.$$


Both these conjectures have been verified by computer for n < 10,000. 
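
A check of this kind is easy to repeat on a small range. The Python sketch below is an illustration only, not the authors' program: the cutoff `LIMIT = 1000` is an arbitrary choice, `log` is assumed to be the natural logarithm (the reading under which 144 is an exception), and `g` uses the recursion of Theorem 1 restricted to divisors of at least 2.

```python
import math
from functools import lru_cache


@lru_cache(maxsize=None)
def g(m, n):
    """Multiplicative partitions of n with every factor <= m."""
    if n == 1:
        return 1
    return sum(g(d, n // d) for d in range(2, min(m, n) + 1) if n % d == 0)


def f(n):
    return g(n, n)


LIMIT = 1000  # assumed cutoff for this sketch; the note reports n <= 10,000

# Conjecture 1: f(n) <= n.
conjecture1_failures = [n for n in range(1, LIMIT + 1) if f(n) > n]

# Conjecture 2: f(n) <= n / log n (natural log assumed), starting at n = 2
# because log 1 = 0.
conjecture2_failures = [n for n in range(2, LIMIT + 1) if f(n) > n / math.log(n)]

print(conjecture1_failures)
print(conjecture2_failures)
print(f(144), 144 / math.log(144))
```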


Appendix 

ANSWERS TO PHOTOS ON PAGE 437 


No, they are partial. They are two of the most famous partial differential equators in the world. 
Top: Lars Hörmander of Lund; bottom: Olga Ladyženskaja of Leningrad. 

COMMENTS AND COMPLEMENTS 


DEBORAH AND FRANKLIN TEPPER HAIMO 
Department of Mathematical Sciences, University of Missouri, St. Louis, MO 63121 


Members of our alert readership periodically communicate to us comments about various 
papers published in the Notes sections. They identify errors, supply added references, provide 
alternative proofs, introduce historical supplements, point out duplications in the literature, and 
bring various other items of interest to our attention. Having relinquished our editorial responsi- 
bilities, we take this opportunity to share correspondence of general interest that has accumulated 
over recent months. 


Combinatorics. John R. Hunt of the University of Wisconsin in Menomonie points out that on 
page 630 of the paper, “Infinite products for kth roots,” by N. J. Fine (this MONTHLY, 84 (1977) 
629-630), the number appearing in equation (10) as 140541, found by the recursive relation 
$q_{n+1} = q_n^3 + 3q_n^2 - 3$, should be 140451. 
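
The corrected value is easy to confirm if the recursion is read as q_{n+1} = q_n^3 + 3q_n^2 − 3. In the Python sketch below the function name `next_q` is illustrative, and the starting value 3 is an assumption made only because it leads to 51 and then to the number in question; it is not taken from Fine's paper.

```python
def next_q(q):
    """One step of the recursion q -> q**3 + 3*q**2 - 3."""
    return q ** 3 + 3 * q ** 2 - 3


q = 3  # assumed starting value, chosen for illustration
for _ in range(2):
    q = next_q(q)
    print(q)
```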

David Zeitlin of Minneapolis, Minnesota, writes that his abstract in the Notices of the AMS, 26 
(1969) A-527, entitled “T(N), the number of incongruent triangles with integer sides and 
perimeter N, is a solution of a homogeneous, linear difference equation of minimal order nine 
(9),” contains the result that T(N) satisfies the equation 


$$T(N+9) = T(N+7) + T(N+6) + T(N+5) - T(N+4) - T(N+3) - T(N+2) + T(N),$$

for N = 0,1,2,.... A generating function for T(N) is given by 


$$(x^2 - 1)^2 (x^2 + 1)(1 - x^3) \sum_{N=0}^{\infty} T(N) x^N = x^3.$$
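
The difference equation can be checked against a direct count. The following Python sketch is added for illustration and is not from Zeitlin's abstract; it assumes that T(N) counts nondegenerate triangles with integer sides a ≤ b ≤ c and perimeter N, and it tests the order-nine recurrence on an arbitrarily chosen initial range.

```python
def T(N):
    """Count integer triangles a <= b <= c with a + b + c = N and a + b > c."""
    count = 0
    for a in range(1, N // 3 + 1):
        # b <= (N - a) / 2 guarantees c = N - a - b >= b.
        for b in range(a, (N - a) // 2 + 1):
            c = N - a - b
            if a + b > c:
                count += 1
    return count


def recurrence_holds(N):
    """Check T(N+9) = T(N+7)+T(N+6)+T(N+5)-T(N+4)-T(N+3)-T(N+2)+T(N)."""
    rhs = (T(N + 7) + T(N + 6) + T(N + 5)
           - T(N + 4) - T(N + 3) - T(N + 2) + T(N))
    return T(N + 9) == rhs


print([T(N) for N in range(16)])
print(all(recurrence_holds(N) for N in range(60)))
```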


His comments are in reference to the statement in the paper, “Triangles with integer sides,” by 
J. H. Jordan, Ray Walch, and R. J. Wisner (this MONTHLY, 86 (1979) 686-689), that T(N) 
“seems confusing if not intractable.” A further remark is that, on page 188 of L. Dickson’s 
“History of the Theory of Numbers” (Chelsea, N.Y., 1952), a 1908 paper by R. W. D. Christie is 
cited that deals with a related problem of dissimilar rational triangles of equal perimeter. 

In the paper, “Iterated binomial coefficients,” by Solomon W. Golomb (this MONTHLY, 87 
(1980) 719-727), the author states that “no simple reduction formulae have yet been found for the 
most general case of $\binom{\binom{n}{a}}{b}$.” According to David Zeitlin, this is not the case since, in the 


Abstracts of the AMS, 2 (1981) 290, Zeitlin’s paper, “General reduction formulas for iterated 
binomial coefficients,” solves the problem. When Golomb’s attention was called to the Zeitlin 
abstract, he commented that he would hesitate to label Zeitlin’s results as ‘“‘ the solution.”’ He notes 
that there are many reduction formulas and Zeitlin’s fail to be simple in the sense that the 
coefficient C, and D, are themselves complicated summations involving both two-tiered and 
three-tiered binomial coefficients. He acknowledges, however, that he has not seen anything 
simpler. 

V. N. Murty of the Pennsylvania State University cites G. Chrystal’s “Textbook of Algebra” 
(Chelsea, vol. 2, 7th ed., 1964), where the results appearing in Gabriel Klambauer’s paper, 
“Summation of series” (this MONTHLY, 87 (1980) 128-130), are also given. In particular, the 
author’s examples illustrating technique I are included in Chrystal’s text on page 336, and the 
technique, known as early as 1870, is due to Laisant. Further, technique II of the paper is given in 
Chrystal’s Chapter XXXI, page 419, section 12. Murty comments that Chrystal has an excellent 
collection of methods of summing infinite series using elementary methods. 


Number Theory. B. M. Stewart of Okemos, Michigan, in commenting on C. M. Cordes’ paper, 
“Permutations mod m in the form $x^n$” (this MONTHLY, 83 (1976) 32-33), suggests the alternate 
formulation of the problem as follows: “Find the smallest s > 0 so that each of the congruences 
$x^s \equiv a \bmod m$ for $a = 0, 1, \ldots, m-1$ has exactly one solution.” He remarks that the existence of 
s is contingent upon m being square free. In that case, if $m = p_1 \cdots p_r$ for an ascending set of 
primes $p_i$, then the least s must be the smallest prime q such that $(q, (p_1 - 1) \cdots (p_r - 1)) = 1$. 
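
Stewart's description can be compared with a brute-force search. The Python sketch below is illustrative only: the function names are invented here, and the search starts at s = 2, an assumption made because s = 1 trivially gives a permutation.

```python
from math import gcd


def is_prime(q):
    """Trial-division primality test, adequate for small q."""
    return q >= 2 and all(q % d for d in range(2, int(q ** 0.5) + 1))


def is_permutation_power(s, m):
    """True if x -> x**s mod m permutes the residues 0, 1, ..., m-1."""
    return len({pow(x, s, m) for x in range(m)}) == m


def least_exponent_brute(m, limit=200):
    """Smallest s with 2 <= s < limit giving a permutation, or None."""
    for s in range(2, limit):
        if is_permutation_power(s, m):
            return s
    return None


def least_exponent_formula(primes):
    """Smallest prime q coprime to the product of (p - 1) over the primes."""
    product = 1
    for p in primes:
        product *= p - 1
    q = 2
    while not (is_prime(q) and gcd(q, product) == 1):
        q += 1
    return q


for primes in ([2], [7], [3, 5], [3, 7], [2, 3, 5]):
    m = 1
    for p in primes:
        m *= p
    print(m, least_exponent_brute(m), least_exponent_formula(primes))

print(least_exponent_brute(4))  # m = 4 is not square free
```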

David Zeitlin points out Moshe Lotan’s “A problem in difference sets” (this MONTHLY, 56 
(1949) 535-541) as a useful reference for Robert Miller’s “A game with n numbers” (this 
MONTHLY, 85 (1978) 183-185), and cites, in addition, the Italian journal, Periodico di Matematiche 
(17 (1937) 25-30). 

Referring to a comment attributed to Ray Davis and noted on page 837 of the “‘Comments and 
Complements” article (this MONTHLY, 86 (1979)), Joseph Berlau of Hartsdale, New York, 
questioned, and Ralph Boas of Northwestern University clarified the meaning of 300.999508 as an 
irrational number that starts with these digits. 


Algebra. While commenting that Phyllis Joan Cassidy’s note, “Products of commutators are 
not always commutators: an example” (this MONTHLY, 86 (1979) 772) “gives the most elegant 
proof [he] know[s], even for the case n = 1, of the following result: There exists a group G such 
that, for all n, not every element of the commutator subgroup can be written as a product of n 
commutators,” Roger Lyndon of the University of Michigan is concerned that “the very modest 
list of references given might mislead some readers in that it does not indicate the rather 
considerable amount of work that has been published on this and related problems.” He cites two 
examples. In his book with Paul E. Schupp, “Combinatorial group theory” (Ergebnisse der Math., 
89, Springer Verlag, Berlin, Heidelberg, New York, 1977), on page 55, it is stated that a certain 
free group G has the property mentioned above; and, in the paper by Basil Jordan, Robert M. 
Guralnick, and Michael D. Miller “On cyclic commutator subgroups” (Aequationes Math., 17 
(1978) 112—113), it is stated that Guralnick has constructed a group G of order 96 in which not 
every element of the commutator subgroup is a commutator, and that this is the smallest order for 
such a group. Asserting that the problem considered can be put under the general heading of 
Equations in Groups, Lyndon points out that some notes on this subject include those of 
J. Mycielski “Can one solve equations in groups?” (this MONTHLY, 84 (1977) 723-726), and 
“Equations unsolvable in $GL_2(C)$ and related problems” (this MONTHLY, 85 (1978) 263-265), and 
his own paper “Equations in groups,” with an extensive bibliography, especially on the question 
of products of commutators, to appear in Boletin da Soc. Brasileira da Math, 11 (1980). 

Raphael M. Robinson of the University of California, Berkeley, informs us that his research 
problem, “Solutions of an equation in abelian groups” (this MONTHLY, 86 (1979) 690), has 
generated five solutions. One, by Sidney C. Garrison, Martin R. Perlet, and Stephen M. Gagola, 
all of Texas A and M University (this MONTHLY, 88 (1981) 195-196), uses dual groups, and 
explicitly reaches the conclusion 


$$\sum_{i=1}^{n} \frac{1}{|g_i|} \ge 1, \quad n \ge 1,$$

for the equation $\prod_{i=1}^{n} (1 - g_i) = 0$ in abelian groups, where $|g_i|$ is the order of $g_i$. Other similar 
solutions were submitted by Alfred W. Hales of the University of California, Los Angeles, Masao 
Kiyota of the University of Tokyo, and Geoffrey R. Robinson of Coventry, England. Ki Hang 
Kim and Fred W. Roush of Alabama State University proposed a solution referring to group 
representations. 


Geometry. Leon Gerber of St. John’s University in Jamaica, New York, provides historical 
perspective for some published notes on geometry. As references for Dixon Jones’s notes, 
“Quadrangles, butterflies, Pascal hexagons, and projective fixed points” (this MONTHLY, 87 (1980) 
197-200) he cites “Poncelet’s theorems, ‘Traité des propriétés projectives des figures’” (Paris, 
1822, article 513), and page 147 of Cremona’s “Elements of Projective Geometry” (1913). 

Historical sources for the 3-dimensional version of E. Snapper’s “Affine generalization of the 
Euler line” (this MONTHLY, 88 (1981) 196-198), according to Leon Gerber, include Educational Times 
Reprints, 19 (1873) 38-39 and N. A. Court’s “Modern Pure Solid Geometry,” Chelsea, 2nd ed., 
1964, as Ex. 11 on page 121. Further, an n-dimensional version is on page 85 of geometry notes 
Gerber himself published in 1975 for use in a seminar at St. John’s University. 

Commenting on Walter Stromquist’s paper, “How to cut a cake fairly” (this MONTHLY, 87 
(1980) 640-644), A. Keith Austin of the University of Sheffield, England, writes that “the 
algorithm on page 641 fails in the case where the knife of the player who shouted ‘cut’ is not the 
center knife. In this case, the player who shouted is not satisfied about the piece received by one 
of the other players, as he thinks it is too large.” On being apprized of this observation, Walter 
Stromquist writes that Dr. Austin is quite right in this respect: “If a player shouts ‘cut’ on the 
assumption that the cake will be cut by the sword and his own knife, then he may feel slighted if 
the cut is made by another player’s knife instead. (He will get 1/3 of the cake, but as [was pointed 
out], he will regard another piece as larger.) I should have made clear that, when deciding when to 
say ‘cut,’ each player should watch the sword and the middle knife. That is actually why the knives 
are manipulated so publicly. In this way, the player can delay shouting until he will be satisfied by 
the left piece as it actually will be cut.” 


Analysis. A note from John P. Hoyt of Lancaster, Pennsylvania, expresses surprise that the 
editorial staff of the MONTHLY permitted the deletion of the “De” from De Moivre’s name in 
J. van Yzeren’s note “Moivre’s and Fresnel’s integrals by simple integration” (this MONTHLY, 86 
(1979) 691-693). “What is so ‘difficult’ about ‘De Moivre’? ‘De Moivre’ it has been, ‘De Moivre’ 
let it be.” He points out that even such a nonmathematical source as The Encyclopedia Britannica 
refers to “De Moivre.” Noting that the “van” has been included in the author’s name, he asks 
plaintively whether De Moivre is “to suffer the fate of ‘Tchebycheff and L’Hopital’ [sic].” 

Hans Samelson writes with some embarrassment that it has come to his attention that his proof 
for Rolle’s theorem appearing in his paper, “On Rolle’s theorem” (this MONTHLY, 86 (1979) 486), 
is identical with that given by R. J. Easton and S. G. Wayment in their paper, “The sliding 
interval technique” (this MONTHLY, 75 (1968) 886-888). He adds, further, that these authors gave 
a generalization of the theorem to $E^n$ (this MONTHLY, 77 (1970) 170-172) also. These facts 
escaped not only the author but three knowledgeable referees and the editors as well. 

Providing an addendum to his note, “An ultimate proof of Rolle’s theorem,” Alexander Abian 
states that “lines 16 to 13 from the bottom of page 485 (this MONTHLY, 86 (1979)) should be 
replaced by the following, which extends almost verbatim [the] proof of $c \in (a_0, b_0)$ to $c \in (a_i, b_i)$ 
for every $i \in \omega$: ... [For,] from (1) and (2) we have $g(m_{i+k}) \ge \max\{g(a_i), g(b_i)\}$ for every k. 
Now, if $g(m_{i+k}) > \max\{g(a_i), g(b_i)\}$ for some k, then from (6) and (9) it follows that $g(c) > \max\{g(a_i), g(b_i)\}$, 
which implies $a_i \ne c \ne b_i$ and therefore $c \in (a_i, b_i)$. On the other hand, if 
$g(m_{i+k}) = \max\{g(a_i), g(b_i)\}$ for every k, then from (2) and (8) it follows that $c = m_i$, which 
again implies $c \in (a_i, b_i)$.” 

David Zeitlin cites page 12 of I. J. Schwatt’s “An Introduction to the Operation with Series” 
(second edition (reprint of the 1924 first edition), Chelsea, New York), which includes a formula 
related to that contained in Steven Roman’s paper, “The formula of Faà di Bruno” (this 
MONTHLY, 87 (1980) 805-809). The companion result is an expansion for $D_t^n g(f(t))$ which, 
according to Zeitlin, is easier to use for computational purposes than Faà di Bruno’s formula. 

Colin C. Graham of Northwestern University, commenting on A. M. Russell’s paper, “A 
Commutative Banach algebra of functions of bounded variation,” points out that Nina Bary in a 
paper in the Mathematische Annalen in 1930 knew that BV, and hence BV*, are closed under 
multiplication. Though she did not state the norm inequality used by Russell, that inequality has 
been known at least since 1957, when it appeared in a paper by Hewitt and Zuckerman in the 
Pacific Journal of Mathematics (7 (1957) 913-941). 

Robert S. Doran of Texas Christian University writes that not only is W. E. Pfaffenberger’s 
Theorem 1 in his paper, “A converse to a completeness theorem” (this MONTHLY, 87 (1980) 216), 
well known, as the author acknowledges, but that Theorem 2 is also well known. He cites Seymour 
Goldberg’s book, Unbounded Linear Operators (McGraw-Hill, 1966), where, in Corollary 1.5.8, 
page 21, a simpler proof is given, and Sterling Berberian’s “Introduction to Hilbert Space,” 
(Oxford, 1961), where on page 193 a short, elegant solution is given to the result stated as Exercise 
12 on page 108. 

The fact that Theorem 2 is well known was also pointed out by Chaitan P. Gupta of Northern 
Illinois University and by A. Venugopalan of Calicut, India. 

Joel Brenner of Palo Alto, California, points out that appealing to the method that Adolf 
Cusmariu used in his note, “A proof of the arithmetic mean-geometric mean inequality” (this 
MONTHLY, 88 (1981) 192-194), one can prove that, for positive real numbers $x_1, \ldots, x_n$, not all 
equal, the inequality 


$$(n-1)(x_1 + \cdots + x_n)^2 > 2n(x_1 x_2 + \cdots + x_{n-1} x_n)$$


holds. 
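
A quick numerical check makes the inequality plausible. The Python sketch below is an added illustration and is not Cusmariu's method; it assumes that the right-hand sum runs over all products x_i x_j with i < j, in which case the difference of the two sides equals the sum of the squares (x_i − x_j)^2 over the same pairs.

```python
import random
from itertools import combinations


def lhs(xs):
    """(n - 1) * (x_1 + ... + x_n)**2."""
    return (len(xs) - 1) * sum(xs) ** 2


def rhs(xs):
    """2n times the sum of x_i * x_j over all pairs i < j (assumed reading)."""
    return 2 * len(xs) * sum(a * b for a, b in combinations(xs, 2))


def squared_gaps(xs):
    """Sum of (x_i - x_j)**2 over all pairs i < j."""
    return sum((a - b) ** 2 for a, b in combinations(xs, 2))


random.seed(0)
for _ in range(1000):
    xs = [random.randint(1, 50) for _ in range(random.randint(2, 8))]
    # The two sides differ by a sum of squares, so the inequality is strict
    # exactly when the x_i are not all equal.
    assert lhs(xs) - rhs(xs) == squared_gaps(xs)
    if len(set(xs)) > 1:
        assert lhs(xs) > rhs(xs)

print(lhs([1, 2, 3]), rhs([1, 2, 3]))
```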

Commenting on U. V. Satyanarayana’s paper, “A note on Riemann-Stieltjes integrals” (this 
MONTHLY, 87 (1980) 477-478), Roy O. Davies, of the University of Leicester, writes that, 
“essentially the same proof was given for essentially the same result (Riemann integrals instead of 
Riemann-Stieltjes but the extension is trivial) in Mathematical Note 2970, ‘A fundamental 
inequality for integrals,’ by J. St.-C. L. Sinnadurai [Mathematical Gazette, 45 (1961) 235-236; 
corrigenda ibid 46 (1962) 143].” That observation also was made both by the latter author himself 
and by D. F. Scrimshaw of Brunel University. Max L. Weiss, of the University of California, 
Santa Barbara, provides the background information that Satyanarayana’s interest in and proof of 
the theorem in question originated years ago as a consequence of “a loose end in one of my 
classes in a term paper concerning line integrals in connection with Cauchy’s Theorem.” He 
further notes that “Sinnadurai’s proof is mathematically superior to Satyanarayana’s. The former 
proves an interesting lemma about positive functions from which the result follows easily, while 
the latter involves the definition of the integral from the beginning.” 

On having the Sinnadurai paper called to his attention, Satyanarayana reached the similar 
conclusion that the former had a “more elegant” proof, but commented that “the method of my 
proof seems to be the most natural one any student (with proper preparation) would also come up 
with ... and the emphasis of my note is on pedagogy.” 

David Zeitlin writes that in his paper “An inequality for a class of polynomials” (Fibonacci 
Quarterly, 16 (1978) 128-129, 146, 151), he generalizes equation (18) on page 29 of the paper 
“Inequalities and identities for sums and integrals” by M. S. Klamkin and D. J. Newman (this 
MONTHLY, 83 (1976) 26-30). 

In commenting on J. M. Henle’s note, “Functions with arbitrarily small periods” (this 
MONTHLY, 87 (1980) 816), Ralph Boas of Northwestern University provides yet another proof of 
Burstin’s theorem that a Lebesgue-measurable function having arbitrarily small periods is con- 
stant a.e. He notes that if y is a period of a bounded, measurable function having dense periods, 
then 


$$\int_0^x f(t+y) \, dt = \int_y^{x+y} f(t) \, dt = \int_0^x f(t) \, dt.$$

If, now, 


$$F(x) = \int_0^x f(t) \, dt,$$


then F is a continuous solution of F(x + y) = F(x) + F(y), for all x and a dense set of y’s. 
Hence 


$$F(x) = Ax = \int_0^x A \, dt,$$

so that 

$$\int_0^x [A - f(t)] \, dt = 0.$$

It follows that 


$$\int_a^b [A - f(t)] \, dt = 0$$

for every (a, b), and $A - f(t) = 0$ a.e. Henle’s remark on seeing this proof was that it was an 
“interesting proof, though not self-contained.” 

Bruce R. Johnson of the University of Victoria, British Columbia, writes that he has just been 
alerted to the Lajos Takacs paper, “On the method of inclusion and exclusion,” in the Journal of 
the American Statistical Association, 62 (1967) 102-113. It contains, he points out, the same proof 
as that of his note, “An elementary proof of inclusion-exclusion formulas to real or complex-valued, 
finitely additive set functions.” An extensive list of references on the subject is cited in the 
Takacs paper. 

Julian Hennefeld’s question in his paper, “A nontopological proof of the uniform boundedness 
theorem” (this MONTHLY, 87 (1980) 217) about why Hausdorff’s proof “fared so poorly in 
comparison to rival proofs” and his remark that “textbooks in analysis and functional analysis 
give only the Baire category proofs without any mention of an elementary proof in the text or 
even as an exercise” prompted Béla Szőkefalvi-Nagy of Institutum Bolyaianum Universitatis, 
Hungary, to respond that this observation is not completely accurate. Essentially the same 
nontopological proof appears in section 31 of Riesz-Sz.-Nagy’s “Functional Analysis” (Ungar, 
1955), but relies directly on an 1897 paper by Osgood, except for obvious changes needed to avoid 
topology beyond the completeness property of the space. In section 87 of that text, Hausdorff’s 
work also is referred to. Sz.-Nagy urges young mathematicians not to overlook classical authors 
like Osgood, for example. 

Albert Wilansky of Lehigh University supplements the reference above with G. G. Lorentz’s 
“Approximation of Functions” (Holt, Rinehart, Winston, 1966), where on page 95 a proof 
without category appears. He notes, further, that Hilbert-space proofs are included in S. S. 
Holland’s note, “A Hilbert space proof of the Banach-Steinhaus theorem” (this MONTHLY, 76 
(1969) 40-41) and in Stone and Tamarkin’s paper, “Elementary proofs of some known theorems 
of the theory of complex euclidean spaces” (Duke Mathematical Journal, 3 (1937) 294-302). In his 
own book, “Modern Methods in Topological Vector Spaces” (McGraw-Hill, 1978), Wilansky uses 
the category argument to prove his Theorem 3-3-6 because it is brief and he wants to move on; 
because it motivates the idea of barrelled space; and because the theorem holds for all spaces of 
second category (not necessarily complete ones). The modern trend toward proving functional 
analysis results by elementary methods prompted Wilansky’s colleague, G. A. Stengle, to remark, 
“That’s all very well, but does it generalize?” 

William C. Waterhouse of the Pennsylvania State University points out that the main theorem 
in H. Hueber’s note, “On uniform continuity and compactness in metric spaces” (this MONTHLY, 
88 (1981) 204-205), was proved earlier in M. Atsuji’s paper, “Uniform continuity of continuous 
functions of metric spaces” (Pacific J. Math., 8 (1958) 11-16), and that further results on spaces 
where all continuous functions are uniformly continuous can be found in J. Rainwater’s “Spaces 
whose finest uniformity is metric” (Pacific J. Math., 9 (1959) 567-570) and in his own note, “On 
UC spaces” (this MONTHLY, 72 (1965) 634-635). 

Referring to the same paper, Angus E. Taylor writes that on page 204 Edwin Hewitt is given 
credit for answering a certain question in 1948 where, in actuality, the result dates to the thesis of 
Maurice Fréchet “who invented metric spaces and originated the term ‘compact.’ Fréchet called a 
set S compact if every infinite subset of S has a limit point (in the space, not necessarily in S). In 
his thesis, Fréchet showed that every closed and compact set in a metric space has the Borel 
property (that from every denumerable open covering of the set one can extract a finite open 
covering). Then he proved a theorem that contains the result that if a metric space is not compact, 
one can define on it a real continuous function that is not bounded. The full theorem, on page 31 
of his published thesis, is that a set S in a metric space is closed and compact if and only if every 
continuous real function on S is bounded and attains its least upper bound. In Fréchet’s very first 
paper on abstract point-set topology, for an L-space (more general than a metric space), he proved 
that a continuous real function on a closed and compact set in such a space is necessarily bounded 
and attains its least upper bound (Comptes Rendus, 139 (1904) 848-850).” 

Finally, Albert Wilansky of Lehigh University notes that, although his book, “Topology for 
Analysis” (Wiley, 1970), contains Hueber’s results, he himself copied the results from articles of 
Mrowka, Waterhouse, and Atsuji, and that MONTHLY notes “are intended to bring such results to 
the attention of readers who do not consult the research literature.” 

When the observations above were made known to him, H. Hueber responded that, not only 
he, but several noted mathematicians who had read his paper before it was submitted were 
unaware of these results. That, of course, is true also of the two knowledgeable referees consulted 
by the editors. He provided the added information that his proof that (i) $\Rightarrow$ (ii) of Theorem 1 
contains a misleading argument, which arose as a consequence of his attempt to find the shortest 
and most elegant formulation of the theorem. He points out that, where he wrote that “as k tends 
to infinity, $d(x_{n_k}, y_{n_k})$ tends to zero, and hence $(f(x_{n_k}) - f(y_{n_k}))/d(x_{n_k}, y_{n_k})$ tends to infinity,” 
which is correct but misleading, it would have been better to write, “as k tends to infinity 
$d(x_{n_k}, y_{n_k})$ tends to zero whereas $f(x_{n_k}) - f(y_{n_k}) \ge 1$.” 

David Zeitlin writes that his paper, “A derivation of the general solution for homogeneous, 
linear difference equations with constant coefficients” (this MONTHLY, 68 (1961) 369-370), treats, 
from a slightly different point of view, the same problem as that in Albert Nijenhuis’ note, 
“Complete solutions of linear difference equations” (this MONTHLY, 87 (1980) 658-660). On 
being alerted to Zeitlin’s result, which he had not been aware of, Nijenhuis’ response was that 
“Zeitlin’s theorem and its proof ... states an interesting relationship between difference equations 
and differential equations, but it does not explicitly address itself to a proof that [the solution 
given in my note] is, in fact, the general solution. No doubt such a proof is easily obtained, and 
Zeitlin can produce one at a moment’s notice—quite possibly using some of the lines of 
computation in the paper.” 

B. M. Stewart of Michigan State University notes that he had anticipated several of the results 
in the article by J. C. Hintz (this MONTHLY, 87 (1980) 212-215) in a paper, “Sums of functions of 
digits,” which appeared in the Canadian Journal of Mathematics (12 (1960) 374-389). 

We have heard from Donald E. G. Malm of Oakland University that the algorithm for solving 
a linear diophantine equation given in Stanley Kertzner’s note, “The linear Diophantine equation” 
(this MONTHLY, 88 (1981) 200-203), appears in a number of other sources including W. A. 
Blankinship’s paper, “A new version of the Euclidean algorithm” (this MONTHLY, 70 (1963) 
742-745), and Paul Schaefer’s, “An algorithm for finding the G.C.D. of a finite set of integers” 
(Delta, 5 (1975) 19-27), as well as his own text, “A Computer Laboratory Manual for Number 
Theory” (student manual) (COMPress (1980) 89-95). In the latter, there is an extension of the 
algorithm to solve systems of simultaneous linear diophantine equations, together with programs 
in BASIC implementing the algorithm. 

Stanley Kertzner wrote that he is chagrined to learn that, in checking only the text rather than 
the journal literature as well, he missed both the Schaefer and Blankinship papers. Since he had 
submitted his paper in December of 1979, however, the Malm Manual was not yet available. 
Subsequently, he has developed a sharpened procedure to solve linear integer matrix equations of 
the form AX = B for all integer solutions, extending the algorithm that he had generalized to a 
system of diophantine equations. 

That Blankinship’s algorithm is not in popular use was decried by E. F. Assmus, Jr., of Lehigh 
University who noted that Daniel A. Marcus’s “An alternative to Euclid’s algorithm” (this 
MONTHLY, 88 (1981) 280-283), is far more complicated than the earlier published version. 
Marcus, on the other hand, objects to the characterization of Blankinship’s algorithm as “simpler,” 
arguing that the two are significantly different, with the Blankinship algorithm having the 
advantage of applying to arbitrarily many integers rather than just two, of requiring no 
backtracking, and of relying on the familiar concept of row operations. His own algorithm, on the 
other hand, has the advantage of requiring that only the sequence of remainders be recorded and 
of producing the solutions by a single string of operations. 
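
The row-operation idea mentioned above can be sketched as follows. This Python illustration is written in the spirit of that description and is not taken from the Blankinship, Marcus, or Kertzner papers; the function name `gcd_with_coefficients` and the pivot rule (always reduce by the smallest nonzero entry) are choices made here. Each row carries a value together with the integer coefficients expressing it in terms of the inputs.

```python
def gcd_with_coefficients(values):
    """Return (g, coeffs) with g = gcd(values) and sum(c * v) == g."""
    n = len(values)
    # Row i starts as |v_i| together with the coefficient vector +/- e_i,
    # so that value == sum(coeff_j * values_j) holds for every row.
    rows = [
        [abs(v), [(1 if v >= 0 else -1) if j == i else 0 for j in range(n)]]
        for i, v in enumerate(values)
    ]
    while True:
        nonzero = [row for row in rows if row[0] != 0]
        if len(nonzero) <= 1:
            break
        pivot = min(nonzero, key=lambda row: row[0])
        for row in nonzero:
            if row is pivot:
                continue
            # Row operation: subtract a multiple of the pivot row.
            q = row[0] // pivot[0]
            row[0] -= q * pivot[0]
            row[1] = [a - q * b for a, b in zip(row[1], pivot[1])]
    if not nonzero:
        return 0, [0] * n
    return nonzero[0][0], nonzero[0][1]


g, coeffs = gcd_with_coefficients([6, 10, 15])
print(g, coeffs)
```

No backtracking is needed because the coefficient vectors are updated alongside the values, and the same loop handles any number of integers.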

The publication of M. Hendy’s note, “The retrial of the lower Slobbovian counterfeiters” (this 
MONTHLY, 87 (1980) 200-201), evoked for Juan Jorge Schäffer of Carnegie-Mellon University the 
memory that, when Problem E1096 appeared in this MONTHLY, 61 (1954) 46, he was a student in 
Zürich. P. Erdős, who happened to visit at that time, mentioned the problem to him, and they 
“soon found the fallacious ‘solution’ that had presumably been submitted in the meantime to the 
MONTHLY, and then discovered the fallacy, as well as the correct solution.” He goes on to say that, 
when later that year he found that the fallacious “solution” had been published (this MONTHLY, 
61 (1954) 472-473), he “wrote the editor of the Elementary Problems section to point out the 
error and provide a correct solution.” When he received no acknowledgement, and the error was 
not corrected nor was a valid solution published, he did not follow up the matter. He encloses a 
copy of his 1954 letter “as a curiosity,” having “nothing to complain about” and notes that “I 
shall be able to continue to boast that my Erdős number is 1, but that it is unlisted.” 

1. Introduction. How do we best prepare mathematics students for careers in industry? How 
should we respond to students’ requests for relevant courses? An informal survey suggests that 
solving these problems by decreasing the emphasis on classical pure mathematics is not in the 
students’ long-term interest. 

During a recent review of the undergraduate major at S.U.N.Y. at Stony Brook, 48 mathemati- 
cians in industry, business, and government replied to a questionnaire from the Department of 
Mathematics. The survey sought opinions on the relative merits, for students entering jobs with a 
bachelor’s degree, of the traditional undergraduate curriculum, advanced topics in “pure” 
mathematics, computer programming, additional computer science, and specialized or applied 
topics. The questionnaire asked for comparisons between courses without limiting the total 
number that could be recommended. 

The mathematicians who participated in the survey favored a broad background which 
combines a large dose of classical, pure mathematics with the problem-solving experience of 
applied courses and computer programming. The responses distinguished between various topics 
in both their potential value and whether they should be studied formally in courses. For example, 
a knowledge of computer programming through either course work or other experience was 
considered a basic skill, essential to the positions available today. Modeling, an important activity 
in practice, was suggested for on-the-job learning. The theory of complex variables, less 
appropriate for independent study, was strongly recommended as coursework. 

Support for programming and applied mathematics had been anticipated from the extensive 
work done by recent committees of the NRC [3] and CUPM [2]. The unexpected results of the 
Stony Brook survey were the preferred emphasis within these areas and the support for including 
the more theoretical courses. The respondents articulated a disturbing discrepancy between the 
courses which make a candidate attractive to an employer initially and the training most likely to 
correlate with success in the long run. These results contradict the usual assumptions about the 
needs of nonacademic employers, on whose behalf curriculum changes have been proposed and in 
whose name departments hope to attract students. 


2. Computer Programming and Computer Science. Programming was described as an im- 
portant tool, if not the main activity, of those hired today with a bachelor’s degree. Two or three 
semesters of programming in a high-level language were recommended, with concepts such as 
structured programming included with instruction in syntax. Programming experience in mathe- 
matics courses, part-time or summer jobs, or internships would be an acceptable substitute for 
coursework. A systems programmer at a large computer company said that more than one 
programming language is “not a plus,” since evidence of the ability to do the job (such as a good 
record in mathematics) would be evidence of the ability to learn other languages “on your own,” 
as needed. 

“Certainly knowledge of a programming language and some experience in programming is 
important,” wrote a group supervisor in a computing research division. “My own requirements 
might stop there, but I believe most employers would be impressed by a considerable number of 
computer science courses.” Well, some would and some would not. The question of computer 
science beyond programming—how much and what topics—elicited the most divergent opinions. 
A consulting company mathematician called such courses “of small value for most of our 
applications.” At the other extreme, naturally enough, were the responses from employees of 
computer companies. One recommended a background in theoretical computer science and, in 
fact, would “prefer a straight C.S. degree.” Another said there is a place for majors in pure 
mathematics who have an interest in computers, a programming course, and “a few ‘advanced’ 
computer science courses.” 


3. Applied Mathematics. Experience in “getting answers to specific problems” is the purpose 
of the two or three semesters of applied mathematics that were recommended. Except in fields 
requiring a working knowledge of statistics, the particular topic was less important to those 
surveyed than the practical experience of problem-solving. “Teach them to solve differential 
equations, matrix transformations, or whatever, but require them to get numerical answers that 
can be checked against actual experience,” urged a senior advisor to a major aerospace company. 
Another consultant recommended “working on a real problem that requires the use of a computer 
and for which the straightforward textbook approach fails to give the right answer. Examples of 
such ideas include the inversion of a near singular matrix, solutions to ill-conditioned differential 
equations, or even the discrete time analysis of a continuous system.” 

Among applied courses, the preference was clearly for the more traditional topics: numerical 
analysis, statistics (using a major statistical package such as SPSS), probability, differential 
equations, numerical linear algebra, or optimization. Numerical analysis, strongly recommended 
for both computer science and applied mathematics, was seen as a rare opportunity to bring 
together these fields. There was widespread agreement that “knowledge in the area of operations 
research or modeling, although of interest, may be obtained once employed through either 
in-house training or short summer courses.” A staff member at a research laboratory described 
operations research as a marketable skill, but one of the early practitioners in the field has 
“always thought it was better learned by apprenticeship; too much mathematical apparatus 
obscures the operational facts.” A systems analyst at a national laboratory, who uses optimization 
and simulation techniques, relies on measure theory, topology, and her undergraduate courses in 
calculus, elementary probability and statistics, symbolic logic, and programming. 


4. Relations with Scientists and Engineers. Nonacademic work often involves collaboration 
with researchers in other fields. Courses in the physical sciences were recommended to develop the 
ability to communicate with and appreciate the problems of scientists and engineers. “Of course,” 
wrote one research group supervisor, “many scientists know a great deal of mathematics, at least 
that of use to them. To be useful, a mathematician must exceed this knowledge in breadth or 
depth, or in both ways. This suggests to me that a mathematics major should take a great many 
mathematics courses, including those usually required of engineers, for instance, as well as the 
theoretical courses required in the normal mathematics curriculum.” 


5. Pure Mathematics. What of those theoretical courses? The questionnaire included several 
statements for the respondents to indicate a degree of agreement or disagreement. There was 
strong agreement with the statement “We prefer students with a broad background in classical 
mathematics [described as calculus, linear algebra, advanced calculus, one complex variable, etc.] 
and we will train them in more specialized knowledge.” There was similarly strong disagreement 
with the statement “With respect to [our] positions, any mathematics beyond advanced calculus 
(e.g., one complex variable, group theory, topology, etc.) is not useful in an undergraduate 
education.” Elaborating, the reaction to abstract algebra and topology ranged from “not particu- 
larly desirable” to “should be included,” but mostly favorable. Symbolic logic was called 
“invaluable” to learn “ways to structure ideas at all levels, from abstract models to FORTRAN 
programs.” A “sound foundation in set theory and linear algebra” was recommended by a senior 
computer scientist for all mathematics undergraduates. 

Pure mathematics was generally credited with developing the ability to “think clearly,” 
“abstractly,” “analytically,” and “precisely,” and to “appreciate rigor.” The issue was whether 
such experience is a “luxury,” as one researcher called it, when the student needs the “ability to 
do something which industry is willing to pay for.” The general response was that a thorough 
grounding in the classical material is (at least) as important as the immediately marketable skills 
developed in applied courses. “It is generally not possible to predict the sort of work one might be 
doing, even in a few years.” Only a broad background including advanced work in pure 
mathematics was seen to provide the “flexibility to learn whatever else might become necessary” 
as technology and interests, one’s own and one’s employer’s, inevitably change. There was little 
concern that the specific material of these courses would not actually be used; the experience of 
studying it and the added insight and mastery of more elementary concepts were adequate reason 
to include them. 

The questionnaire did not explicitly address the issue of “continuous” versus “discrete” 
mathematics, but some of the replies did. Noting that a year of calculus is sufficient for their 
purposes, a data systems engineer and a systems analyst argued for replacing further study of 
continuous functions (advanced calculus, differential equations, analysis) by discrete mathematics 
(combinatorics, difference equations, graph theory, semi-groups, lattice theory). This is not, of 
course, a new idea (see [1] and [4], for example), but considering the rise of introductory “finite 
math” courses in the past two decades, it seemed to find little support among the respondents. For 
the most part, those who work with “structures” and discrete phenomena recommended the more 
traditional offerings in set theory, elementary probability and statistics, symbolic logic, and 
abstract algebra along with linear algebra and advanced calculus. 


6. Conclusions. This survey sought representative opinions of nonacademic mathematicians as 
guidelines for revising the undergraduate mathematics program and advising students at S.U.N.Y. 
at Stony Brook. The recommended training combines experience in programming and problem- 
solving courses with a background in classical, pure mathematics. A good record in such a 
program is both marketable and flexible; neither aspect can be safely ignored. “The paradox,” 
one respondent observed, “is that specialized courses (numerical analysis, optimization, numerical 
linear algebra) increase the probability of landing a job but a broad general background (topology, 
abstract algebra, complex analysis) is more important for long term success.” 

An informal study like this one cannot provide definitive answers to complex issues. It can, 
however, uncover opinions that have not received much attention otherwise. The results here, for 
example, challenge the assumption that replacing “theory” by “practice” is sound preparation 
for a career. To do so might attract students to the department, but it is not clear that it will serve 
them well beyond a first job. In light of proposals for significant revisions in course offerings and 
major requirements, further evidence for or against these changes is clearly needed. If the opinions 
reported here are indeed representative of our colleagues in industry, the task ahead of the 
mathematical community is not to withdraw the theoretical part of the college curriculum but to 
educate career counselors, personnel officers, and students to the recommendations of those 
working in the field. 
RICHARD M. GRASSL, DOUGLAS A. HENSLEY, ISRAEL N. HERSTEIN, ROBERT H. JOHNSON, ELGIN H. 
JOHNSTON, MURRAY S. KLAMKIN, DANIEL J. KLEITMAN, JOSEPH D. E. KONHAUSER, FREDERICK W. 
LUTTMANN, MARVIN MARCUS, LOUISE E. MOSER, M. J. PELLING, HOWARD E. REINHARDT, C. M. REIS, B. L. 
R. SHAWYER, EDWARD T. H. WANG, AND ALBERT WILANSKY. 


Send all proposed problems, in duplicate if possible, to Professor G. L. Alexanderson, Department of Math- 
ematics, University of Santa Clara, Santa Clara, CA 95053. Please include solutions, relevant references, etc. 

An asterisk (*) indicates that neither the proposer nor the editors supplied a solution. 

Solutions should be sent to the addresses given at the head of each problem set. 

A publishable solution must, above all, be correct. Given correctness, elegance and conciseness are preferred. 
The answer to the problem should appear right at the beginning. If your method yields a more general result, so 
much the better. If you discover that a MONTHLY problem has already been solved in the literature, you should of 
course tell the editors; include a copy of the solution if you can. 


ELEMENTARY PROBLEMS 


Solutions of these Elementary Problems should be mailed in duplicate to Professor G. L. Alexanderson, 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by January 31, 1984. Please 
place the solver’s name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label 
(for acknowledgment). 


E 2974 [1982, 756] Correction. Proposed by Jordi Dou, Barcelona, Spain. 


This problem, posed originally in the December 1982 issue of this MONTHLY, page 756, 
contained an error introduced in the editorial process. The problem should read: “Let AMB 
(oriented clockwise) and CMD (counter-clockwise) be similar triangles. Prove that triangles ACX 
(clockwise) and YDB (clockwise), both similar to the first triangles, have the same circumcenter.” 

E 3007. Proposed by George Odom, Poughkeepsie, NY. 


Let A and B be the midpoints of the sides EF and ED of an equilateral triangle DEF. Extend 
AB to meet the circumcircle (of DEF) at C. Show that B divides AC according to the golden 
section. 

E 3008. Proposed by Roger Cuculière, Paris, France. 

Show that for n > 2 the polynomial $x^n - x^{n-1} - x^{n-2} - \cdots - x - 1$ is irreducible over the 
rationals. 

E 3009. Proposed by Chris Jantzen, University of Wisconsin. 

Points X, Y, Z are chosen on the sides of a triangle ABC such that 


\[ \frac{AX}{XB} = \frac{BY}{YC} = \frac{CZ}{ZA} = k \]


and a triangle PQR is formed using CX, AY and BZ as sides. The operation is repeated on the 
triangle PQR, that is, the points X’, Y’, Z’ are chosen on the sides of PQR such that 


\[ \frac{PX'}{X'Q} = \frac{QY'}{Y'R} = \frac{RZ'}{Z'P} = k \]
and a triangle LMN is formed using RX’, PY’ and QZ’ as sides. Show that LMN is similar to 
ABC and find the ratio of similarity. 


E 3010. Proposed by F. David Hammer, Santa Cruz, CA. 
Prove that for all n > 0 the relation 
\[ \left[ n^{1/2} + (n+1)^{1/2} + (n+2)^{1/2} \right] = \left[ (9n+8)^{1/2} \right] \]


holds. Here [·] denotes the greatest integer function. 
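
A reader may wish to test the relation numerically before attempting a proof. The following is only a sketch: it assumes that the exponents in the relation are 1/2 (square roots), it uses floating point for the left side, and the function names are ours, not the proposer's. A finite search of this kind proves nothing, of course.

```python
import math

def floor_sum_of_roots(n: int) -> int:
    """Floor of sqrt(n) + sqrt(n+1) + sqrt(n+2), computed in floating point."""
    return math.floor(math.sqrt(n) + math.sqrt(n + 1) + math.sqrt(n + 2))

def floor_root(n: int) -> int:
    """Floor of sqrt(9n + 8), computed exactly with integer arithmetic."""
    return math.isqrt(9 * n + 8)

def first_counterexample(limit: int):
    """Return the smallest n in [0, limit) where the two sides differ, else None."""
    for n in range(limit):
        if floor_sum_of_roots(n) != floor_root(n):
            return n
    return None
```

For the modest ranges a search like this covers, the rounding error of the floating-point sum is far smaller than its distance from the nearest integer, so the comparison is reliable there.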


E 3011. Proposed by C. P. Popescu, Romania. 


Let X be a connected (path connected) topological space and let A, B be two nonvoid disjoint 
subsets of X such that boundary (A) = boundary (B). Is it true that A ∪ B = X? 


E 3012. Proposed by J. L. Selfridge, Mathematical Reviews. 


Show that $2^a - 1$ does not divide $3^a - 1$ if a > 1. More generally, $2^a - 1$ does not divide 
$3^b - 1$ when a > 1 and a and b have the same parity. 
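
The assertion is easy to explore by machine. The sketch below assumes the statement reads "2^a − 1 does not divide 3^b − 1 when a > 1 and a, b have the same parity"; the helper names are hypothetical. It also shows that the parity hypothesis cannot simply be dropped.

```python
def divides(a: int, b: int) -> bool:
    """True when 2**a - 1 divides 3**b - 1 (intended for a >= 2)."""
    modulus = 2**a - 1
    # Three-argument pow keeps the intermediate numbers small.
    return pow(3, b, modulus) == 1

def same_parity_counterexamples(max_a: int, max_b: int) -> list:
    """All (a, b) with 2 <= a <= max_a, 1 <= b <= max_b, a = b (mod 2),
    for which 2**a - 1 divides 3**b - 1."""
    return [(a, b)
            for a in range(2, max_a + 1)
            for b in range(1, max_b + 1)
            if (a - b) % 2 == 0 and divides(a, b)]
```

With opposite parities divisibility can occur: 2^3 − 1 = 7 divides 3^6 − 1 = 728.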


SOLUTIONS OF ELEMENTARY PROBLEMS 


$an^2 + bn + c$ Is Always Divisible by a Prime $p \equiv 1 \pmod 4$ 


E 2883 [1981, 291]. Proposed by J. M. Patin, St. Herblain, France. 


Can integers a, b, c be found so that the nonconstant polynomial $an^2 + bn + c$ has all its 
prime factors congruent to 3 (mod 4) for n = 1, 2, ...? 


Solution by Jacob A. Brandler, Brooklyn, NY; Robert Breusch, Amherst, Mass; Benny Cheng & 
Dinh Thé Hung, students, University of California, Berkeley; Lorraine L. Foster, California State 
University, Northridge; Robert Gilmer, Florida State University; Sahib Singh, Clarion State 
College, PA; Allen Stenger, Hughes Aircraft Company, and University of South Alabama Problem 
Group. No such integers exist. Set $p(n) = an^2 + bn + c$; then $4ap(n) = (2an + b)^2 + 4ac - b^2$. 
If a = 0 or $4ac - b^2 = 0$, then p(n) is divisible by primes of the form 1 (mod 4) for infinitely 
many n, so we assume that $a(4ac - b^2)$ is not zero and let $p_1, p_2, \ldots, p_m$ be the distinct odd prime 
divisors of $a(4ac - b^2)$. The Chinese Remainder Theorem and Dirichlet’s Theorem guarantee that 
there is a prime p > a such that $p \equiv 1 \pmod{p_i}$ for all i, $1 \le i \le m$, and $p \equiv 1 \pmod 8$. Since 
$(p_i/p) = (p/p_i) = (2/p) = (-1/p) = 1$, there is an x such that $x^2 \equiv b^2 - 4ac \pmod p$. Since 
(2a, p) = 1, $2an + b \equiv x \pmod p$ has a solution for n, so p | p(n) for some positive n. 

Clearly there are infinitely many such primes p. Cheng and Hung replaced 1 mod 4 by 1 mod 
m; Foster proved a considerable generalization. Advanced Problem 6340 (page 489) seems to be 
the last word in this connection. 

Every prime divisor of $4n^2 + 1$ is congruent to 1 mod 4. 
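
Both the solution and the closing remark can be illustrated by a small search. The sketch below is ours and makes no claim to reflect any solver's method: it factors values of a quadratic by trial division and reports the first prime factor congruent to 1 mod 4. The coefficient triples tried in the usage note are arbitrary illustrations.

```python
def prime_factors(m: int) -> list:
    """Distinct prime factors of |m| (m nonzero), by trial division."""
    m = abs(m)
    factors = []
    d = 2
    while d * d <= m:
        if m % d == 0:
            factors.append(d)
            while m % d == 0:
                m //= d
        d += 1
    if m > 1:
        factors.append(m)
    return factors

def first_prime_1_mod_4(a: int, b: int, c: int, max_n: int = 1000):
    """Smallest n in 1..max_n such that a*n^2 + b*n + c has a prime factor
    p = 1 (mod 4); returns (n, p) with p the least such factor, or None."""
    for n in range(1, max_n + 1):
        value = a * n * n + b * n + c
        if value == 0:
            continue
        for p in prime_factors(value):
            if p % 4 == 1:
                return n, p
    return None
```

For example, n^2 + n + 1 takes the values 3, 7, 13 at n = 1, 2, 3, so the search stops at (3, 13); for 4n^2 + 3 the values are 7, 19, 39 = 3 · 13, and it again stops at (3, 13).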


Also solved by the proposer. 


Random Walks and Catalan Numbers 


E 2903 [1981, 619]. Proposed by Louis W. Shapiro, Howard University. 


Consider walks in the northeast quadrant which start at (0,0) and such that each step is one 
unit east or north and the points (1, 1), (3, 3), ..., (2k + 1, 2k + 1), ... are forbidden. How many 
paths are there to (2n,2n)? 


Many solvers used the method described below; some found the solution of an equivalent 
problem in W. Feller’s Introduction to Probability. The Catalan numbers give the number of ways 
of parenthesizing a product. 


Solution by Jo Anne Simpson Growney, Bloomsburg State College, Pennsylvania. After some 
brief historical remarks, we shall show that the number of such paths, for n = 0, 1, 2, ..., is 


\[ C(2n) = \frac{1}{2n+1}\binom{4n}{2n}, \]


where the symbol C(2n) denotes the 2nth member of the Catalan sequence. 
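
The stated count can be confirmed for small n by direct enumeration. The following sketch (our own, with hypothetical function names) counts the admissible walks by dynamic programming and compares the result with C(2n), taking C(k) to be the binomial coefficient (2k choose k) divided by k + 1.

```python
from math import comb

def count_paths(n: int) -> int:
    """Number of east/north lattice walks from (0,0) to (2n,2n) that avoid
    every point (2k+1, 2k+1)."""
    size = 2 * n
    ways = [[0] * (size + 1) for _ in range(size + 1)]
    ways[0][0] = 1
    for x in range(size + 1):
        for y in range(size + 1):
            if x == 0 and y == 0:
                continue
            if x == y and x % 2 == 1:
                continue  # forbidden point: leave the count at zero
            from_west = ways[x - 1][y] if x > 0 else 0
            from_south = ways[x][y - 1] if y > 0 else 0
            ways[x][y] = from_west + from_south
    return ways[size][size]

def catalan(k: int) -> int:
    """k-th Catalan number, (2k choose k) / (k + 1)."""
    return comb(2 * k, k) // (k + 1)
```

For n = 1 there are six walks to (2,2), four of which pass through (1,1), leaving 2 = C(2).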

In an article entitled “Arrangements of m things of one sort and n things of another sort under 
certain conditions of priority” (Messenger of Math., vol. 8 (1878), pp. 105-114), W. A. Whitworth 
showed that 

\[ \frac{m-n+1}{m+1}\binom{m+n}{n} \]

gives the number of routes from the origin (0,0) to the point (m, n) in the Cartesian plane which 
do not go above the line y = x and which consist of m eastward paces and n northern paces 
(m ≥ n), each one unit in length. In the special case m = n, Whitworth’s formula gives the nth 
Catalan number. 

The Catalan numbers have a long history dating back to Euler (1758). Their frequent 
occurrence in the mathematical literature has been well documented in a Research Bibliography 
compiled by H. W. Gould (West Virginia University, Morgantown, WV 25606). 

To show that the problem given above is equivalent to Whitworth’s problem, we need to make 
two simple observations: 


(1) The restriction, in the given problem, that points with odd and equal coordinates be 
avoided means that, as a walk from (0,0) to (2n,2n) progresses, at the 2n points 
(0,1), (1,0), (2,3), (3,2), ..., (2n − 2, 2n − 1), (2n − 1, 2n − 2) there is only one possible 
choice for the next step. 

(2) The restriction, in Whitworth’s problem, that paths from (0,0) to (2n,2n) not go above the 
diagonal, means that at the 2n points (0,0), (1,1), (2,2), ..., (2n − 1, 2n − 1) there is only 
one possible choice for the next step. 


Thus the given problem and Whitworth’s problem both ask us to find the number of different 
sequences of 2n paces—n of which are eastward and n of which are northward—which have an 
identical number of options excluded. The solutions are therefore equivalent. 
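
Whitworth’s count itself is easily checked by machine for small m and n. The sketch below is ours: it counts the routes to (m, n) that never rise above the line y = x and compares the result with the expression (m − n + 1)/(m + 1) times the binomial coefficient (m + n choose n), which is how we read Whitworth’s formula.

```python
from math import comb

def ballot_paths(m: int, n: int) -> int:
    """Routes of unit east/north paces from (0,0) to (m,n), n <= m,
    that never go above the line y = x."""
    ways = [[0] * (n + 1) for _ in range(m + 1)]
    ways[0][0] = 1
    for x in range(m + 1):
        for y in range(min(x, n) + 1):  # only points with y <= x are admissible
            if x == 0 and y == 0:
                continue
            from_west = ways[x - 1][y] if x > 0 else 0  # zero when (x-1, y) is above the line
            from_south = ways[x][y - 1] if y > 0 else 0
            ways[x][y] = from_west + from_south
    return ways[m][n]

def whitworth(m: int, n: int) -> int:
    """Whitworth's expression (m-n+1)/(m+1) * C(m+n, n); the division is exact."""
    return (m - n + 1) * comb(m + n, n) // (m + 1)
```

Taking m = n = 4 gives 14, the fourth Catalan number.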

A proof that the nth Catalan number counts the number of paths from (0,0) to (n, n) that do 
not cross the diagonal may be obtained by (a quite laborious) mathematical induction, based on 
the demonstration that the number of paths to (n, n) is $\frac{4n-2}{n+1}$ times the number of such 
paths to (n − 1, n − 1). 
Alternatively, use of the recursion relation 


\[ C(n) = C(0)C(n-1) + C(1)C(n-2) + \cdots + C(n-1)C(0), \quad \text{with } C(0) = 1, \]


(which can readily be seen to be true for northeast quadrant paths of Whitworth’s type) provides 
another basis for derivation of the formula 


\[ C(n) = \frac{1}{n+1}\binom{2n}{n}. \]
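
The agreement between the recursion, the ratio of consecutive terms, and the closed formula can be seen in a few lines. This is only an illustrative sketch; it assumes the initial value C(0) = 1 and the ratio C(n)/C(n − 1) = (4n − 2)/(n + 1), and the function name is ours.

```python
from math import comb

def catalan_by_convolution(count: int) -> list:
    """First `count` Catalan numbers (count >= 1) from
    C(n) = C(0)C(n-1) + C(1)C(n-2) + ... + C(n-1)C(0), with C(0) = 1."""
    c = [1]
    for n in range(1, count):
        c.append(sum(c[k] * c[n - 1 - k] for k in range(n)))
    return c
```

The first terms produced are 1, 1, 2, 5, 14, 42, 132, 429.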


Also solved by D. M. Bloom, P. S. Bruckman, K. M. Chan and Y. M. Chow (Singapore), F. E. Crawford 
(student), J. Dou (Spain), L. L. Foster, I. Gessel, R. Z. Goldstein, J. P. Goulden (Canada), V. Hernandez (Spain), 
D. R. Hill and R. W. Irving (UK), L.-H. Hsu (Taiwan), O. Jorsboe (Denmark), O. Krafft (Germany), O. P. Lossers 
(Netherlands), W. A. Newcomb, D. E. Orr, C. R. Pranesachar (India), R. P. Savage, Jr., R. Scharlach (Germany), R. 
Sheets, A. Smuckler (Israel), D. G. Weinman, M. Woltermann, K. Zikan, and the proposer. 
Representation of $\lim A^n$ (of a Matrix) 


E 2907 [1981, 618]. Proposed by H. Kestelman, University College, London. 


Given that the sequence $A, A^2, A^3, \ldots$ converges to a nonzero matrix $A^\infty$, show that $A^\infty = 
V(WV)^{-1}W$, where V is any matrix whose columns are a basis of the right kernel of A − I, and W 
is any one whose rows are a basis of the left kernel of A − I. 


Solution by Emeric Deutsch, Polytechnic Institute of New York. It is known (see U. G. 
Rothblum, Mathematical Programming Study, vol. 6, 1976, pp. 188-201), and relatively easy to 
verify, that if E is an idempotent matrix and the columns of a matrix X (resp. Y) form a basis of 
range E (resp. range E*), then 
\[ E = X(Y^*X)^{-1}Y^*. \tag{1} \]


By assumption, we have AV = V, WA = W, which imply at once that $A^\infty V = V$ and 
$WA^\infty = W$. It is also easy to see that $A^\infty$ is idempotent. Thus, the columns of V (resp. W*) form a 
basis of range $A^\infty$ (resp. range $(A^\infty)^*$). Then, from (1) we obtain the desired equality. 
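
A small exact computation illustrates the identity. The matrix below is a hypothetical example of our own choosing (a 2 × 2 stochastic matrix with eigenvalues 1 and 7/10), and the helper handles only the case where both kernels are one-dimensional, so that WV is a scalar; it is a sketch, not a general implementation.

```python
from fractions import Fraction as F

def matmul(p, q):
    """Product of two matrices stored as lists of rows."""
    return [[sum(p[i][k] * q[k][j] for k in range(len(q)))
             for j in range(len(q[0]))] for i in range(len(p))]

def matpow(a, exponent):
    """a**exponent by repeated multiplication (exponent >= 1)."""
    result = a
    for _ in range(exponent - 1):
        result = matmul(result, a)
    return result

# Hypothetical example matrix; its powers converge because the second eigenvalue is 7/10.
A = [[F(9, 10), F(1, 10)], [F(1, 5), F(4, 5)]]
V = [[F(1)], [F(1)]]   # single column spanning the right kernel of A - I
W = [[F(2), F(1)]]     # single row spanning the left kernel of A - I

def limit_from_kernels(v, w):
    """V (WV)^{-1} W in the special case where WV is a 1 x 1 matrix."""
    wv = matmul(w, v)[0][0]
    return [[v[i][0] * w[0][j] / wv for j in range(len(w[0]))]
            for i in range(len(v))]

LIMIT = limit_from_kernels(V, W)
```

Here WV = 3, so the formula gives the matrix with both rows equal to (2/3, 1/3), and the entries of A^n differ from it by at most (7/10)^n.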

Also solved by E. Deutsch (second solution), A. L. Duarte and J. F. Quiero (Portugal), D. E. Orr, P.-Y. Wu 
(Taiwan), and the proposer. 


A Polynomial Congruence Modulo Higher and Higher Powers 


E 2908 [1981, 705]. Proposed by Ken Brown, Nova High School, Fort Lauderdale, FL. 
Let $p(x) = 2x^3 + 3x^2 - 1$. Define the sequence $\{a_n\}$ by 
\[ a_1 = 4, \qquad a_{n+1} = p(a_n) - 10^f \left[ p(a_n)/10^f \right], \quad n > 0, \quad f = 2^n. \]


Show that $a_n$ satisfies the congruence $x^3 \equiv x \pmod{10^e}$, $e = 2^{n-1}$. 


Solution by Ira Gessel, Massachusetts Institute of Technology, and University of South Alabama 
Problem Group. It can be shown by induction that mod $5^e$, $a_n \equiv -1$ if n is odd; $a_n \equiv 0$ if n is 
even. Also mod $2^e$, $a_n \equiv 0$ if n is odd; $a_n \equiv -1$ if n is even. Thus if m is odd, $a_n^m \equiv a_n \pmod{10^e}$. 
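
The first few terms are quickly computed. The sketch below rests on our reading of the problem, namely p(x) = 2x^3 + 3x^2 − 1, a_1 = 4, and a_{n+1} equal to p(a_n) reduced modulo 10^f with f = 2^n; the function names are hypothetical.

```python
def p(x: int) -> int:
    """The polynomial 2x^3 + 3x^2 - 1 (assumed reading of the problem)."""
    return 2 * x**3 + 3 * x**2 - 1

def sequence(count: int) -> list:
    """[a_1, ..., a_count], where a_1 = 4 and a_{n+1} = p(a_n) mod 10**(2**n)."""
    terms = [4]
    for n in range(1, count):
        terms.append(p(terms[-1]) % 10 ** (2**n))
    return terms
```

The sequence begins 4, 75, 624; for instance 624 · 625 = 390000 is divisible by 10^4, in agreement with the congruences asserted for n = 3.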


O. P. Lossers and C-N. Lee showed that $a_n^2 + a_n \equiv 0 \pmod{10^e}$. L. L. Foster gave further generalizations. 


Also solved by an anonymous Chinese solver, L. L. Foster, P. Kumar (India), C-N. Lee, O. P. Lossers 
(Netherlands), A. Stenger, and the proposer. 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor G. L. Alexanderson, 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by January 31, 1984. The 
solver’s full post-office address should be on each sheet. 


6436. Proposed by H. Kestelman, University College, London. 


Let V be a real vector space normed by a scalar product. Let f: V — V be such that whenever 
a, b,c are vertices of a triangle right-angled at a, then f(a), f(b), f(c) are vertices of a triangle 
right-angled at f(a). Show that f is a continuous collineation. 


6437. Proposed by Bhaskar Bagchi, Indian Statistical Institute, Calcutta. 


Given n meromorphic functions $f_1, f_2, \ldots, f_n$ on a planar region U, show that there exists a 
holomorphic function g on U such that $f_j(z) \neq g(z)$ for $1 \le j \le n$ and for all z in U. 
6438. Proposed by J. L. Brenner, Palo Alto, California. 


Construct a differentiable real function that is positive at every rational point, but zero at an 
uncountable number of points, or prove that no such function exists. 


SOLUTIONS OF ADVANCED PROBLEMS 
Three Circles with Collinear Centers 


3887 [1938, 482]. Proposed by V. Thébault, Le Mans, France 


Through the vertex A of a triangle ABC, a straight line AM is drawn cutting the side BC in M. 
Let $2\theta$ be the angle AMC; O and I the centers of the circumscribed circle (O) and the inscribed 
circle (I) of ABC. The circles $(\omega_1)$ and $(\omega_2)$ with centers $\omega_1$ and $\omega_2$ and radii $\rho_1$ and $\rho_2$ are each 
tangent to (O) and the first is tangent also to the two sides of angle AMC while the second is 
tangent to the two sides of angle AMB. Prove that: (1) The straight line joining $\omega_1$ and $\omega_2$ passes 
through I. (2) The point I divides the segment $\omega_1\omega_2$ in the ratio $\tan^2\theta : 1$; and $\rho_1 + \rho_2 = r^2\sec^2\theta$, 
where r is the radius of (I). (See Fig. 1.) 


FIG. 1. 


Editorial comment. The above problem, proposed over 40 years ago by the prolific and 
penetrating geometer V. Thébault, has remained unsolved until recently, when K. B. Taylor of 
England submitted a solution. Mr. Taylor had run across a restatement of the problem in C. 
Stanley Ogilvy: Tomorrow’s Math, Unsolved Problems for the Amateur, Oxford University Press, 
1962, p. 70. The solution given by Mr. Taylor occupies 24 manuscript pages and is based upon a 
lengthy calculation of involved distances. For lack of space, and with apologies to Mr. Taylor, we 
here reproduce his solution in a brief summarized form. Thébault’s finding can be more 
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attractively stated as follows: Let P be a quadrilateral inscribed in a circle (O) and let Q be the 
quadrilateral formed by the centers of the four circles internally touching (O) and each of the two 
diagonals of P. Then the incenters of the four triangles having for sides the sides and diagonals of 
P form a rectangle R inscribed in Q. 


Solution by K. B. Taylor, Salford, England. Let E, F, K be the points of tangency of 
(w,),(@,),(7) with line BC. By brute calculation we find, using standard notation applied to 
triangle ABC, 


po, =r{—rtan’ 6 — (c — b)tan9 + 7,}/(r, — r), 
p, = r{—rcot?6+ (c— b)cot8+7r,}/(7, - 1), 
FK=rcot@, KE=rtané@. 
Using these expressions, one can show that 
slope w,lI = slope Iw,, 
whence w,, J, w, are collinear. One can also show that 
wT: Iw, = FK: KE = 1: tan’ 0, 
and that 
r= p,cos*@ + p,sin’ 0. 


(The relation p, + p, = r’?sec* @ given by the proposer is manifestly false, since it is dimen- 
sionally incorrect.) 

From expressions similar to the above distance relations, one can show that if J’ is the excenter 
opposite vertex A, (w‘) and (w4) are the circles externally tangent to (O), both also tangent to BC 
and AM produced, then w}, J’, w are collinear, and 


wT’: I’w, = tan? 0:1, 7, = pcos? 6 + p35 sin’ 6. 


When M is coincident with the point K of tangency of the incircle (J) with BC, then 
Pp) = ps =r, and tan20 = 2r,/(c — b). When M is coincident with K’, p, = p, = r and tan26 
= 2r/(c — b). 


Borel Subsets of a Product Space 


6023* [1975, 308; 1983, 136]. Proposed by S. J. Sidney, University of Connecticut. 


If for each k in the uncountable index set K, J, denotes a copy of [0,1] and U, denotes the 
copy of (0, 1] contained therein, prove or disprove that II,U, is a Borel set in the compact space 
Il k I, e 


A solution by M. G. Pelling was published in the February 1983 issue of this MONTHLY. A. H. 
Stone (The University of Rochester) has pointed out that in a paper by D. Maharam and himself, 
Borel Boxes, Pacific J. Math., 81 (1979) 471-473, it is shown that a nonempty product IT{Y,: 
k € K} is Borel in II{ X,: k € K}, where each X, is first countable and a Hausdorff space, if and 
only if (1) each Y, is Borel in X,, and (2) for all but countably many k’s, Y, is closed in X,. 

This immediately yields the negative answer to the problem and the proof is simpler than 
Pelling’s. 


Additive Set Functions of Bounded Variation 


6256 [1979, 132]. Proposed by A. Kussmaul and P. E. Kopp, University of Hull, England. 


Prove or disprove the assertion in a recent text that every countably additive real-valued set 
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attractively stated as follows: Let P be a quadrilateral inscribed in a circle (O) and let Q be the 
quadrilateral formed by the centers of the four circles internally touching (O) and each of the two 
diagonals of P. Then the incenters of the four triangles having for sides the sides and diagonals of 
P form a rectangle R inscribed in Q. 


Solution by K. B. Taylor, Salford, England. Let E, F, K be the points of tangency of 
(w,),(@,),(7) with line BC. By brute calculation we find, using standard notation applied to 
triangle ABC, 


op, = r{—rtan’® 6 — (c— b)tan8 + 7,}/(r, — 7), 
p, = r{—rcot?6 + (c— b)cot9+7r,}/(7 — 7), 
FK=rcot@, KE=rtané@. 
Using these expressions, one can show that 
slope w,l = slope Jw,, 
whence w,, J, w, are collinear. One can also show that 
wT: Iw, = FK: KE = 1: tan’ 0, 
and that 
r= p,cos*@ + p,sin’ 0. 


(The relation p, + p, = r’?sec* @ given by the proposer is manifestly false, since it is dimen- 
sionally incorrect.) 

From expressions similar to the above distance relations, one can show that if J’ is the excenter 
opposite vertex A, (w{) and (w;) are the circles externally tangent to (O), both also tangent to BC 
and AM produced, then w}, J’, w‘ are collinear, and 


wT’: I’w, = tan? 0:1, 7, = pcos? 6 + p5 sin’ 6. 


When M is coincident with the point K of tangency of the incircle (J) with BC, then 
Pp) = p5 =r, and tan20 = 2r,/(c — b). When M is coincident with K’, p, = p, = r and tan26 
= 2r/(c — b). 


Borel Subsets of a Product Space 


6023* [1975, 308; 1983, 136]. Proposed by S. J. Sidney, University of Connecticut. 


If for each k in the uncountable index set K, J, denotes a copy of [0,1] and U, denotes the 
copy of (0, 1] contained therein, prove or disprove that II,U, is a Borel set in the compact space 
II, J,. 


A solution by M. G. Pelling was published in the February 1983 issue of this MONTHLY. A. H. 
Stone (The University of Rochester) has pointed out that in a paper by D. Maharam and himself, 
Borel Boxes, Pacific J. Math., 81 (1979) 471-473, it is shown that a nonempty product IT{Y,: 
k € K} is Borel in II{ X,: k € K}, where each X, is first countable and a Hausdorff space, if and 
only if (1) each Y, is Borel in X,, and (2) for all but countably many k’s, Y, is closed in X,. 

This immediately yields the negative answer to the problem and the proof is simpler than 
Pelling’s. 


Additive Set Functions of Bounded Variation 


6256 [1979, 132]. Proposed by A. Kussmaul and P. E. Kopp, University of Hull, England. 


Prove or disprove the assertion in a recent text that every countably additive real-valued set 
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function on a ring R of sets is of bounded variation. Is the assertion true if R is an algebra of sets? 


Solved by A. Brunnschweiler (Switzerland), F. S. Cater, A. Janssen (West Germany), R. Bruce Mericle, Charles 
Riley, J. C. Smit (Netherlands) and the proposers. All solvers gave an example where R is the ring of subsets A of a 
set X such that either A or X − A is finite, with the set function taking the value card A when A is finite, and 
minus the cardinal of X − A when X − A is finite. 
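
A small computation makes the failure of bounded variation visible. The sketch below assumes X is the set of natural numbers (any infinite set would do) and uses hypothetical names `FinCofin`, `mu`, `variation_sum`; it splits X into n singletons and one cofinite remainder and adds up the absolute values of the set function.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FinCofin:
    """A finite or cofinite subset of the natural numbers (assumed X).

    `members` holds the elements of the set when it is finite, and the
    elements of its complement when `cofinite` is True.
    """
    members: frozenset
    cofinite: bool = False


def mu(a: FinCofin) -> int:
    """Cardinal of A if A is finite, minus the cardinal of X - A otherwise."""
    return -len(a.members) if a.cofinite else len(a.members)


def partition_of_x(n: int) -> list:
    """Split X into the singletons {0}, ..., {n-1} and the cofinite remainder."""
    singletons = [FinCofin(frozenset({k})) for k in range(n)]
    remainder = FinCofin(frozenset(range(n)), cofinite=True)
    return singletons + [remainder]


def variation_sum(n: int) -> int:
    """Sum of |mu| over the partition: a lower bound for the variation on X."""
    return sum(abs(mu(piece)) for piece in partition_of_x(n))


def signed_sum(n: int) -> int:
    """Sum of mu over the partition; additivity requires this to equal mu(X) = 0."""
    return sum(mu(piece) for piece in partition_of_x(n))
```

The signed sum stays at 0 while the sum of absolute values is 2n, so no finite bound on the variation is possible.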


A Condition on Entire Functions 


6279 [1979, 793; 1981, 542]. Proposed by Lee A. Rubel, University of Illinois, Urbana-Champaign. 


Let f(z) be an entire function such that the maximum modulus over every closed line segment 
L is achieved at one of the endpoints a and b of L; that is, 


\[ \max\{|f(z)| : z \in L\} = \max\{|f(a)|, |f(b)|\}. \]


Prove that f(z) has either the form $A(z - B)^n$ or the form $A \exp Bz$, where A and B are constants 
and n is a nonnegative integer. 

The proposer pointed out that there is a gap in the published solution. Below are two correct 
solutions. 
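
Before the proofs, the endpoint condition can be probed numerically. The sketch below is only a sanity check on sampled points, not a proof; the particular constants, the helper names, and the choice of $z^2 + 1$ as a function that violates the condition are our own illustrative assumptions.

```python
import cmath
import random


def endpoint_max_holds(f, a, b, samples=400, tol=1e-9):
    """Check on sampled points that |f| on [a, b] never exceeds its larger endpoint value."""
    bound = max(abs(f(a)), abs(f(b)))
    for k in range(samples + 1):
        z = a + (b - a) * (k / samples)
        if abs(f(z)) > bound * (1 + tol) + tol:
            return False
    return True


def random_point(rng, radius=3.0):
    """A random complex number in the square of half-width `radius`."""
    return complex(rng.uniform(-radius, radius), rng.uniform(-radius, radius))


def holds_on_random_segments(f, trials=200, seed=0):
    """Apply the endpoint test to `trials` random segments."""
    rng = random.Random(seed)
    return all(
        endpoint_max_holds(f, random_point(rng), random_point(rng))
        for _ in range(trials)
    )


def power(z):
    """An instance of A(z - B)^n with illustrative constants."""
    return (2 - 1j) * (z - (0.5 + 0.25j)) ** 3


def exponential(z):
    """An instance of A exp(Bz) with illustrative constants."""
    return 1.5 * cmath.exp((0.7 - 0.4j) * z)


def violator(z):
    """z^2 + 1 vanishes at both i and -i but equals 1 at the midpoint 0."""
    return z * z + 1
```

The two admissible forms pass because $|z - B|$ is convex along a line and $|\exp Bz| = \exp(\operatorname{Re} Bz)$ is monotone along a line, while $z^2 + 1$ fails on the segment from $-i$ to $i$.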


I. Solution by P. R. Chernoff and J. Essick, University of California, Berkeley. If f has more than 
one zero, then $|f|$ vanishes on the segment joining two of its zeros; so f is identically zero. Assume 
henceforth that f is not identically zero. 


Case 1. f has one zero. Without loss of generality we may suppose f(0) = 0. If f is a 
polynomial, it is necessarily of the form $cz^n$. But if f is not a polynomial, then by the 
Casorati-Weierstrass theorem there is a sequence of points $p_n$ with $|p_n| \to \infty$ and $|f(p_n)| = \varepsilon_n \to 0$. 
By passing to a subsequence, we may suppose that the unit complex numbers $w_n = p_n/|p_n|$ 
converge to a limit w. Then $|f(z)| \le \varepsilon_n$ on the line segment joining 0 to $p_n$, so that $|f(z)|$ must be 
identically 0 on the ray through 0 along direction w. This is a contradiction. 


Case 2. f has no zeros. Then $f = e^g$ for an entire function g. The real part u(z) of g(z), when 
restricted to any line L, can have no local maxima. Consequently, if the directional derivative 
$D_w u(a)$ along the direction w at the point a should be positive, it follows that u(a + tw) must be 
nondecreasing for t > 0, and thus $u(z) \ge u(a)$ for all z on the ray emanating from a along 
direction w. Note that $D_w u(a) = \operatorname{Re}(w g'(a))$. 

We claim that $g'$ is constant. If not, we can obviously find three points $a_1, a_2, a_3$ such that the 
numbers $c_k = g'(a_k)$ are very close to $\omega^k$, k = 1,2,3, where $\omega = e^{2\pi i/3}$. Suppose now that w is 
any complex number with $\operatorname{Re}(wc_k) > 0$. Then $D_w u(a_k) > 0$, so that $u(z) \ge u(a_k)$ for all z in the 
half plane 


\[ P_k = \{z : \operatorname{Re}((z - a_k)c_k) > 0\}. \]


Now, by the choice of $c_1, c_2, c_3$, the union $P_1 \cup P_2 \cup P_3$ of these half planes includes all of C 
except possibly a bounded triangular region, and therefore u(z) is bounded below on the entire 
plane. This means that g(z) must be constant, contradicting our assumption about $g'$. Thus $g'$ is a 
constant A, g(z) = Az + B, and f(z) = exp(Az + B). 


II. Solution by Jan Maly and Zdenek Vlasek, Charles University, Prague. If f vanishes at the 
endpoints of some segment L = [a, b], then f = 0 on L, and therefore f = 0 on C. Thus we may 
assume that f has at most one zero in C. If f is a polynomial, this shows that $f(z) = A(z - B)^n$ for 
some $n \ge 0$. Assume then that f has an essential singularity at infinity. Let $a_n \to \infty$ be such that 
$f(a_n) \to 0$, and such that $a_n/|a_n| \to s$ for some $|s| = 1$. If $f(z_0) = 0$ for some $z_0$, then the point 


\[ b_n = z_0 + (a_n - z_0)/|a_n - z_0| \]
lies on the segment $[z_0, a_n]$, and so $|f(b_n)| \le |f(a_n)|$. Since $b_n \to z_0 + s$, this forces f to have 
another zero $z_0 + s$. We may therefore assume that $f(z) \ne 0$. For sufficiently large n the point 


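\[ c_n = z + (a_n - z)/(n|a_n - z|) \]

lies on the segment $[z, a_n]$, which implies that $|f(c_n)| \le |f(z)|$. Clearly $c_n \to z$. Now 


\[ \operatorname{Re}\Bigl(s\,\frac{f'(z)}{f(z)}\Bigr) = \operatorname{Re}\Bigl(\lim_{n}\frac{c_n - z}{|c_n - z|}\cdot\lim_{n}\frac{f(c_n) - f(z)}{f(z)(c_n - z)}\Bigr) = \lim_{n} n\Bigl(\operatorname{Re}\frac{f(c_n)}{f(z)} - 1\Bigr) \le 0 \]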


because $\operatorname{Re}(f(c_n)/f(z)) \le |f(c_n)/f(z)| \le 1$. Since z was arbitrary, this shows that the range of 
$f'/f$ is contained in a half-plane, consequently $f'/f$ = constant. Thus in this case there are 
constants A, B such that f(z) = A exp(Bz). 


Values Assumed by a Modular Polynomial 


6340 [1981, 294]. Proposed by J. M. Patin, St. Herblain, France, and H. Stark, University of 
California, San Diego. 


Given a nonconstant polynomial f(x) over Z and a positive $m \in Z$, show that there exists 
$n \in Z$ and a prime $p \equiv 1 \pmod{m}$ such that p divides f(n). (For a special case of this problem see 
Elementary Problem E 2883, page 483.) 


Solution by John Cremona, University of Michigan. Without loss of generality assume that f is 
irreducible. Let $\alpha_1, \alpha_2, \ldots, \alpha_k$ be the roots of f, let $\omega$ be a primitive mth root of unity, and put 
$K = Q(\omega, \alpha_1, \alpha_2, \ldots, \alpha_k)$. Since K is normal over Q, it follows from the Cebotarev density 
theorem (see Algebraic Number Theory, editors Cassels and Fröhlich, p. 227) that the asymptotic 
proportion of rational primes p which split completely in K is 1/[K:Q]. 

Let p be such a prime, which does not divide the discriminant of K. Then p must split 
completely in any subfield of K, and in particular p splits completely in $Q(\alpha_1)$. By a fundamental 
theorem of Dirichlet, since p does not divide the discriminant of $Q(\alpha_1)$ and p splits completely in 
$Q(\alpha_1)$, the congruence $f(x) \equiv 0 \pmod{p}$ has k distinct solutions. 

Let $\Phi_m$ denote the mth cyclotomic polynomial. Arguing similarly, with $Q(\alpha_1)$ replaced by 
$Q(\omega)$, we see that the congruence $\Phi_m(x) \equiv 0 \pmod{p}$ has exactly $\varphi(m)$ roots; let r be one of 
them. Then the order of r in $(Z/pZ)^{*}$ is exactly m, and hence m divides p − 1. 

Thus p has the desired properties. 


Also solved by Ronald Evans, Harold Stark, and Ernst Kleinert (W. Germany). 
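
The statement is easy to test by direct search for small cases. The sketch below is a brute-force illustration, not the density argument of the solution; the function names, the trial-division factoring, and the search limit are all our own assumptions.

```python
def prime_factors(n: int) -> set:
    """Prime divisors of |n| by trial division (empty for n = 0 and n = +-1)."""
    n = abs(n)
    factors = set()
    d = 2
    while d * d <= n:
        while n % d == 0:
            factors.add(d)
            n //= d
        d += 1
    if n > 1:
        factors.add(n)
    return factors


def find_witness(coeffs, m, search_limit=10_000):
    """Search n = 0, 1, 2, ... for a prime p = 1 (mod m) dividing f(n).

    `coeffs` lists the integer coefficients of f, highest degree first.
    Returns the first pair (n, p) found, or None if the search limit is hit.
    """
    def f(x):
        value = 0
        for c in coeffs:  # Horner evaluation
            value = value * x + c
        return value

    for n in range(search_limit):
        for p in sorted(prime_factors(f(n))):
            if p % m == 1 % m:
                return n, p
    return None
```

For example, `find_witness([1, 0, -2], 8)` looks for a prime $p \equiv 1 \pmod 8$ dividing some value of $x^2 - 2$, and `find_witness([1, 0, 1], 4)` does the same for $x^2 + 1$ and m = 4.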


Lattices of Periodic Functions 


6358 [1981, 623]. Proposed by Simon Fitzpatrick and Lee Rubel, University of Illinois at 
Urbana-Champaign. 


The periodic functions on the real line R (of possibly different periods) form a partially 
ordered set under pointwise $\le$. Do they form a lattice; that is, is there a sup and an inf ($\vee$ and 
$\wedge$) of any two periodic functions, not necessarily the pointwise sup and inf? How about the 
functions that are periodic and bounded? Periodic and continuous? 


Solution by Alberto Facchini, Universita di Padova, Italy. The answer is no in general and for 
the periodic bounded functions, and it is yes for the functions that are periodic and continuous. 

Denote the supremum (infimum) of two functions in the partially ordered set by $\vee$ ($\wedge$) and 
the pointwise supremum (infimum) by sup (inf). If S is a subset of R, let $\chi(S)$ be the 
characteristic function of S. 


490 PROBLEMS AND SOLUTIONS [August-September 


To show that the answer is no in general and for the functions that are periodic and bounded, 
we consider the functions 


\[ f_1 = \chi(Z), \quad f_2 = \chi(Z\cdot\sqrt{2}), \quad f_3 = \chi(Z + Z\cdot\sqrt{2}), \]
\[ f_4 = \chi((Z + Z\cdot\sqrt{3}) \cup (Z\cdot\sqrt{2} + Z\cdot\sqrt{3})), \quad f_5 = \chi(Z \cup Z\cdot\sqrt{2}). \]


Then $f_1, f_2, f_3, f_4$ are periodic with periods $1, \sqrt{2}, 1, \sqrt{3}$, respectively, and $f_5$ is not periodic. If 
$f_1 \vee f_2$ exists, then 


\[ f_5 = \sup(f_1, f_2) \le f_1 \vee f_2 \le \inf(f_3, f_4) = f_5, \]


i.e., $f_1 \vee f_2 = f_5$ is not periodic, and this is a contradiction. 
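
The two equalities in the displayed chain can be checked mechanically. The sketch below rests on the fact that 1, $\sqrt{2}$, $\sqrt{3}$ are linearly independent over Q, so a point $a + b\sqrt{2} + c\sqrt{3}$ with integer a, b, c is determined by the triple (a, b, c); all five functions vanish off such points. The encoding and the function names are our own.

```python
from itertools import product


# A point a + b*sqrt(2) + c*sqrt(3) is stored as the integer triple (a, b, c).
def f1(a, b, c):
    """chi(Z): the point is an integer."""
    return int(b == 0 and c == 0)


def f2(a, b, c):
    """chi(Z*sqrt(2)): the point is an integer multiple of sqrt(2)."""
    return int(a == 0 and c == 0)


def f3(a, b, c):
    """chi(Z + Z*sqrt(2))."""
    return int(c == 0)


def f4(a, b, c):
    """chi((Z + Z*sqrt(3)) union (Z*sqrt(2) + Z*sqrt(3)))."""
    return int(b == 0 or a == 0)


def f5(a, b, c):
    """chi(Z union Z*sqrt(2))."""
    return int(c == 0 and (a == 0 or b == 0))


def sandwich_holds(radius=6):
    """Check sup(f1, f2) = f5 = inf(f3, f4) on a cube of integer triples."""
    grid = range(-radius, radius + 1)
    return all(
        max(f1(*p), f2(*p)) == f5(*p) == min(f3(*p), f4(*p))
        for p in product(grid, repeat=3)
    )
```

In this encoding a shift by 1 adds 1 to a, and $f_5$ is visibly not invariant under it: it is 1 at $\sqrt{2}$ but 0 at $1 + \sqrt{2}$.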

To show that the functions that are periodic and continuous do form a lattice, it is obviously 
enough to prove that any two such functions have a $\vee$. Let $f_1, f_2$ be continuous periodic functions 
with A and B as periods respectively. If A/B is rational, A/B = m/n say, then $f_1 \vee f_2 = \sup(f_1, f_2)$ 
has nA = mB as a period. Suppose A/B is not rational. Let $M_i = \max f_i$ for i = 1,2 ($M_i$ exists 
because $f_i$ is periodic and continuous). We may assume $M_1 \le M_2$. Let $g = \sup(M_1, f_2)$. We show 
that g (continuous and periodic of period B) is $f_1 \vee f_2$. Let h be any periodic continuous function 
of period C, such that $h \ge f_1, f_2$. Then either A/C or B/C is not rational. If A/C is not rational 
and $f_1(x_0) = M_1$, then, for all integers n, m, we have 


\[ h(x_0 + nA + mC) = h(x_0 + nA) \ge f_1(x_0 + nA) = f_1(x_0) = M_1. \]

The set $\{x_0 + nA + mC \mid n, m \in Z\}$ is dense in R and h is continuous; therefore $h \ge M_1$ and 
$h \ge g$. Similarly if B/C is not rational, then $h \ge M_2$ and therefore $h \ge \sup(M_1, f_2) = g$. This 
proves that $g = f_1 \vee f_2$. 


Also solved by F. S. Cater, James Caveny, G. A. Edgar, O. P. Lossers (The Netherlands), Mark D. Meyerson, 
and the proposers. Partially solved by Bruce Richter and Herb Shank (Canada). 


The Distribution Function of $\varphi(n)/n$ 


6363 [1981, 710-711]. Proposed by H. G. Diamond, University of Illinois and P. Erdos, 
Hungarian Academy of Sciences. 


Let $\varphi$ denote Euler’s function and set 

\[ D(\alpha) = \lim_{x\to\infty} \frac{1}{x}\,\#\{n \le x : \varphi(n)/n \le \alpha\}, \]

the distribution function of $\varphi(n)/n$. Prove that 

\[ D(1/2) - D(1/4) + D(1/8) - D(1/16) + \cdots = 1/2. \]

Solution by O. P. Lossers, Department of Mathematics and Computing Science, Eindhoven 
University of Technology, Eindhoven, the Netherlands. Define, for $l \ge 0$, 


\[ D_l(\alpha, x) = \frac{1}{x}\,\#\{n \le x : n = 2^l m,\ m \text{ odd and } \varphi(n)/n \le \alpha\}, \]

\[ D_l(\alpha) = \lim_{x\to\infty} D_l(\alpha, x). \]

The existence of $D_l(\alpha)$ can be established in much the same way as the existence of $D(\alpha)$ was 
established by I. J. Schoenberg, Über die asymptotische Verteilung reeller Zahlen mod 1, Math. 
Z., 28 (1928) 171-199. Evidently 


\[ \frac{1}{x}\,\#\{n \le x : \varphi(n)/n \le \alpha\} = \sum_{l=0}^{\infty} D_l(\alpha, x). \tag{1} \]

For $n = 2^l m$, m odd, $l \ge 1$, we have $\varphi(n)/n = \varphi(m)/(2m)$, and hence 
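\[ D_l(\alpha, x) = 2^{-l} D_0(2\alpha, 2^{-l}x) \quad \text{for } l \ge 1. \tag{2} \]
Since $D_0(2\alpha, x) \le 1$, it follows from (1) and (2), by the Weierstrass M-test, that 


\[ D(\alpha) = D_0(\alpha) + \sum_{l=1}^{\infty} 2^{-l} D_0(2\alpha) = D_0(\alpha) + D_0(2\alpha). \]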
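Consequently, 


\[ \sum_{k=1}^{n} (-1)^{k-1} D(2^{-k}\alpha) = D_0(\alpha) + (-1)^{n-1} D_0(2^{-n}\alpha) \to D_0(\alpha) \]


as $n \to \infty$, the distribution function $D_0$ being upper semicontinuous at 0. Hence 


\[ D(\alpha/2) - D(\alpha/4) + D(\alpha/8) - D(\alpha/16) + \cdots = D_0(\alpha), \]


and this equals 1/2 when $\alpha \ge 1$. 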


Also solved by Robert Breusch, William A. Newcomb, R. K. Odoni (U.K.), Lajos Takacs, Charles R. Wall, and 
the proposers. 
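
The counting identity behind (1) and (2) can be verified exactly for finite x. The sketch below compares integer counts rather than densities: the integers $n = 2^l m \le x$ with m odd and $\varphi(n)/n \le \alpha$ correspond to the odd $m \le x/2^l$ with $\varphi(m)/m \le 2\alpha$. The function names and the sieve are our own choices, and exact fractions are used to avoid rounding in the comparisons.

```python
from fractions import Fraction


def totients(limit: int) -> list:
    """Return phi[0..limit], Euler's function computed by a sieve."""
    phi = list(range(limit + 1))
    for p in range(2, limit + 1):
        if phi[p] == p:  # p is prime: no smaller prime has touched it
            for multiple in range(p, limit + 1, p):
                phi[multiple] -= phi[multiple] // p
    return phi


def count_l(alpha: Fraction, x: int, l: int, phi: list) -> int:
    """#{n <= x : n = 2^l * m, m odd, phi(n)/n <= alpha}."""
    step = 2 ** l
    return sum(
        1
        for n in range(step, x + 1, step)
        if (n // step) % 2 == 1 and Fraction(phi[n], n) <= alpha
    )


def count_all(alpha: Fraction, x: int, phi: list) -> int:
    """#{n <= x : phi(n)/n <= alpha}."""
    return sum(1 for n in range(1, x + 1) if Fraction(phi[n], n) <= alpha)
```

Dividing the counts by x recovers the quantities $D_l(\alpha, x)$ of the solution; the sum over l only needs $l < \log_2 x + 1$, since larger powers of 2 exceed x.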


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER, CARLETON COLLEGE 


A History of the Calculus of Variations from the 17th through the 19th Century. By Herman H. 
Goldstine. Studies in the History of Mathematics and Physical Sciences. Edited by G. J. 
Toomer, vol. 5. Springer-Verlag, New York/Heidelberg/Berlin, 1980. pp. xviii + 410, ISBN 
0-387-90521-9. 


HELENA M. PYCIOR 
Department of History, University of Wisconsin-Milwaukee, Milwaukee, WI 53201 


There are two kinds of history of mathematics. One, written primarily by mathematicians, 
examines the past from a presentist perspective. It seeks the roots of modern mathematics in the 
answers to such questions as: When was this result first proved, and by whom? Was the original 
proof correct? This kind of history is usually “technical ... [and] written on a high mathematical 
level” [2, p. 439]. The other, written primarily by historians, evidences more concern for the past 
per se. This second kind of history tends to be less technical, more expository, and broader in 
perspective. It sees historical periods as rich fabrics, and particular mathematical developments as 
threads in those fabrics. Thus, in this kind of history, study of a particular development becomes, 
to a reasonable extent, study of the period in which it occurred. As far as they relate to the given 
topic, the period’s general mathematics, and possibly even science, philosophy, religion, politics, 
and education are considered. 

The book under review is clearly a mathematician’s history, evidencing the strengths and some 
of the weaknesses of that historical genre. It effectively details the evolution of the calculus of 
variations from its roots in Fermat’s papers of 1662 on the passage of light through optical media 
to Hilbert’s discovery of the invariant integral and existence theory. At the same time, it highlights 
the importance of problems (as stimuli), of generalization (as method), and of physical applicability 
(as incentive) to the development of the calculus of variations. These three themes appear early 
(but implicitly) in the book. The first chapter, for example, deals with Newton’s problem of 
motion in a resisting medium, and the brachystochrone problem. Newton’s problem is to find the 
solid of revolution which moves through a resisting medium in the direction of its axis with the 
least resistance. Goldstine describes the problem as “the first genuine problem of the calculus of 
variations” (p. 7), partially because it was the first in the field to be correctly solved, and because 
techniques used in its solution proved useful in the later development of the field. Newton’s 
techniques were used, in particular, in some solutions of the brachystochrone problem. Solved 
early by Newton, Leibniz, and the Bernoulli brothers, this problem is: given two points in a 
vertical plane, find the curve joining those points along which a frictionless particle will fall in the 
shortest time. Goldstine’s emphasis is on James Bernoulli’s solution which, although less elegant 
than his brother’s, was more general and therefore applicable to other problems. These additional 
problems, together with Newton’s and the brachystochrone, constituted a new field of mathematics. 

Chapter 2 covers the work of Euler, who was probably led into the field by the Bernoullis and 
by 1744 gave it a general framework. Euler also enunciated the principle of least action, when he 
wrote: “all the effects of Nature follow a certain law of maxima or minima” (p. 106). Discussed in 
some detail by Goldstine, this principle, of course, became the field’s physical raison d’étre. 
Euler’s general approach to the calculus of maxima and minima was dependent on geometry 
through 1755. In that year, Lagrange (then only nineteen) proposed the idea of a variation which 
enabled mathematicians to bypass the geometrical aspects of Euler’s theory and to develop a truly 
analytic theory in its place. Euler seized Lagrange’s idea and renamed the field of their joint 
endeavors the calculus of variations. Later chapters, covering Jacobi, Weierstrass, Clebsch, Mayer, 
Hilbert, and Kneser, continue the story of the emergence of the classical theory of calculus of 
variations (which, Goldstine maintains, was essentially completed by Hilbert’s work) and further 
underscore the vital roles of problems, generalization, and physical applicability in that theory. 

Goldstine’s deep understanding of the calculus of variations inspires his history and assures its 
mathematical accuracy. Thus, as a mathematician, he may be trusted “to select [judiciously] those 
papers and authors whose works have played key roles in the classical calculus of variations as we 
understand the subject today” (p. vii). Furthermore, when Goldstine explains the content of these 
select works, his mathematical ability enables him to translate readily from old concepts and 
notation to that of modern mathematics. In his discussion of Newton’s problem of motion in a 
resisting medium, for example, he gives Newton’s solution in more modern notation first, and 
only later in the original. This technique clearly facilitates the reader’s transition from the familiar 
to the archaic. 

Goldstine’s experience as a mathematician positively influences not only his selection and 
explanation of historical texts, but the questions he puts to the texts as well. Thus the reader feels 
the touch of the mathematician as Goldstine deals with the concepts of the calculus of variations 
on two levels—as “formalistic constructs” and as ideas with some physical or geometrical 
backing. This bilevel analysis marks, for example, the discussion of the relative contributions of 
Euler and Lagrange to the idea of a variation. Relying apparently on little firm evidence but 
rather on mathematical instinct, Goldstine concludes that Lagrange probably viewed his variation 
operator “as a purely formalistic construct” (p. 112), and that Euler was the first to see it as a 
means of effecting a comparison of curves. 

Yet, in spite of Goldstine’s many strengths as a mathematician-historian, parts of his book 
contain historical shortcomings. Sometimes Goldstine is guilty of extreme presentism; other times, 
of hero worship and of tunnel vision. Goldstine’s presentist perspective leads him to bypass what 
Kenneth O. May, the late dean of North American historians of mathematics, once described as 
“understanding past work as it was, rather than as we would now do it” [3, p. 451]. In the section 
on Leibniz’s solution of the brachystochrone problem, for example, Goldstine dismisses part of 
Leibniz’s proof as “unnecessary” and goes directly to what he considers its “essence.” He gives no 
indication of the nature of the superfluous material, or of the possible reasons behind its inclusion 
in the original proof. Goldstine’s treatment of Clebsch’s work on the problem of Lagrange is even 
more objectionable: here he does not merely delete original material, but actually replaces it with 
his own. Complementing an original proof with a modern formulation (as in Goldstine’s 
treatment of Newton) is laudable; omitting part of the original (as with Leibniz) is tolerable. But 
substituting for the original is inexcusable. In his handling of Clebsch’s work, then, Goldstine 
crosses the line from history to mathematics. Abandoning the historian’s task of recreating the 
past, Goldstine redoes it. 

At least once in the book, Goldstine also crosses the line between appropriate respect for past 
mathematical genius and hero worship. “The historian,” May once wrote, “finds hilarious such 
naive historical mistakes as assuming that... a brilliant mathematician of past centuries must have 
understood a concept or had a proof because these would be evident to lesser lights today” [3, p. 
453]. This mistake is precisely the kind of hero worship to which Goldstine falls victim. He 
laments Newton’s failure to include a key observation in his lean solution of the brachystochrone 
problem. Then he says that Newton knew what he failed to record. “It is reasonable,” Goldstine 
argues, “that he must have seen [this] at once... That is, he certainly must have been aware of the 
fact... .” (p. 35). The historical fact is that Newton did not record the observation, and really that 
is all that Goldstine can or should state at this point. 

More substantively, in its quest for mathematical detail, Goldstine’s work sacrifices breadth of 
coverage. True to the genre of the mathematician’s history of mathematics, Goldstine very 
narrowly delimits his topic. Replete with equations and sometimes multiple proofs of the same 
theorem, his work evidences little interest in possible external influences on the development of 
the calculus of variations, or even a wide mathematical perspective on its subject. Yet, these more 
general considerations, as the book itself suggests, are not artificial, but arise naturally in 
discussion of the history of the calculus of variations. In his section on priority for the principle of 
least action, Goldstine notes that Carathéodory ascribed a philosophical origin to Euler’s 
statement of the principle. Evidencing historical tunnel vision, Goldstine pursues this issue no 
further, leaving the reader with but a footnote to Carathéodory’s work. If a broader extrinsic 
perspective seems advisable, a perspective which takes into account the general development of 
the differential and integral calculus seems mandatory. Yet Goldstine actually says very little 
about the calculus. In fact, he misses a few golden opportunities to connect its development with 
that of the calculus of variations. Thus, for example, he fails to study Lagrange’s work in the 
context of the eighteenth-century movement in the calculus away from geometrical and towards 
analytical foundations. 

There is, finally, one general criticism of the book. Its style is not representative of the finest 
mathematical (or historical) exposition. It contains no adequate statement of purpose or audience, 
makes but few attempts to highlight major ideas through clear prose statements, and evidences 
throughout an extremely tight, technical style. The history of mathematics is not mathematics, and 
seemingly should be directed to audiences beyond mathematical specialists—to general historians 
of science and mathematics majors, in particular. Admittedly, the nature of the calculus of 
variations probably precludes reduction of its history to prose statements and easily understand- 
able equations. But the expository shortcomings of Goldstine’s book cannot all be attributed to 
the mathematical complexity of its subject. The first volume of the Carus Mathematical Mono- 
graphs series was Calculus of Variations [1], Gilbert Ames Bliss’s eminently readable account of 
the field. 

When one puts aside speculations concerning how much Goldstine’s book could have been 
improved had he avoided some common historical pitfalls and adopted a less technical writing 
style, the conclusion is that the work is a solid contribution written in the tradition of the 
mathematician’s history of mathematics. It will probably soon emerge as a standard source on the 
history of the calculus of variations, and be extended by representatives of both genres of 
the history of mathematics. But, clearly, responsibility for further development of the calculus of 
variations now rests with practitioners of the second kind of history of mathematics. Historians 
should begin with Goldstine’s work and add to it historical texture—biographical detail and 
exploration of the past interrelationships among the calculus of variations, general mathematics, 
science, and appropriate cultural factors such as philosophy. 
Computer Logic, Testing and Verification. By Paul Roth. Pitman Publishing Limited, London, 
1980. xx + 176 pp. 


J. R. ARMSTRONG 
Electrical Engineering Department, Virginia Polytechnic Institute, Blacksburg, VA 24061 


1. Introduction. Like all man-made devices, any computer that is built will eventually fail. It is 
important that these failures be detected so that the computer’s results will not be accepted as 
being computationally correct while it is in the failed state. The problem of fault detection in 
computer circuits is thus an important research area for computer scientists and engineers [1], [2]. 


A Combinational Logic Circuit. 
FIG. 1 


Computer circuitry is of two basic types: combinational logic and sequential logic. Combina- 
tional logic is made up of gates which in their most primitive form implement the logical AND, 
OR, and INVERT functions. A combinational logic function is thus composed from these basic 
primitives; for example, the logic circuit shown in Fig. 1 has the output function: 


$$f = (\bar{A} \cdot B) + (C \cdot D).$$


Combinational logic circuits have no memory; i.e., the value of the output is solely dependent 
upon the value of the inputs, and thus will change only when the inputs change and the circuit 
function dictates an output change. In real logic circuits, of course, there is a signal propagation 
delay between input and output changes. 

Sequential logic has memory. The value of the output at a future time is dependent upon the 
present state of the device as well as the present value of its inputs. Fig. 2 shows the state table for 
[State table for the toggle flipflop, with columns T and Q (STATE)] 


Toggle (T) Flipflop. 


FIG. 2 


a “toggle (T) flipflop.” Note that the device retains its old state when T = 0, but inverts its old 
state when T = 1. In sequential circuits, the state to state transition times are controlled either by 
internal circuit signal delays or by the period of a clock which may be applied to the element. 

In real computer circuits combinational and sequential logic are frequently mixed within a 
given logic card, but the approach to testing the two types of logic is different. Let us first 
consider tests for combinational logic. 

In developing these tests, one usually assumes that only one fault is present. Also, the fault 
model that is frequently employed is the “stuck-at” model, i.e., the effect of the fault is to stick a 
signal line at either 1 or at 0. Thus only permanent faults are considered with this model. Note in 
Fig. 1 that all of the signal lines have been numbered. Assume that the fault, line 1 stuck at a 0, is 
present. To develop a test for this fault, we determine an input vector (A, B, C, D) which will give 
one output when the fault is present and the logical complement of this output when it is not. 
There are three basic approaches to this test development: intuitive (which we will not consider 
here), algebraic, and algorithmic. 

First let us consider an example of an algebraic technique. The first step is to derive the output 
function for the circuit in the presence of the fault. For the circuit in Fig. 1 this function is: 


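$$f_1^0 = B + C \cdot D.$$

We next employ the EXCLUSIVE OR operation ($\oplus$) which has the following definition: 

$$x \oplus y = x \cdot \bar{y} + \bar{x} \cdot y.$$

The set of tests ($T$) for the fault in question is then given by the expression: 

$$T = f \oplus f_1^0 = (\bar{A} \cdot B + C \cdot D) \cdot \overline{(B + C \cdot D)} + \overline{(\bar{A} \cdot B + C \cdot D)} \cdot (B + C \cdot D),$$

which can be simplified using postulates of Boolean Algebra to the form: 

$$T = A B (\bar{C} + \bar{D}).$$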
T gives us the complete set of tests for the fault, in this case: A = 1, B = 1, C = 0, D = X (don’t care) 
and A = 1, B = 1, C = X, D = 0. Algebraic methods of this type will always work, in theory, 
provided that the test exists. However, for circuits of even reasonable size the initial expressions 
for T become very complex and the simplification of these expressions using the postulates of 
Boolean algebra is a difficult process not amenable to automation. 


The most important algorithmic approach to logic testing is the “D algorithm.” It is based on a 
calculus of five values: 0, 1, X, D and $\bar{D}$. 0 and 1 have their normal meaning, X denotes a logic 
signal value which is unspecified, D a logic signal value which has the value 0 when the fault is 
present and 1 otherwise, and $\bar{D}$ a logic signal value that has the value 1 when the fault is present 
and 0 otherwise. Using these five values, the state of an n wire logic circuit is then represented as 
an n dimensional vector (cube). Fig. 3 shows two such cubes for the circuit in Fig. 1. 


LINES:  1    2          3    4    5    6          7    8 
CUBES:  D    $\bar{D}$  1    0    X    $\bar{D}$  0    $\bar{D}$ 
        D    $\bar{D}$  1    X    0    $\bar{D}$  0    $\bar{D}$ 


Test Cubes for the Circuit in Fig. 1. 


FIG. 3 


The two cubes shown in Fig. 3 are in fact “test cubes” for the fault, line 1 stuck at a 0. Note 
that the output wire has the value $\bar{D}$ and thus will have the value 1 if this fault is present, the value 
0 otherwise. The values on lines 1, 2 and 6 also switch appropriately depending on the presence of 
this fault. The two test cubes are in fact the end product of the D algorithm which assigns the line 
at the site of the fault a value D for a stuck at 0 and a value $\bar{D}$ for a stuck at 1. It then determines 
the conditions necessary to propagate the D or $\bar{D}$ to the circuit output. These conditions are then 
“justified” in terms of input signal values, thus developing a test for the fault. In order to 
propagate the D or $\bar{D}$ to the output, it is necessary to redefine the AND, OR and INVERT to 
allow for D and $\bar{D}$ inputs (see Fig. 4 below). 


[Tables defining the AND, OR and INVERT primitives on the values 0, 1, D and $\bar{D}$] 


Primitives for Propagation of D and $\bar{D}$. 


FIG. 4 


Thus for our example, in order to drive the D on line 1 to the output, the algorithm first 
determines that line 3 has to be a logic 1 and that line 7 has to be a logic 0. The 0 on line 7 is then 
justified by setting either line 4 or line 5 to 0, thus generating the two test cubes for the fault, line 
1 stuck at a 0. 
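
The forward half of this process, carrying a D through the gates, can be sketched in a few lines of Python. The sketch is not the D algorithm itself, which also searches for the side conditions and justifies them. It only evaluates the redefined primitives. Two assumptions are made: each value is modeled as a pair (fault-free bit, faulty bit), and the line numbering (1 = A, 2 = inverted A, 3 = B, 4 = C, 5 = D, 6 and 7 = AND gate outputs, 8 = circuit output) is inferred from Fig. 3 rather than read off Fig. 1. The name `D'` stands for the complemented D.

```python
# Five-valued D-calculus: each value is a pair (fault-free bit, faulty bit); X is unspecified.
ZERO, ONE, D, DBAR, X = (0, 0), (1, 1), (1, 0), (0, 1), None
NAMES = {ZERO: "0", ONE: "1", D: "D", DBAR: "D'", X: "X"}


def inv(a):
    return X if a is X else (1 - a[0], 1 - a[1])


def and_(a, b):
    if a == ZERO or b == ZERO:      # a controlling 0 decides the output
        return ZERO
    if a is X or b is X:
        return X
    return (a[0] & b[0], a[1] & b[1])


def or_(a, b):
    if a == ONE or b == ONE:        # a controlling 1 decides the output
        return ONE
    if a is X or b is X:
        return X
    return (a[0] | b[0], a[1] | b[1])


def cube(l1, l3, l4, l5):
    # Evaluate the assumed Fig. 1 circuit line by line and return all eight values.
    l2 = inv(l1)
    l6 = and_(l2, l3)
    l7 = and_(l4, l5)
    l8 = or_(l6, l7)
    return [NAMES[v] for v in (l1, l2, l3, l4, l5, l6, l7, l8)]


print(cube(D, ONE, ZERO, X))
print(cube(D, ONE, X, ZERO))
```

Under these assumptions the two calls reproduce the two test cubes of Fig. 3: line 3 held at 1, line 7 forced to 0 by either line 4 or line 5, and the fault effect arriving inverted at the output.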

Based on our interpretation of the values D and $\bar{D}$, note that the above two tests also detect the 
faults lines 2, 6 and 8 stuck at a 1. Although not evident from Fig. 3, further reflection will show 
that the two tests also detect the fault line 7, stuck at a 1. A typical approach then is to: (1) 
assume the existence of a stuck-at-fault, (2) use the D algorithm to determine test(s) for that fault 
and as a by-product other faults along the drive path, (3) use “further reflection,” as implemented 
in another algorithm, to determine other faults detected by the tests determined in (2). This 
approach has been found to be very effective and is widely used in the computer industry today. 
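
Step (3) amounts to fault simulation: apply the test, inject each candidate fault in turn, and see whether the output changes. The Python sketch below does this for a single test vector. It assumes the same line numbering as inferred from Fig. 3 (1 = A, 2 = inverted A, 3 = B, 4 = C, 5 = D, 6 = AND of lines 2 and 3, 7 = AND of lines 4 and 5, 8 = output), and it fixes the don't-care input at 0. It is a plain exhaustive loop, not the algorithm used in practice.

```python
def simulate(a, b, c, d, fault=None):
    """Return the values on lines 1..8; fault is (line, stuck_value) or None."""
    lines = {}

    def drive(n, value):
        # A stuck line ignores its driver and holds the stuck value.
        lines[n] = fault[1] if fault is not None and fault[0] == n else value
        return lines[n]

    l1 = drive(1, a)
    l2 = drive(2, 1 - l1)
    l3 = drive(3, b)
    l4 = drive(4, c)
    l5 = drive(5, d)
    l6 = drive(6, l2 & l3)
    l7 = drive(7, l4 & l5)
    drive(8, l6 | l7)
    return lines


def detected_faults(vector):
    # A fault is detected when it changes the output (line 8) for this vector.
    good_out = simulate(*vector)[8]
    return sorted(
        (n, s)
        for n in range(1, 9)
        for s in (0, 1)
        if simulate(*vector, fault=(n, s))[8] != good_out
    )


print(detected_faults((1, 1, 0, 0)))
```

For the vector A = 1, B = 1, C = 0, D = 0 the detected faults are line 1 stuck at 0 and lines 2, 6, 7 and 8 stuck at 1, in agreement with the discussion above.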

When a logic circuit contains sequential logic, i.e., has memory, the testing problem is greatly 
complicated by the fact that the states of these devices are unknown in the presence of faults. 
Because of this, from a theoretical point of view, the testing problem has never been satisfactorily 
solved for sequential circuits. An approach that has worked successfully in practice is to provide 
special test inputs and outputs which can be used to preset and observe the states of memory 
elements. Once this capability is in place, the rest of the circuit, being combinational logic, can be 
tested using the testing techniques described above. For example, suppose that the circuit in Fig. 1 
receives its inputs (A, B,C, D), not from system input lines, but from four memory devices 
(flipflops). Then if one has the ability to provide inputs to these flipflops (Scan In) and observe 
that the flipflops have been set correctly (Scan Out), the rest of the circuit is easily tested. 
The word “authority” is much misused, but if any one deserves this characterization in the field 
of logic testing, it is Paul Roth, author of Computer Logic, Testing and Verification. His 
experience in the field of logic design and testing dates back to his early work with the Von 
Neumann team at the Institute for Advanced Study, which built the first stored program 
computer in the United States. However, his most significant accomplishment was the development 
of cubical calculus and its most important application in the field of logic testing: the D 
algorithm. In addition he has been instrumental in the development of effective means of “further 
reflection” for determining all faults covered by a test input. This work at IBM has been a major 
factor in the development of a test capability that has developed tests for systems as large as 
750,000 gates! 

The cubical calculus that Roth developed has other important applications. It provides a basic 
method of circuit description which can be used for logic minimization and verification. Logic 
minimization involves finding a minimal set of cubes which cover the function. Fig. 5 below gives 
a minimal “cover” for the function implemented by the circuit in Fig. 1. 


Line No.   1   2   3   4   5   6   7   8 

           0   1   1   X   X   1   X   1 
           X   X   X   1   1   X   1   1 

Minimal Cover for the Circuit in Fig. 1. 


FIG. 5 


If one has the cubical description for two circuits, these descriptions can be compared to 
determine if the two circuits perform the same function. Thus it can also be used for design 
verification, both in initial phases of the design process, and also during later phases when 
engineering changes are made to the logic. In summary, cubical calculus provides a basis for the 
algorithmic solution of three of the most important problems in the field of design automation. 
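
As a small illustration of verification by comparing descriptions, the Python sketch below checks that the two-cube cover of Fig. 5, restricted to the input lines, describes the same function as the gate-level circuit. The circuit function is the assumed reading f = (NOT A AND B) OR (C AND D), the helper names are hypothetical, and the comparison is done by exhaustive enumeration as a stand-in for the cube operations of the cubical calculus.

```python
from itertools import product

# Cubes over the inputs (A, B, C, D); None marks a don't-care (X) position.
COVER = [(0, 1, None, None), (None, None, 1, 1)]


def in_cube(vector, cube):
    # A vector lies in a cube when it matches every specified position.
    return all(c is None or c == v for v, c in zip(vector, cube))


def cover_output(vector, cover=COVER):
    # The covered function is 1 exactly on the union of the cubes.
    return int(any(in_cube(vector, cube) for cube in cover))


def circuit_output(vector):
    a, b, c, d = vector
    return ((1 - a) & b) | (c & d)


def equivalent(f, g, n_inputs=4):
    # Two descriptions agree when they give the same output on every input vector.
    return all(f(v) == g(v) for v in product((0, 1), repeat=n_inputs))


print(equivalent(cover_output, circuit_output))
```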

One of the ironies of this LSI era of logic design is that, even though the capability of being 
able to put tens of thousands of gates on a single chip has allowed us to design very reliable 
systems, the problem of testing these devices has become progressively more difficult as the chip 
size increases. This is because the observability of internal logic signals decreases with chip size. 
Also, in order to place more gates on a single chip, it is necessary to shrink the physical size of the 
individual gates and also the spacing between interconnect lines. Thus if one has a physical defect 
in a chip, it is increasingly likely that it will cause a number of gates to malfunction. This brings 
into question the single fault assumption mentioned above and also the stuck-at-fault model which 
is the basis for most approaches to logic testing. It is increasingly important to develop test 
techniques which treat an LSI device as a single functional entity instead of a collection of 
thousands of gates. To be sure, various approaches to functional level modeling are beginning to 
appear [3] but corresponding activity in test algorithm design using higher level models has lagged. 
Some of this is due to the complexity of the problem, but it also may be that the computer 
industry has gotten too comfortable with the gate level model. The challenge then is to develop 
high level modeling techniques which will not only reproduce the behavior of good LSI devices 
but faulty ones as well. A particularly interesting problem is that of determining how physical 
defects in integrated circuits map into effects at the functional model level. Solutions to problems 
of this type are necessary if advances in logic testing techniques are to keep pace with the 
technological explosion that is VLSI. 
Elementary Stability and Bifurcation Theory. By Gérard Iooss and Daniel D. Joseph. Springer- 
Verlag, New York, 1980. xv + 286 pp. 


STEPHEN SCHECTER 
Department of Mathematics, North Carolina State University, Raleigh, NC 27650 


Bifurcation is the appearance of new solutions to a problem when a parameter changes. A 
simple example is what happens (or at least what one can imagine happens) when one places a 
marble in the groove of an upright hula hoop and spins the hoop about its vertical axis.* 


[Figure: the marble in the spinning hoop, shown for slow and fast rotation] 


If the hoop does not spin too fast, the marble sits at the bottom of the hoop. If the hoop spins 
faster than a critical angular velocity, the marble creeps up the hoop, to a point where gravity and 
centrifugal force balance, and stays there. One cannot predict to which side of the hoop the 
marble will move. Thus we have the “pitchfork” bifurcation diagram: 


[Figure: pitchfork bifurcation diagram; axes: angular velocity of hoop, position of marble on hoop] 


*This example is taken from J. Marsden and M. McCracken, The Hopf Bifurcation and its Applications, 
Springer-Verlag, 1976, p. 3. 
The bottom position remains a solution for all angular velocities of the hoop, i.e., if the marble sits 
precisely at the bottom of the hoop, there is no force on it. However, when the angular velocity of 
the hoop passes a critical value, the bottom solution becomes unstable, and two new stable 
solutions—one of which we actually observe—appear. (Since the force on the marble depends on 
the marble’s velocity relative to the hoop as well as its position and the angular velocity of the 
hoop, the state space for this problem actually has another dimension. Nevertheless the diagram 
captures the essential features.) 

The study of bifurcations has been invigorated during the past twenty years by the injection of 
ideas from various areas of mathematics. These ideas include topological degree and related 
topological invariants; symmetry groups; methods of proof that depend on the center manifold 
theorem and hard implicit function theorems; the notions of contact equivalence, normal forms, 
and unfolding from the theory of singularities of maps; and the understanding of complicated 
qualitative behavior of solutions of differential equations achieved by dynamical systems theory. 
At the same time the traditional power series methods for computing bifurcating solutions and 
determining their stability have been refined and applied to new situations. Iooss and Joseph’s 
book is mostly about power series methods. 

Let us work out an example. In the hoop-and-marble example we were concerned with 
bifurcation of equilibrium solutions of an ordinary differential equation defined on a finite 
dimensional space. It is more usual in Iooss and Joseph’s book to be concerned with parabolic 
partial differential equations. So for our example we shall look for equilibrium solutions of 


$$u_t = u_{xx} + (1 + \lambda)u + u^2,$$
$$u(0) = u(\pi) = 0.$$


Here $u(x, t)$ is a function of $x \in [0, \pi]$ and time $t$; $\lambda$ is a parameter. We are interested in 
equilibrium solutions ($u_t = 0$), so we have 

$$u'' + (1 + \lambda)u + u^2 = 0, \qquad u(0) = u(\pi) = 0. \tag{1}$$

Now $u$ is a function of $x$ only. For each fixed $\lambda$, (1) is a nonlinear boundary value problem which 
has at least the trivial solution $u = 0$. 

We shall first set up this problem as an analyst might, then formally compute a solution using 
power series, then indicate how the computation can be justified using the analyst’s framework. 

We shall look for solutions of (1) in the space of functions $u$ on $[0, \pi]$ such that (i) $u$ is 
differentiable, (ii) $u'$ is absolutely continuous, (iii) the function $u''$ of which $u'$ is the antiderivative 
is square integrable, and (iv) $u(0) = u(\pi) = 0$. Analysts denote this space by $H^2 \cap H_0^1$; we shall 
call it $H$. Every function in $H$ can of course be expanded in a Fourier sine series; in fact, 

$$H = \Bigl\{ u : [0, \pi] \to \mathbb{R} : u(x) = \sum_{n=1}^{\infty} a_n \sin nx \ \text{and} \ \sum_{n=1}^{\infty} n^4 a_n^2 < \infty \Bigr\}.$$

For a norm on $H$ one may take 

$$\|u\| = \Bigl[ \sum_{n=1}^{\infty} n^4 a_n^2 \Bigr]^{1/2}.$$


Define $F : H \times \mathbb{R} \to L^2[0, \pi]$ by $F(u, \lambda) = u'' + (1 + \lambda)u + u^2$. We shall think of $L^2[0, \pi]$ 
as $\{\sum a_n \sin nx : \sum a_n^2 < \infty\}$. $F$ is a continuous linear mapping, $(u, \lambda) \to u'' + u$, plus a 
continuous quadratic mapping, $(u, \lambda) \to \lambda u + u^2$. (The continuity of $u \to u^2$ follows from a Sobolev 
inequality.) So $F$ is an analytic map. We want solutions of $F(u, \lambda) = 0$ besides the trivial solutions 
$u = 0$. 

We shall look for a curve of nontrivial solutions defined near $u = 0$, $\lambda = 0$, i.e., a mapping 
$(u(\varepsilon), \lambda(\varepsilon))$: an interval around 0 in $\mathbb{R} \to H \times \mathbb{R}$ such that $u(\varepsilon) = 0$ if and only if $\varepsilon = 0$, 
$\lambda(0) = 0$, and $(u(\varepsilon), \lambda(\varepsilon))$ satisfies (1) for each $\varepsilon$. If $u(\varepsilon)$ and $\lambda(\varepsilon)$ are analytic functions of $\varepsilon$, we 
can write 

$$(u(\varepsilon), \lambda(\varepsilon)) = \sum_{k=1}^{\infty} \frac{\varepsilon^k}{k!} \bigl(u^{(k)}, \lambda^{(k)}\bigr). \tag{2}$$

Each coefficient $u^{(k)} \in H$ and each $\lambda^{(k)} \in \mathbb{R}$. Of course there are many ways to parameterize a 
curve, such as by arc length or by projection onto some axis. In our computation we shall assume 
our curve can be parameterized by the first Fourier coefficient of $u$, so $\varepsilon$ = first Fourier coefficient 
of $u(\varepsilon)$. 

To determine $u^{(k)}$ and $\lambda^{(k)}$, we substitute (2) into (1), group by powers of $\varepsilon$, and set the 
coefficients equal to 0: 

Coefficient of $\varepsilon$: $(u^{(1)})'' + u^{(1)} = 0$. The only solutions in $H$ are $u^{(1)} = C \sin x$. Since $\varepsilon$ is to be the 
first Fourier coefficient of $u(\varepsilon)$, we set $u^{(1)} = \sin x$. 

Coefficient of $\varepsilon^2/2!$: We find that 

$$(u^{(2)})'' + u^{(2)} = -2\lambda^{(1)} u^{(1)} - 2\bigl(u^{(1)}\bigr)^2. \tag{3}$$


Remember, we have not yet chosen $\lambda^{(1)}$. This is a good thing, because whether or not (3) can be 
solved for $u^{(2)}$ depends on what $\lambda^{(1)}$ is. Suppose the right-hand side of (3) is $\sum_{n=1}^{\infty} b_n \sin nx$. Writing 
$u^{(2)} = \sum_{n=1}^{\infty} a_n \sin nx$, we find that 

$$\sum_{n=1}^{\infty} (1 - n^2) a_n \sin nx = \sum_{n=1}^{\infty} b_n \sin nx.$$

If $b_1 \neq 0$, there are no solutions; if $b_1 = 0$, the general solution is 

$$u^{(2)} = \sum_{n=2}^{\infty} \frac{b_n}{1 - n^2} \sin nx + C \sin x. \tag{4}$$

(This is the Fredholm alternative for $u \to u'' + u$ from $H$ to $L^2[0, \pi]$.) Since $u^{(1)} = \sin x$ we may 
choose $\lambda^{(1)}$ so that the right hand side of (3), when expanded in a Fourier sine series, has $b_1 = 0$. 
We then take $u^{(2)}$ to be the solution (4) with $C = 0$ (because of our decision about how to 
parameterize the curve). 
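
For the reader who wants the number, the choice of $\lambda^{(1)}$ can be made explicit. The short computation below is a worked instance of the solvability condition $b_1 = 0$, using only $u^{(1)} = \sin x$ and the standard formula $b_1 = \frac{2}{\pi}\int_0^{\pi} g(x) \sin x \, dx$ for the first Fourier sine coefficient of a function $g$ on $[0, \pi]$.

```latex
% Solvability condition for (3) with u^{(1)} = \sin x.
\begin{align*}
\text{right-hand side of (3)} &= -2\lambda^{(1)} \sin x - 2\sin^2 x, \\
b_1 &= \frac{2}{\pi}\int_0^{\pi} \bigl(-2\lambda^{(1)} \sin x - 2\sin^2 x\bigr)\sin x \, dx \\
    &= -2\lambda^{(1)} - \frac{4}{\pi}\int_0^{\pi} \sin^3 x \, dx
     = -2\lambda^{(1)} - \frac{16}{3\pi}, \\
b_1 = 0 \quad &\Longrightarrow \quad \lambda^{(1)} = -\frac{8}{3\pi}.
\end{align*}
```

Here $\int_0^{\pi} \sin^2 x \, dx = \pi/2$ and $\int_0^{\pi} \sin^3 x \, dx = 4/3$ were used. To first order the nontrivial branch is therefore $\lambda \approx -8\varepsilon/3\pi$.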

In general, when we look at the coefficient of $\varepsilon^k/k!$, we find that 

$$(u^{(k)})'' + u^{(k)} = -k\lambda^{(k-1)} u^{(1)} + \text{terms involving } u^{(1)}, \ldots, u^{(k-1)}, \lambda^{(1)}, \ldots, \lambda^{(k-2)} \tag{5}$$

(which have already been determined). 

We choose $\lambda^{(k-1)}$ so that (5) has solutions, then take $u^{(k)}$ to be the solution whose first Fourier 
coefficient is 0. 

So we have a formal solution (2). To understand it better, we shall look at $F(u, \lambda) = 0$ from a 
more geometric viewpoint. We can use the implicit function theorem to reduce the problem of 
solving $F(u, \lambda) = 0$ to just one real equation in two real unknowns, and we can easily determine 
enough about this equation to see that it has a unique nontrivial analytic solution curve through 
$u = 0$, $\lambda = 0$. This is the curve whose Taylor expansion we have found, so our solution is not just 
formal. Here is the argument: 

The derivative of $F$ at $(u, \lambda)$ is a linear map from $H \times \mathbb{R}$ to $L^2[0, \pi]$. We have 
$DF(0, 0) \cdot (v, \mu) = v'' + v$. We let 

$K$ = span of $\sin x$ in $H$. 
$K^\perp$ = subspace of $H$ consisting of functions whose first Fourier coefficient is 0. 
$R$ = subspace of $L^2[0, \pi]$ consisting of functions whose first Fourier coefficient is 0. 
$R^\perp$ = span of $\sin x$ in $L^2[0, \pi]$. 
The linear map $v \to v'' + v$ from $H$ to $L^2[0, \pi]$ has kernel $K$ and range $R$. (If this linear map 
were an isomorphism, the implicit function theorem would tell us there could be no nontrivial 
solutions near $u = 0$, $\lambda = 0$.) Let $P : L^2[0, \pi] \to R$ denote projection onto $R$ with kernel $R^\perp$, so 
that 

$$P\Bigl(\sum_{n=1}^{\infty} b_n \sin nx\Bigr) = \sum_{n=2}^{\infty} b_n \sin nx \quad \text{and} \quad (I - P)\Bigl(\sum_{n=1}^{\infty} b_n \sin nx\Bigr) = b_1 \sin x.$$

Now $F(u, \lambda) = 0$ if and only if $PF(u, \lambda) = 0$ and $(I - P)F(u, \lambda) = 0$. Since $P \cdot DF(0, 0)$ is 
surjective, the implicit function theorem tells us that near $(0, 0)$ the set of $(u, \lambda)$ such that 
$PF(u, \lambda) = 0$ is a two-dimensional surface tangent to $K \times \mathbb{R}$. In other words, $PF(u, \lambda) = 0$ if 
and only if 

$$(u, \lambda) = (a \sin x + v(a, \lambda), \lambda),$$

where $v(a, \lambda)$ is an analytic mapping of $\mathbb{R}^2$ into $K^\perp$, $v(0, 0) = 0$, and $Dv(0, 0) = 0$. To solve 
$F(u, \lambda) = 0$ we need only solve $(I - P)F(u, \lambda) = 0$ where $(u, \lambda)$ has the above form. This is just 
one analytic equation in two unknowns. (This reduction is called the Liapunov-Schmidt procedure.) 

Writing $(I - P)F(a \sin x + v(a, \lambda), \lambda) = 0$, we find 

$$(I - P)(\lambda a \sin x + a^2 \sin^2 x + \text{higher order terms}) = 0. \tag{6}$$

Equation (6) just says that the first Fourier coefficient of the function in parentheses is 0: 

$$\begin{aligned} \frac{2}{\pi} \int_0^{\pi} (\lambda a \sin x + a^2 \sin^2 x + \cdots) \sin x \, dx &= 0, \\ \lambda a + \frac{8}{3\pi} a^2 + \cdots &= 0. \end{aligned} \tag{7}$$

An equation in two variables of the form (7) has for its solution set near (0, 0) two analytic curves, 
one tangent to $a = 0$, one tangent to $\lambda + 8a/3\pi = 0$. The first is just the trivial solution, the 
second is the curve of nontrivial solutions we computed earlier. This justifies our power series 
computation and shows, moreover, that there are no other solutions near $u = 0$, $\lambda = 0$. 
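
The tangent direction can also be checked numerically, independently of the series. The Python sketch below is a simple shooting computation and is not a method taken from Iooss and Joseph's book. It integrates the equilibrium equation from x = 0 with a small initial slope, bisects on the parameter until the boundary condition at π is met, and compares the resulting parameter value with -8a/3π, where a is the computed first Fourier coefficient. The step count, bracket and initial slope are arbitrary illustrative choices.

```python
import math


def shoot(lam, slope, steps=1000):
    """Integrate u'' = -(1 + lam) u - u^2, u(0) = 0, u'(0) = slope, up to x = pi (RK4).

    Returns u(pi) and the first Fourier sine coefficient (2/pi) * integral of u sin x.
    """
    def acc(u):
        return -(1.0 + lam) * u - u * u

    h = math.pi / steps
    u, v, x, integral = 0.0, slope, 0.0, 0.0
    for _ in range(steps):
        f_left = u * math.sin(x)
        k1u, k1v = v, acc(u)
        k2u, k2v = v + 0.5 * h * k1v, acc(u + 0.5 * h * k1u)
        k3u, k3v = v + 0.5 * h * k2v, acc(u + 0.5 * h * k2u)
        k4u, k4v = v + h * k3v, acc(u + h * k3u)
        u += h * (k1u + 2.0 * k2u + 2.0 * k3u + k4u) / 6.0
        v += h * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        x += h
        integral += 0.5 * h * (f_left + u * math.sin(x))  # trapezoid rule
    return u, 2.0 * integral / math.pi


def branch_point(slope, lo=-0.2, hi=0.0, iterations=40):
    """Bisect on lam until u(pi) = 0; assumes a small positive slope."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if shoot(mid, slope)[0] > 0.0:   # first zero lies beyond pi: lam is too small
            lo = mid
        else:
            hi = mid
    lam = 0.5 * (lo + hi)
    return lam, shoot(lam, slope)[1]


lam, a = branch_point(0.05)
print(lam, a, -8.0 * a / (3.0 * math.pi))
```

For small amplitudes the first and third printed numbers should agree closely, with a discrepancy of second order in a.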

Most of Iooss and Joseph’s book is careful computation of various kinds of bifurcating 
solutions and their stability by power series methods. The variety of problems that can be handled 
by this technique is impressive. I want to mention in particular the treatment of subharmonic 
bifurcation, which is the bifurcation, from a periodic solution of a periodically forced differential 
equation, of periodic solutions with longer period. The justification of power series methods, 
which may require difficult analysis, is only sketched. A mathematician—even one familiar with 
the “modern” approaches to bifurcation theory—can learn a lot from Iooss and Joseph’s book, 
but he will have to fill in some theory. The book’s natural audience is applied mathematicians, 
engineers and scientists, at the beginning graduate level on up, who are motivated to learn the 
methods by bifurcation problems they have encountered in their work. Springer-Verlag has 
included Iooss and Joseph’s book in its undergraduate text series, but the book is probably too 
demanding for an undergraduate course. 


MISCELLANEA 


111. You state this result and then try to mortify me by saying that it is easy to prove. Since I 
can’t succeed in doing so, I appeal to your good nature to help me out of the difficulty. 


— Hermite to Stieltjes, 26 April 1892. 


LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, Indiana University, Bloomington,
IN 47405. 


Editor: 


When Professor Mark Kac wrote the review (this Monthly, 89 (1982) 713-714) of my 
double-biographical essay, John von Neumann and Norbert Wiener: From Mathematics to the 
Technologies of Life and Death, he evidently based it on some misinformation. The purpose of this 
note is to set the record straight. 

Kac is under the mistaken impression that I never met Mr. Nicholas Vonneuman, John von 
Neumann’s brother. In fact Mr. Vonneuman and I spent two hours together over lunch well over a 
decade ago, on May 18, 1971, in Philadelphia. The purpose of our lunch meeting was primarily to 
get acquainted, since already at that time I was planning to write about John von Neumann. 


—S. J. Heims 


Department of Physics 
University of Massachusetts, Boston. 


Professor Kac comments as follows: 


When I learned from a third person that Mr. Vonneumann claimed never to have met Mr. 
Heims, I asked Mr. Vonneumann for permission to include this information in the review and I 
sent him a copy of the review. Though no formal permission has been given (on the grounds that 
it was unnecessary), there has been no objection on the part of Mr. Vonneumann to its inclusion. 


—Mark Kac 


Editor: 


I have been trying to document the reason for there not being a Nobel Prize in Mathematics. Is 
this actually due to a feud between Nobel and Mittag-Leffler? 
I should be interested in hearing from someone who has a reference on this point. 
—Amy C. King 
Department of Mathematics 
Eastern Kentucky University 
Richmond, Kentucky 40475 


If anyone can supply some documentation that is not just iterated gossip, probably most 
readers of the MONTHLY would like to know about it. It might be quite appropriate to publish an 
answer to Professor King’s question as another Letter to the Editor. 


—Editor 
THE QUARTERLY REPORTS OF S. RAMANUJAN 


BRUCE C. BERNDT* 
Department of Mathematics, University of Illinois, Urbana, IL 61801 


Introduction. From about the age of 15 or 16 until his untimely death at the age of 32, 
Ramanujan devoted all of his energy to the pursuit of mathematics. His research was accom- 
plished in isolation and obscurity until, at the age of 22, he obtained a meeting with V. R. Aiyar, 
the founder of the Indian Mathematical Society. From that moment in 1910, word of Ramanujan’s 
mathematical genius slowly began to spread among mathematicians in southeast India. Several 
people, including R. Ramachandra Rao, P. V. Seshu Aiyar, S. N. Aiyar, Sir Francis Spring, and 
Sir Gilbert Walker, took a kindly interest in Ramanujan through financial support, employment, 
and encouragement. In particular, on February 26, 1913, the English astronomer Walker sent a 
letter to the registrar of the University of Madras, Francis Dewsbury, with the emphatic 
recommendation, “The University would be justified in enabling S. Ramanujan for a few years at 
least to spend the whole of his time on mathematics, without any anxiety as to his livelihood.” The 
Board of Studies at the University of Madras agreed to this request, and its chairman, Professor B. 
Hanumantha Rao, wrote a letter to the Vice-Chancellor on March 25, 1913, with an exhortation 
that Ramanujan be awarded a scholarship of 75 rupees per month. Again, the decision was swift, 
and Ramanujan was granted a scholarship commencing on May 1, 1913. 

A stipulation in the scholarship required Ramanujan to submit quarterly reports detailing his 
research to the Board of Studies in Mathematics. Ramanujan wrote three such quarterly reports 
dated 5th August 1913, 7th November 1913, and 9th March 1914 before he departed for England 
on March 17, 1914. Possibly these reports still remain at the University of Madras, but they 
evidently have been either lost or misplaced. Fortunately, in 1925, T. A. Satagopan made a 
handwritten copy of the reports on 51 foolscap pages. This copy was sent to G. H. Hardy and is 
now at the library of Trinity College, Cambridge. Also on file at Trinity College is a second copy 
of the reports made by G. N. Watson, who, along with B. M. Wilson, attempted to edit 
Ramanujan’s notebooks [22] in the 1930’s. Although the reports have never been published, Hardy 
used material from the reports as the basis for Chapter 11 of his book [14] on Ramanujan’s work. 

Besides the quarterly reports, other manuscripts of Ramanujan remain unpublished. Two quite 
different and fascinating descriptions of some of these papers have recently been given by K. G. 
Ramanathan [19] and R. A. Rankin [23]. An interesting account of Ramanujan’s “lost” notebook 
has been written for this MONTHLY by G. E. Andrews [1], who is in the process of attempting to 
prove all of the formulas in this manuscript. An unedited facsimile edition of Ramanujan’s 
notebooks [22] has been published. For general descriptions of the notebooks, see papers of 
Watson [25], and the author [3] who currently is editing chapters of the notebooks. 

The purpose of this paper is to describe the most significant results found in the quarterly 
reports and to place them in an historical perspective. A complete description of the reports is 
being published elsewhere [4]. In contrast to his notebooks which contain very few proofs, or even 
sketches, the quarterly reports offer several fairly detailed proofs. However, many of these proofs, 
especially those for the principal theorems, are formal and not rigorous. Nonetheless, Ramanujan’s 
proofs are enormously interesting because they provide insight into how Ramanujan reasoned, 
and so we shall describe some of Ramanujan’s arguments. We shall also indicate, frequently with 
references to the literature, how Ramanujan’s theorems can be put on firm foundations. 


The author received his A.B. degree from Albion College in 1961 and his Ph.D. from the University of Wisconsin 
in 1966 under the direction of J. R. Smart. After one year at the University of Glasgow, he came to the University of 
1979 he received an Allendoerfer award from the MAA for his paper, Ramanujan’s Notebooks. 

The first two reports and a portion of the third are concerned with certain integral formulas 
which, in a sense, are interpolation formulas and which are connected with the theory of integral 
transforms. In discussing one of these formulas in an article for the MONTHLY in 1937, Hardy [13, 
p. 150], [14, Chapter 1] remarks, “There is one particularly interesting formula... of which he was 
especially fond and made continual use.” In Chapter 11 of [14], Hardy further observes, in 
connection with the aforementioned formulas, that Ramanujan “had not ‘really’ proved any of 
the formulae which I have quoted. It was impossible that he should have done so because the 
‘natural’ conditions involve ideas of which he knew nothing in 1914, and which he had hardly 
absorbed before his death.” The natural conditions to which Hardy refers are from the theory of 
functions of a complex variable. Indeed, as we shall momentarily see, the proofs given in the 
quarterly reports are not rigorous. After arriving in England, Ramanujan evidently learned, 
probably from Hardy, that his proofs were not sound. For in a paper published in 1915, after 
offering some beautiful integral evaluations, Ramanujan [20], [21, p. 57] remarks that “My own 
proofs of the above results make use of a general formula, the truth of which depends on 
conditions which I have not yet investigated completely. A direct proof depending on Cauchy’s 
theorem will be found in Mr. Hardy’s note which follows this paper.” 

Some of the principal integral formulas and their applications in the quarterly reports can be 
found in the notebooks [22], primarily in Chapters 3 and 4, which have now been thoroughly 
examined and edited [5], [6]. As remarked above, some of the reports’ contents have been 
discussed by Hardy [14]. However, the quarterly reports contain many additional results not 
covered in the aforementioned sources. In particular, a beautiful and very useful generalization of 
Frullani’s integral theorem is found in the reports. This hitherto unobserved theorem provides a 
powerful tool for evaluating many integrals and deserves more attention. 

In addition to material on integral interpolation formulas and transforms, the third quarterly 
report contains results on orders of infinity, fractional composition of functions, and fractional 
differentiation. Each of these topics is treated only briefly. It might be recalled that in his first 
letter to Hardy [21, p. xxii], dated 16th January 1913, Ramanujan mentions that he has been 
reading Hardy’s tract, Orders of Infinity [12]. 

The author is grateful to the Master and Fellows at Trinity College, Cambridge, for a copy of 
the quarterly reports and permission to publish an analysis of them. 


1. The First Quarterly Report. In contrast to the second and third reports, the first report 
commences with a letter of introduction which we completely reproduce below. 


From S. Ramanujan, Scholarship holder in mathematics. 
To the Board of Studies in Mathematics. 

Through the Registrar, University of Madras. 
Gentlemen, 

With reference to para. 2 of the University Registrar’s letter no. 1631 dated the 9th April 
1913, I beg to submit herewith my quarterly Progress Report for the quarter ended the 31st 
July, 1913. 

The Progress Report is merely the exposition of a new theorem I have discovered in Integral 
Calculus. At present there are many definite integrals the values of which we know to be finite 
but still not possible of evaluation by the present known methods. This theorem will be an 
instrument by which at least some of the definite integrals whose values are at present not 
known can be evaluated. For instance, the integral treated in Ex(v) note Art. 5 in the paper, Mr. 
G. H. Hardy, M.A., F.R.S. of Trinity College, Cambridge, considers to be “new and 
interesting.” Similarly the integral connected with the Besselian function of the nth order which 
at present requires many complicated manipulations to evaluate can be readily inferred from 
the theorem given in the paper. I have also utilised this theorem in definite integrals for the 
expansion of functions which can now be ordinarily done by Lagrange’s, Burmann’s, or Abel’s 
theorems. For instance, the expansions marked as examples nos. (3) and (4), Art. 6, in the 
second part of the paper. 
The investigations I have made on the basis of this theorem are not all contained in the 
attached paper. There is ample scope for new and interesting results out of this theorem. This 
paper may be considered the first installment of the results I have got out of the theorem. Other 
new results based on the theorem I shall communicate in my later reports. 

I beg to submit this, my maiden attempt, and I humbly request that the Members of the 
Board will make allowance for any defect which they may notice to my want of usual training 
which is now undergone by college students and view sympathetically my humble effort in the 
attached paper. 


I beg to remain, 
Gentlemen 

Your obedient servant 
S. Ramanujan 


We now describe the primary theorem of the quarterly reports. We emphasize that the 
hypotheses below are those supplied by Ramanujan and need to be strengthened to insure the 
stated conclusions. 


THEOREM 1 (Ramanujan’s Master Theorem). Suppose that, in some neighborhood of x = 0, 

$F(x) = \sum_{k=0}^{\infty} \frac{\phi(k)(-x)^{k}}{k!}.$

Then 

(1.1)  $I \equiv \int_0^{\infty} x^{n-1}F(x)\,dx = \Gamma(n)\,\phi(-n),$

where $\Gamma$ denotes the gamma function. Conversely, if the value of I is given by (1.1), then the 
Maclaurin coefficients of F(x) can be determined. 


Ramanujan’s Master Theorem needs some explanation. Generally, n is not a nonpositive 
integer. Thus, Ramanujan is assuming that there exists a continuous extension of $\phi$, defined 
initially on the set of nonnegative integers. Ramanujan provides lengthy arguments in order to 
convince the reader that there always exists a unique, “natural,” continuous extension of $\phi$, but, 
as is to be expected, his pleas are unconvincing. Ramanujan requires just four simple hypotheses for 
his theorem: (1) F(x) can be expanded in a Maclaurin series, (2) F(x) is continuous on $(0, \infty)$, (3) 
n > 0, and (4) $x^{n}F(x)$ tends to 0 as x tends to $\infty$. Ramanujan remarks that the fourth condition 
can be relaxed if F is a circular function. 

Not surprisingly, more stringent hypotheses need to be put on F and $\phi$ for such a theorem to 
exist. We shall first give Ramanujan’s “proof” and then we shall state Hardy’s rigorous 
reformulation of Ramanujan’s theorem. 


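Ramanujan’s Proof. Recall Euler’s integral representation of the gamma function 

$\int_0^{\infty} e^{-mx}x^{n-1}\,dx = m^{-n}\,\Gamma(n),$

where m, n > 0. Let $m = r^{k}$ with r > 0, multiply both sides by $f^{(k)}(a)h^{k}/k!$, where f shall be 
specified later, and sum on k, $0 \le k < \infty$, to obtain 

$\sum_{k=0}^{\infty} \frac{f^{(k)}(a)h^{k}}{k!}\int_0^{\infty} e^{-r^{k}x}x^{n-1}\,dx = \Gamma(n)\sum_{k=0}^{\infty} \frac{f^{(k)}(a)(hr^{-n})^{k}}{k!}.$

Next, expand $\exp(-r^{k}x)$, $0 \le k < \infty$, in its Maclaurin series, invert the order of summation and 
integration, invert the order of summation, and apply Taylor’s theorem to deduce that 

(1.2)  $\int_0^{\infty} x^{n-1}\sum_{j=0}^{\infty} \frac{f(hr^{j} + a)(-x)^{j}}{j!}\,dx = \Gamma(n)\,f(hr^{-n} + a).$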
Now define $f(hr^{m} + a) = \phi(m)$, where m is real, and a, h, and r are regarded as constants. Then 
(1.2) may be rewritten in the form 

$\int_0^{\infty} x^{n-1}\sum_{j=0}^{\infty} \frac{\phi(j)(-x)^{j}}{j!}\,dx = \Gamma(n)\,\phi(-n),$

which completes Ramanujan’s proof. 

Ramanujan was evidently quite fond of the clever, original technique described above, and he 
employed it in many contexts. See [6] for several additional illustrations. 

In preparation for stating Hardy’s version of Ramanujan’s theorem, we need to introduce some 
notation. Let $s = \sigma + it$ with $\sigma$ and t both real. Let $H(\delta) = \{s : \sigma \ge -\delta\}$, where $0 < \delta < 1$. 
Suppose that $\psi(s)$ is analytic on $H(\delta)$ and that there exist constants C, P, and A with $A < \pi$ such 
that 

(1.3)  $|\psi(s)| \le C e^{P\sigma + A|t|}$

for all $s \in H(\delta)$. For x > 0 and $0 < c < \delta$, define 

$\Psi(x) = \frac{1}{2\pi i}\int_{c - i\infty}^{c + i\infty} \frac{\pi}{\sin(\pi s)}\,\psi(-s)\,x^{-s}\,ds.$

If $0 < x < e^{-P}$, an application of the residue theorem yields [14, p. 189] 

$\Psi(x) = \sum_{k=0}^{\infty} \psi(k)(-x)^{k}.$


THEOREM (Hardy [14, pp. 189, 190]). Let $\psi$ and $\Psi$ satisfy the conditions set forth in the preceding 
paragraph. Suppose that $0 < \sigma < \delta$. Then 

(1.4)  $\int_0^{\infty} \Psi(x)\,x^{s-1}\,dx = \frac{\pi}{\sin(\pi s)}\,\psi(-s).$

Formula (1.4) yields (1.1) upon replacing $\psi(s)$ by $\phi(s)/\Gamma(s + 1)$. In Hardy’s book [14], 
formulas (1.1) and (1.4) are (B) and (A), respectively, on p. 186. 

Ramanujan devotes the remainder of the first quarterly report, all of the second report, and a 
large portion of the third to giving examples and applications of his Master Theorem. We shall 
present some of the more interesting examples in the sequel. In some instances, Hardy’s theorem 
can be invoked to provide a legitimate proof. In other examples, Hardy’s theorem does not apply. 
However, in every case a rigorous proof can be given, and so we shall often cite the literature 
where proofs may be found. 


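EXAMPLE 1.1. Let m, n > 0 and set x = y/(1 + y) to obtain 

$\int_0^{1} x^{m-1}(1 - x)^{n-1}\,dx = \int_0^{\infty} y^{m-1}(1 + y)^{-m-n}\,dy.$

From the binomial series, 

(1.5)  $(1 + y)^{-r} = \sum_{k=0}^{\infty} \frac{\Gamma(k + r)}{\Gamma(r)\,k!}(-y)^{k}, \qquad |y| < 1,$

we find, on taking r = m + n, that $\phi(s) = \Gamma(s + m + n)/\Gamma(m + n)$. By Stirling’s formula, the hypothesis (1.3) of 
Hardy’s theorem is readily verified. Hence, Ramanujan’s Master Theorem yields the following 
well-known representation of the beta-function, 

$B(m, n) = \int_0^{1} x^{m-1}(1 - x)^{n-1}\,dx = \Gamma(m)\,\phi(-m) = \frac{\Gamma(m)\Gamma(n)}{\Gamma(m + n)}.$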
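
Example 1.1 can be checked numerically. The sketch below is our own illustration (the function names and the sample values m = 5/2, n = 3/2 are assumptions, chosen so that the integrand has no endpoint singularity): it compares the Master Theorem value Γ(m)φ(−m), with φ(s) = Γ(s + m + n)/Γ(m + n), against a direct midpoint-rule quadrature of the beta integral.

```python
import math

def phi(s, m, n):
    """phi(s) = Gamma(s + m + n) / Gamma(m + n), read off from the binomial series (1.5)."""
    return math.gamma(s + m + n) / math.gamma(m + n)

def beta_master(m, n):
    """Right-hand side of (1.1) with n replaced by m: Gamma(m) * phi(-m)."""
    return math.gamma(m) * phi(-m, m, n)

def beta_quadrature(m, n, steps=200000):
    """Midpoint rule for integral_0^1 x^(m-1) (1-x)^(n-1) dx; intended for m, n >= 1."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += x ** (m - 1) * (1.0 - x) ** (n - 1)
    return total * h

print(beta_master(2.5, 1.5), beta_quadrature(2.5, 1.5), math.pi / 16)
```

For these sample values the beta function equals π/16, so all three printed numbers should agree to within the quadrature error.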


EXAMPLE 1.2. For 0 < q < 1, define 

$(a; q)_{\infty} = \prod_{k=0}^{\infty} (1 - aq^{k})$

and consider the integral 

$\int_0^{\infty} \frac{t^{n-1}}{(-t; q)_{\infty}}\,dt,$

which is actually the particular case a = 0 of the more general integral 

$f(a) = \int_0^{\infty} t^{n-1}\,\frac{(-at; q)_{\infty}}{(-t; q)_{\infty}}\,dt, \qquad |a| < q^{n}.$

This is one of those integrals in [20], [21, pp. 53-58] to which we referred in the Introduction. 
Also, f(a) is that integral mentioned by Ramanujan in his prefatory comments. For a thorough 
discussion of f(a), see a recent article in this MONTHLY by R. A. Askey [2]. 

We now describe Ramanujan’s proof, which is also briefly sketched by Hardy [14, p. 194]. 
Writing 

$\frac{1}{(-t; q)_{\infty}} = \sum_{k=0}^{\infty} \psi(k)(-t)^{k}, \qquad |t| < 1,$

and using the fact that $(1 + t)(-qt; q)_{\infty} = (-t; q)_{\infty}$, we easily derive the recursion formula 

$\psi(k) = \frac{\psi(k - 1)}{1 - q^{k}}, \qquad k \ge 1.$

Since $\psi(0) = 1$, an inductive argument shows that 

$\psi(k) = \frac{(q^{k+1}; q)_{\infty}}{(q; q)_{\infty}}, \qquad k \ge 0,$

which satisfies the hypotheses of Hardy’s theorem. Hence, by Ramanujan’s Master Theorem, or 
Hardy’s theorem, 

$\int_0^{\infty} \frac{t^{n-1}}{(-t; q)_{\infty}}\,dt = \frac{\pi}{\sin(\pi n)}\,\psi(-n) = \frac{\pi\,(q^{1-n}; q)_{\infty}}{\sin(\pi n)\,(q; q)_{\infty}}.$
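
The evaluation at the end of Example 1.2, namely that the integral of $t^{n-1}/(-t; q)_{\infty}$ over (0, ∞) equals $\pi (q^{1-n}; q)_{\infty} / (\sin(\pi n)(q; q)_{\infty})$, can be tested numerically. The sketch below is our own illustration under stated assumptions: the infinite products are truncated after a fixed number of factors (adequate for q = 1/2 but not for q near 1), the substitution $t = e^{s}$ is used so that the trapezoid rule applies on a finite window, and all names and numerical parameters are hypothetical choices.

```python
import math

def q_pochhammer(a, q, terms=120):
    """Truncated product (a; q)_infinity = prod_{k>=0} (1 - a q^k), for 0 < q < 1."""
    product = 1.0
    power = 1.0  # holds q^k
    for _ in range(terms):
        product *= 1.0 - a * power
        power *= q
    return product

def lhs_integral(n, q, s_min=-60.0, s_max=25.0, h=0.02):
    """Trapezoid rule for integral_0^inf t^(n-1) / (-t; q)_inf dt after substituting t = e^s."""
    steps = int(round((s_max - s_min) / h))
    total = 0.0
    for i in range(steps + 1):
        s = s_min + i * h
        weight = 0.5 if i == 0 or i == steps else 1.0
        total += weight * math.exp(n * s) / q_pochhammer(-math.exp(s), q)
    return total * h

def rhs_closed_form(n, q):
    """pi (q^(1-n); q)_inf / (sin(pi n) (q; q)_inf)."""
    return math.pi * q_pochhammer(q ** (1.0 - n), q) / (math.sin(math.pi * n) * q_pochhammer(q, q))

print(lhs_integral(0.5, 0.5), rhs_closed_form(0.5, 0.5))
```

The sample values n = 1/2 and q = 1/2 lie in the range 0 < n < 1 covered by Hardy’s theorem.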
EXAMPLE 1.3. Let a > 0 and let x be the unique positive solution to the equation Log x = −ax. 
For each positive number n, we want to expand $x^{n}$ in ascending powers of a. Letting 0 < p < n, 
a = −(Log x)/x, and then $x = e^{-y}$, Ramanujan finds that 

$\int_0^{\infty} a^{p-1}x^{n}\,da = \int_0^{1} \left(-\frac{\mathrm{Log}\,x}{x}\right)^{p-1} x^{n}\,\frac{1 - \mathrm{Log}\,x}{x^{2}}\,dx$

$= \int_0^{\infty} y^{p-1}(1 + y)e^{-y(n-p)}\,dy$

$= \frac{n\,\Gamma(p)}{(n - p)^{p+1}}.$

Thus, in the notation of the Master Theorem, $\phi(p) = n(n + p)^{p-1}$. Therefore, Ramanujan 
concludes that 

(1.6)  $x^{n} = n\sum_{k=0}^{\infty} \frac{(n + k)^{k-1}(-a)^{k}}{k!}.$

Using Stirling’s formula, one can show that the infinite series in (1.6) converges for 0 < a < 1/e. 
In fact, (1.6) is valid for every real number n and |a| < 1/e. The expansion (1.6) can be found in 
Chapter 3 of Ramanujan’s second notebook [22, p. 32], and a rigorous proof has been given in [5, 
Entry 13]. 
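
Expansion (1.6), which reads $x^{n} = n\sum_{k\ge 0}(n+k)^{k-1}(-a)^{k}/k!$, is easily tested numerically. In the sketch below (our own illustration; the fixed-point solver, the truncation length, and the sample values are assumptions) the equation Log x = −ax is solved by iterating x ← exp(−ax), and the terms of the series are formed through logarithms to avoid overflow.

```python
import math

def solve_x(a, iterations=200):
    """Solve Log x = -a x, i.e. x = exp(-a x), by fixed-point iteration (adequate for small a > 0)."""
    x = 1.0
    for _ in range(iterations):
        x = math.exp(-a * x)
    return x

def series_1_6(n, a, terms=200):
    """Partial sum of n * sum_k (n + k)^(k-1) (-a)^k / k!, assuming n > 0 and 0 < a < 1/e."""
    total = 0.0
    for k in range(terms):
        log_magnitude = (k - 1) * math.log(n + k) + k * math.log(a) - math.lgamma(k + 1)
        total += (-1) ** k * math.exp(log_magnitude)
    return n * total

a, n = 0.2, 1.5
print(solve_x(a) ** n, series_1_6(n, a))
```

Since the ratio of successive terms tends to ae in absolute value, the truncation error for a = 0.2 is far below rounding error.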


EXAMPLE 1.4. Consider the trinomial equation 

(1.7)  $aqx^{p} + x^{q} = 1,$

where a > 0 and 0 < q < p. Using a procedure similar to that in Example 1.3, Ramanujan derives 
the power series expansion 

(1.8)  $x^{n} = \frac{n}{q}\sum_{k=0}^{\infty} \frac{\Gamma\!\left(\frac{n + pk}{q}\right)(-qa)^{k}}{\Gamma\!\left(\frac{n + pk}{q} - k + 1\right)k!},$

where n > 0 and x is a particular root of (1.7). By Stirling’s formula, this expansion converges 
when 

$|a| < p^{-p/q}(p - q)^{(p-q)/q}.$

Equality (1.8) can also be found in Chapter 3 of Ramanujan’s second notebook [22, vol. 2, p. 33], 
and a legitimate proof has been given in [5, Entry 14]. 
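
The trinomial expansion (1.8), $x^{n} = (n/q)\sum_{k\ge 0}\Gamma((n+pk)/q)(-qa)^{k}/\{\Gamma((n+pk)/q - k + 1)\,k!\}$, can be checked in the same spirit. In the sketch below we assume that the “particular root” is the root of (1.7) lying in (0, 1), which tends to 1 as a tends to 0; this identification, the bisection solver, and the sample parameters are our own assumptions. For p = 2, q = 1 the root is available in closed form, which gives an independent check.

```python
import math

def root(a, p, q, iterations=200):
    """Bisection for the root in (0, 1) of a*q*x^p + x^q = 1, assuming a > 0 and 0 < q < p."""
    lo, hi = 0.0, 1.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if a * q * mid ** p + mid ** q < 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def series_1_8(n, a, p, q, terms=300):
    """Partial sum of the right side of (1.8); terms are formed via log-gamma to avoid overflow."""
    total = 0.0
    for k in range(terms):
        top = (n + p * k) / q
        log_magnitude = (math.lgamma(top) - math.lgamma(top - k + 1)
                         - math.lgamma(k + 1) + k * math.log(q * a))
        total += (-1) ** k * math.exp(log_magnitude)
    return n / q * total

# Quadratic case a x^2 + x = 1, where the root is explicit.
closed_form = (-1.0 + math.sqrt(1.0 + 4.0 * 0.1)) / (2.0 * 0.1)
print(closed_form, series_1_8(1.0, 0.1, 2, 1))

# A genuinely trinomial case: 0.2 x^3 + x^2 = 1, compared through x^2.
print(root(0.1, 3, 2) ** 2, series_1_8(2.0, 0.1, 3, 2))
```

Both sample values of a lie inside the stated region of convergence (1/4 for p = 2, q = 1, and about 0.192 for p = 3, q = 2).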

Formulas (1.6) and (1.8) have a very long history. The latter was first discovered in 1758 by 
Lambert [16], while the former was initially found by Euler [9] in 1779. In 1770, Lagrange [15] 
discovered his famous “Lagrange inversion formula” [18, p. 145], sometimes called the Lagrange-Bürmann 
theorem, and as an application derived (1.8). Recall that in the introduction to the first 
report, Ramanujan makes a reference to theorems of Lagrange and Bürmann. Ramanujan 
evidently learned about these theorems from Carr’s Synopsis [7, pp. 278-282], which was the most 
influential book in Ramanujan’s development. Pólya and Szegő [18, p. 146] have offered (1.6) and 
(1.8) as exercises illustrating the Lagrange inversion formula. The derivations of (1.6) and (1.8) in 
[5] are developed ab initio, however. For additional references to these formulas, consult [5]. 


2. The Second Quarterly Report. The first part of the second report is devoted to several 
additional illustrations of the Master Theorem. 


EXAMPLE 2.1. We shall suppress the details in Ramanujan’s derivation of the very unusual 
expansion 

(2.1)  $e^{ax} = 1 + \frac{a\,e^{-bx}\sin(cx)}{c} + \sum_{k=2}^{\infty} \frac{d_{k}\{e^{-bx}\sin(cx)\}^{k}}{c^{k}\,k!},$

where, for $k \ge 2$, 

$d_{k} = a(a + kb)\prod_{j=1}^{n-1}\{(a + kb)^{2} + (2jc)^{2}\}$, if k = 2n is even, 

$d_{k} = a\prod_{j=1}^{n-1}\{(a + kb)^{2} + ((2j - 1)c)^{2}\}$, if k = 2n − 1 is odd. 

Here a, b, and c are real with b, c > 0, and $0 < x < (1/c)\tan^{-1}(c/b)$. Ramanujan discusses 
several associated expansions and consequences of (2.1). We mention just one special instance; 
letting c tend to 0 in (2.1), we find that, for 0 < x < 1/b, 

$e^{ax} = a\sum_{k=0}^{\infty} \frac{(a + kb)^{k-1}x^{k}e^{-kbx}}{k!}.$

Ramanujan evidently first learned of this well-known formula from Carr’s book [7, p. 282]. This 
expansion can be used to establish a more general theorem due to Abel [7, p. 282]. This is the 
theorem to which Ramanujan refers in his introductory letter to the reports. 

We now come to perhaps the most significant feature of the quarterly reports, Ramanujan’s 
generalization of Frullani’s integral theorem. Suppose that, in some neighborhood of x = 0, 

$f(x) - f(\infty) = \sum_{k=0}^{\infty} \frac{u(k)(-x)^{k}}{k!}$  and  $g(x) - g(\infty) = \sum_{k=0}^{\infty} \frac{v(k)(-x)^{k}}{k!},$

where 

$f(\infty) = \lim_{x\to\infty} f(x)$  and  $g(\infty) = \lim_{x\to\infty} g(x),$

which we assume exist. 


THEOREM 2. In the notation above, let f and g be continuous functions on $[0, \infty)$. Assume that 
$u(s)/\Gamma(s + 1)$ and $v(s)/\Gamma(s + 1)$ satisfy the hypotheses of Hardy’s theorem. Furthermore, assume 
that f(0) = g(0) and $f(\infty) = g(\infty)$. Then if a, b > 0, 

(2.2)  $\lim_{n\to 0+} I_{n} \equiv \lim_{n\to 0+} \int_0^{\infty} x^{n-1}\{f(ax) - g(bx)\}\,dx = \{f(0) - f(\infty)\}\left\{\mathrm{Log}\left(\frac{b}{a}\right) + \frac{d}{ds}\left(\mathrm{Log}\,\frac{v(s)}{u(s)}\right)_{s=0}\right\}.$


Ramanujan tacitly assumes that the limit can be taken under the integral sign. If f(x) = g(x), 
Theorem 2 reduces to 

(2.3)  $\int_0^{\infty} \frac{f(ax) - f(bx)}{x}\,dx = \{f(0) - f(\infty)\}\,\mathrm{Log}(b/a),$

since in this instance the limit on n can always be taken under the integral sign. Formula (2.3) is 
known as Frullani’s theorem and holds for any continuous function f such that $f(\infty)$ exists. If 
$f(\infty)$ does not exist, but f(x)/x is integrable over $[c, \infty)$ for some c > 0, then (2.3) still holds, but 
with $f(\infty)$ replaced by 0. According to the reports, Ramanujan likely learned of Frullani’s 
theorem from Williamson’s book [26] on integral calculus. 
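
Frullani’s theorem (2.3) is simple to confirm numerically in a case with rapid decay. The following sketch is our own illustration with the sample choice $f(x) = e^{-x}$, for which f(0) − f(∞) = 1; the truncation point of the integral and the number of midpoint steps are assumptions suited to this particular f.

```python
import math

def frullani_lhs(f, a, b, upper=60.0, steps=200000):
    """Midpoint rule for integral_0^upper (f(a x) - f(b x)) / x dx; the tail beyond `upper` is ignored."""
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += (f(a * x) - f(b * x)) / x
    return total * h

def decaying_exponential(x):
    return math.exp(-x)

approx = frullani_lhs(decaying_exponential, 1.0, 3.0)
exact = (1.0 - 0.0) * math.log(3.0 / 1.0)  # {f(0) - f(infinity)} Log(b/a)
print(approx, exact)
```

For functions that decay slowly at infinity a much larger truncation point, or a different quadrature, would be needed.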


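Ramanujan’s Proof. Applying the Master Theorem with 0 < n < 1, we find that 

$I_{n} = \int_0^{\infty} x^{n-1}\left(\{f(ax) - f(\infty)\} - \{g(bx) - g(\infty)\}\right)dx$

$= \Gamma(n)\{a^{-n}u(-n) - b^{-n}v(-n)\}$

$= \Gamma(n + 1)\left\{\frac{a^{-n}u(-n) - b^{-n}v(-n)}{n}\right\}.$

Letting n tend to 0, we deduce that 

(2.4)  $\lim_{n\to 0} I_{n} = \lim_{n\to 0}\left\{\frac{b^{n}v(n) - a^{n}u(n)}{n}\right\}$

$= \lim_{n\to 0}\{b^{n}v(n)\,\mathrm{Log}\,b + b^{n}v'(n) - a^{n}u(n)\,\mathrm{Log}\,a - a^{n}u'(n)\}$

$= \{f(0) - f(\infty)\}\,\mathrm{Log}(b/a) + v'(0) - u'(0)$

$= \{f(0) - f(\infty)\}\left\{\mathrm{Log}\left(\frac{b}{a}\right) + \frac{d}{ds}\left(\mathrm{Log}\,\frac{v(s)}{u(s)}\right)_{s=0}\right\},$

where we have used the fact that $u(0) = v(0) = f(0) - f(\infty)$. 
We record just one of several examples given by Ramanujan. 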
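EXAMPLE 2.2. Recall again the binomial series (1.5). Using the standard notation $\psi(x) = \Gamma'(x)/\Gamma(x)$, 
we find from Theorem 2 that, for a, b, p, q > 0, 

(2.5)  $\int_0^{\infty} \frac{(1 + ax)^{-p} - (1 + bx)^{-q}}{x}\,dx = \mathrm{Log}(b/a) + \frac{d}{ds}\left(\mathrm{Log}\,\frac{\Gamma(s + q)\Gamma(p)}{\Gamma(s + p)\Gamma(q)}\right)_{s=0}$

$= \mathrm{Log}(b/a) + \psi(q) - \psi(p)$

$= \mathrm{Log}(b/a) + \sum_{k=0}^{\infty}\left(\frac{1}{p + k} - \frac{1}{q + k}\right),$

since [10, p. 943] 

$\psi(x) = -\gamma - \sum_{k=0}^{\infty}\left(\frac{1}{k + x} - \frac{1}{k + 1}\right),$

where $\gamma$ denotes Euler’s constant. Ramanujan lists four special cases of (2.5), one of which is 

$\int_0^{\infty} \frac{(1 + ax)^{-1/4} - (1 + bx)^{-3/4}}{x}\,dx = \mathrm{Log}(b/a) + \pi,$

where Leibniz’s series for $\pi/4$ was employed. 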
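
The special case just quoted rests on the fact that the series in (2.5) with p = 1/4 and q = 3/4 sums to π. A minimal numerical sketch (our own; the function names and the truncation length are assumptions) sums the series directly. Because the terms decay only like $1/k^{2}$, a million terms give roughly six correct decimals.

```python
import math

def digamma_difference(p, q, terms=1_000_000):
    """Partial sum of sum_{k>=0} (1/(p+k) - 1/(q+k)), which converges to psi(q) - psi(p)."""
    total = 0.0
    for k in range(terms):
        total += 1.0 / (p + k) - 1.0 / (q + k)
    return total

def rhs_2_5(a, b, p, q):
    """Right side of (2.5): Log(b/a) plus the series for psi(q) - psi(p)."""
    return math.log(b / a) + digamma_difference(p, q)

value = rhs_2_5(1.0, 2.0, 0.25, 0.75)
print(value, math.log(2.0) + math.pi)
```

With p = 1/4 and q = 3/4 each term equals 4/(4k + 1) − 4/(4k + 3), so the sum is four times Leibniz’s series.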

James Hafner has kindly shown us that Theorem 2 can be significantly strengthened. Hafner’s 
improvement allows Ramanujan’s formula (2.2) to be applied to a wider variety of functions (and 
also for the limit in (2.2) to be taken under the integral sign). We now present Hafner’s argument. 

By Frullani’s Theorem (2.3), 

$\int_0^{\infty} \frac{f(ax) - g(bx)}{x}\,dx = \int_0^{\infty} \frac{\{f(ax) - f(bx)\} + \{f(bx) - g(bx)\}}{x}\,dx$

$= \{f(0) - f(\infty)\}\,\mathrm{Log}(b/a) + \int_0^{\infty} \frac{f(x) - g(x)}{x}\,dx.$

By (2.4), it then remains to show that 

$\int_0^{\infty} \frac{f(x) - g(x)}{x}\,dx = -\{u'(0) - v'(0)\}.$

Replacing f(x) − g(x) by f(x), it now suffices to prove the following lemma. (A similar result 
was established by Carlson [8] under stronger hypotheses.) 

LEMMA. Suppose that f is analytic in a neighborhood of the nonnegative real axis. Put 

$f(z) = \sum_{k=0}^{\infty} \frac{u(k)(-z)^{k}}{k!}$

for |z| sufficiently small. Assume that, for some positive number $\delta$, 

$\int^{\infty} f(x)\,x^{-a}\,dx$

converges uniformly for $1 - \delta < \mathrm{Re}(a) < 2$. Suppose furthermore that f(0) = 0. Then u(s) can be 
extended in a neighborhood of s = 0 so that u is differentiable at s = 0 and so that 

(2.6)  $u'(0) = -\int_0^{\infty} \frac{f(x)}{x}\,dx.$
Proof. For $\varepsilon > 0$ sufficiently small, we shall, in fact, define u by 


(2.7) u(s) =f f(a) 


271 
and then show that u has the desired properties. (We shall assume that principal branches are 
always taken.) First, observe that if s is a nonnegative integer, (2.7) is valid by Cauchy’s integral 
formula for derivatives. Since f is analytic in a neighborhood of the positive real axis, by Cauchy’s 
theorem, 


271 
fs f(z)2-8"' dz — f"f(z)z-9"" dz = 0, O<e<R<oo. 


It follows from (2.7) that 

$u(s) = \frac{\Gamma(s + 1)e^{\pi i s}}{2\pi i}\left\{\int_{|z| = \varepsilon} f(z)z^{-s-1}\,dz + (e^{-2\pi i s} - 1)\int_{\varepsilon}^{R} f(x)x^{-s-1}\,dx\right\}.$


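Assuming that $-\delta < \mathrm{Re}(s) < 1$ and using our hypotheses, we find that, upon letting $\varepsilon$ tend to 0 
and R tend to $\infty$, 

$u(s) = \frac{\Gamma(s + 1)(e^{-\pi i s} - e^{\pi i s})}{2\pi i}\int_0^{\infty} f(x)x^{-s-1}\,dx = -\frac{\Gamma(s + 1)\sin(\pi s)}{\pi}\int_0^{\infty} f(x)x^{-s-1}\,dx.$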

From this formula and our hypotheses, it follows that u(s) is analytic in a neighborhood of s = 0. 
Calculating u’(0) from the formula above, we deduce (2.6) to complete the proof. 

Hardy [11] has written a fascinating paper on generalizations of Frullani’s theorem. Included in 
the paper is a dazzling assortment of examples that are instances of (2.2) and that were developed 
by Hardy ad hoc. For example, 

Close 
0 x 


x2 


— dx = _Y 
0 


and 


x 2’ 


where y denotes Euler’s constant. These two examples do not fall under the province of Theorem 
2; Hafner’s lemma is needed to justify the use of (2.2). Although Hardy discovered several 
generalizations of Frullani’s theorem in [11] and other papers, it is interesting that he, or anyone 
else, never discerned Ramanujan’s beautiful generalization. 

In the last portion of the second quarterly report, Ramanujan derives eight corollaries of his 
Master Theorem and employs two of them and analogues to establish the inversion theorem for 
Fourier cosine and sine transforms. We give those two corollaries needed for the cosine formulae. 


COROLLARY 1. 

$\int_0^{\infty} \sum_{k=0}^{\infty} \frac{\phi(k)(-x)^{k}}{k!}\cos(nx)\,dx = \sum_{k=0}^{\infty} \phi(-2k - 1)(-n^{2})^{k}.$


Ramanujan’s Proof. Expand cos(nx) into its Maclaurin series, invert the order of integration 
and summation for this series, and then apply the Master Theorem to each term. 

For a rigorous proof of Corollary 1, see Hardy’s book [14, pp. 200-201]. For Ramanujan’s 
proof of Corollary 2 as well as a rigorous proof, see [6]. 


COROLLARY 2. 

$\int_0^{\infty} \sum_{k=0}^{\infty} \phi(2k)(-x^{2})^{k}\cos(nx)\,dx = \frac{\pi}{2}\sum_{k=0}^{\infty} \frac{\phi(-k - 1)(-n)^{k}}{k!}.$
The inversion theorem below is valid if f is of bounded variation on every finite interval in 
$[0, \infty)$ and f is absolutely integrable over $(0, \infty)$ [24, pp. 434-435]. 


THEOREM 3. Let n > 0. If 

(2.8)  $\int_0^{\infty} f(x)\cos(nx)\,dx = g(n),$

then 

(2.9)  $\int_0^{\infty} g(x)\cos(nx)\,dx = \frac{\pi}{2}f(n).$


Ramanujan’s Proof. If 

$f(x) = \sum_{k=0}^{\infty} \frac{\phi(k)(-x)^{k}}{k!},$

then by Corollary 1, we observe that (2.8) holds with 

$g(x) = \sum_{k=0}^{\infty} \phi(-2k - 1)(-x^{2})^{k}.$

An application of Corollary 2 then yields (2.9). 
Ramanujan concludes the second quarterly report with several well-known examples to 
illustrate Theorem 3. 
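
To see how Corollaries 1 and 2 combine in Ramanujan’s proof of Theorem 3, it may help to follow the simplest case by hand. The choice $\phi(k) = 1$ for all k, and the auxiliary symbol $\phi^{*}$, are our own illustration and are not taken from the reports; the argument is formal in the same sense as Ramanujan’s, since the geometric series below converges only for |n| < 1 although its sum gives the correct value for every n > 0.

```latex
% Take \phi(k) = 1, so that f(x) = \sum_{k \ge 0} (-x)^k / k! = e^{-x}.
% Corollary 1 gives the cosine transform of f:
\int_0^{\infty} e^{-x}\cos(nx)\,dx
   = \sum_{k=0}^{\infty} \phi(-2k-1)(-n^{2})^{k}
   = \sum_{k=0}^{\infty} (-n^{2})^{k}
   = \frac{1}{1+n^{2}} = g(n).
% Now write g(x) = \sum_{k \ge 0} \phi^{*}(2k)(-x^{2})^{k} with
% \phi^{*}(s) = \phi(-s-1) = 1, and apply Corollary 2 to \phi^{*}:
\int_0^{\infty} \frac{\cos(nx)}{1+x^{2}}\,dx
   = \frac{\pi}{2}\sum_{k=0}^{\infty} \frac{\phi^{*}(-k-1)(-n)^{k}}{k!}
   = \frac{\pi}{2}\sum_{k=0}^{\infty} \frac{(-n)^{k}}{k!}
   = \frac{\pi}{2}\,e^{-n} = \frac{\pi}{2}\,f(n).
```

Both integrals are classical, and together they form an instance of (2.8) and (2.9).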


3. The Third Quarterly Report. In the last of his quarterly reports, Ramanujan considers 
additional ramifications of his Master Theorem, but examines a few new topics as well. He first 
discusses the ordinary composition of functions and extends this concept to “the fractional order 
of functions.” 

Define $F^{0}(x) = x$, $F^{1}(x) = F(x)$, and $F^{n}(x) = F(F^{n-1}(x))$, for $n \ge 2$ and any function F. 
Of course, the range of $F^{n-1}$ must be contained in the domain of F. In order to define “the 
fractional order” of a particular function F, Ramanujan needs to first find a general formula for 
$F^{n}(x)$ when n is a nonnegative integer. He then extends this formula to all real values of n. 

To illustrate Ramanujan’s ideas, let $F(x) = x^{2} - 2$, $x \ge 2$. Set x = y + (1/y). Then 

$F^{1}(x) = y^{2} + y^{-2}, \qquad F^{2}(x) = y^{4} + y^{-4},$

and, in general, 

$F^{n}(x) = y^{2^{n}} + y^{-2^{n}} = \left(\frac{x + \sqrt{x^{2} - 4}}{2}\right)^{2^{n}} + \left(\frac{x - \sqrt{x^{2} - 4}}{2}\right)^{2^{n}}.$

Hence, for example, 

$F^{\mathrm{Log}\,3/\mathrm{Log}\,2}(x) = x^{3} - 3x.$
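
The closed form for the iterates of $F(x) = x^{2} - 2$ lends itself to a short computation. The sketch below is our own illustration (the function name is hypothetical); it evaluates the iterate of real order n through the substitution x = y + 1/y, so that one can confirm that the “half iterate” applied twice reproduces F, and that the order Log 3/Log 2 gives $x^{3} - 3x$.

```python
import math

def iterate(x, n):
    """Iterate of real order n of F(x) = x^2 - 2 for x >= 2, via x = y + 1/y with y >= 1."""
    y = (x + math.sqrt(x * x - 4.0)) / 2.0
    exponent = 2.0 ** n
    return y ** exponent + y ** (-exponent)

x = 3.0
print(iterate(x, 1), x * x - 2.0)                                 # ordinary F
print(iterate(iterate(x, 0.5), 0.5), x * x - 2.0)                 # half iterate applied twice
print(iterate(x, math.log(3.0) / math.log(2.0)), x ** 3 - 3 * x)  # order Log 3 / Log 2
```

The composition law for these iterates follows from the fact that an iterate of order n replaces y by $y^{2^{n}}$.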


As a natural outgrowth of his study of the fractional composition of functions, Ramanujan 
briefly studies fractional differentiation. For each nonnegative integer n, let $D^{n}f(x) = f^{(n)}(x)$. 
Ramanujan assumes that there exists a unique, “natural” function of n passing through the points 
$D^{0}f(x), Df(x), D^{2}f(x), \ldots$, and in this rather imprecise fashion defines fractional derivatives of 
f. Thus, for every real number n, Ramanujan defines $D^{n}e^{ax} = a^{n}e^{ax}$. Assuming that the kth 
fractional derivative $f^{(k)}(t)$ is well-defined, Ramanujan shows that it must satisfy the following 
relation. 


THEOREM 4. If n > 0, then 

(3.1)  $\int_0^{\infty} x^{n-1}f^{(r)}(a - x)\,dx = \Gamma(n)\,f^{(r-n)}(a).$
Ramanujan points out that Theorem 4 can actually be used to define the fractional derivative 
$f^{(k)}$. Let r be any nonnegative integer greater than k and let n = r − k. Thus, n > 0, and by (3.1), 

(3.2)  $\int_0^{\infty} x^{n-1}f^{(r)}(a - x)\,dx = \Gamma(n)\,f^{(k)}(a).$

Since the left side of (3.2) has a definite meaning from elementary calculus, (3.2) can be used to 
define the fractional derivative $f^{(k)}(a)$ for any real number k. It is remarkable that (3.2) is 
precisely the same definition that Liouville [17], one of the founders of fractional calculus, gave 
for the fractional derivative in 1832. 

We now present Ramanujan’s amazingly simple, but formal proof. 


Ramanujan’s Proof. By Taylor’s theorem and the Master Theorem, 

$\int_0^{\infty} x^{n-1}f^{(r)}(a - x)\,dx = \int_0^{\infty} x^{n-1}\sum_{k=0}^{\infty} \frac{f^{(r+k)}(a)(-x)^{k}}{k!}\,dx = \Gamma(n)\,f^{(r-n)}(a).$
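
Definition (3.2) can be tried out numerically on the exponential function, for which Ramanujan’s rule $D^{k}e^{cx} = c^{k}e^{cx}$ predicts the answer. The sketch below is our own illustration under stated assumptions: the substitution $u = x^{n}$ is used to remove the endpoint singularity of $x^{n-1}$, the integral is truncated at a finite upper limit suited to this rapidly decaying example, and the function name and sample values are hypothetical.

```python
import math

def fractional_derivative(f_r, a, r, k, upper=8.0, steps=200000):
    """Approximate f^(k)(a) from (3.2) with n = r - k > 0, where f_r is the r-th derivative of f.

    With u = x^n, integral_0^inf x^(n-1) f_r(a - x) dx = (1/n) integral_0^inf f_r(a - u^(1/n)) du,
    which is evaluated here by the midpoint rule on [0, upper].
    """
    n = r - k
    if n <= 0:
        raise ValueError("r must exceed k so that n = r - k > 0")
    h = upper / steps
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) * h
        total += f_r(a - u ** (1.0 / n))
    return total * h / (n * math.gamma(n))

c = 2.0
# f(t) = exp(c t), so its first derivative (r = 1) is c exp(c t); we ask for the derivative of order 1/2.
half_derivative = fractional_derivative(lambda t: c * math.exp(c * t), a=0.3, r=1, k=0.5)
expected = math.sqrt(c) * math.exp(c * 0.3)
print(half_derivative, expected)
```

The method presupposes that $f^{(r)}(a - x)$ decays as x tends to ∞, as it does for $e^{ct}$ with c > 0.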


The last portion of the third report is devoted primarily to various types of integral formulas 
and transforms. Besides using his Master Theorem, Ramanujan also employs operational meth- 
ods. We offer one such example which Ramanujan prefaces by declaring that “If a result is true 
only for real values of a quantity (say a), then the result got by using the operator for a is true only 
if when the new function can be expressed in terms of the original function...” As an illustration, 
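Ramanujan considers 

$\int_0^{\infty} \frac{\cos(ax)}{x^{2} + 1}\,dx = \frac{\pi}{2}e^{-a},$

which is valid only for (nonnegative) real values of a. Multiplying both sides by $\phi(0)$ and 
replacing a by the differential operator D, we get 

(3.3)  $\int_0^{\infty} \frac{\cos(Dx)}{x^{2} + 1}\,\phi(0)\,dx = \frac{\pi}{2}e^{-D}\phi(0).$

Now 

$\cos(Dx)\,\phi(0) = \frac{e^{ixD} + e^{-ixD}}{2}\,\phi(0) = \frac{1}{2}\sum_{k=0}^{\infty} \frac{(ix)^{k} + (-ix)^{k}}{k!}\,\phi^{(k)}(0) = \frac{\phi(ix) + \phi(-ix)}{2}.$

Similarly, $e^{-D}\phi(0) = \phi(-1)$. Thus, (3.3) can be written in the form 

$\int_0^{\infty} \frac{\phi(ix) + \phi(-ix)}{x^{2} + 1}\,dx = \pi\,\phi(-1),$

which Ramanujan claims is valid only when $\phi(ix) + \phi(-ix)$ can be written as a linear combination 
of cosines. 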
For many additional examples see Hardy’s book [14, Chapter 11] and our paper [4]. 


Conclusion. Although Ramanujan gave only formal proofs for most of the results found in the 
quarterly reports, it is remarkable that all of his theorems are correct (if hypotheses are appended) 
and that all of his examples are likewise valid. Despite the shortage of hypotheses for his 
theorems, Ramanujan possessed an uncanny ability in discerning when an application or formula 
was true and when it was not. His creative capacity to produce wondrous formulas is not only 
unsurpassed in this century, when beautiful formulas are no longer at their zenith, but perhaps in 
the history of mathematics as well. The quarterly reports may contain a message for contemporary 
mathematicians. We might allow our thoughts to occasionally escape from the chains of rigor and, 
in their freedom, to discover new pathways through the forest. 
SOLUTIONS OF ADVANCED PROBLEMS 
Flexible Linkages of Polygons 


6367 [1981, 711]. Proposed by A. Ehrenfeucht and J. Mycielski, University of Colorado. 


Let A be a finite collection of distinct but possibly overlapping regular n-gons of the same size 
on the plane such that every vertex of every n-gon of A is a vertex of exactly two n-gons of A. 
(a) Construct a collection A of 2n n-gons such that, even if the n-gons are rigid, A is flexible. 
(b)* For which n is a rigid A possible? 


Solution to part (a) by Kit Hanes, Eastern Washington University. We establish a more general 
result. Let B be any n-gon with vertices $B_1, \ldots, B_n$ and let C be any m-gon with vertices 
$C_1, \ldots, C_m$. Let $\theta$ be the angle between $B_1B_2$ and $C_1C_2$. Then, for $i \in \{1, \ldots, n\}$ let $C^i = B_i + C$, 
and for $j \in \{1, \ldots, m\}$ let $B^j = B + C_j$. Let $A = \{C^1, \ldots, C^n, B^1, \ldots, B^m\}$. A is a collection of 
n + m polygons each of which is a translate of either B or C. Any vertex of any polygon in A is a 
point $B_i + C_j$, which is a vertex just of $B^j$ and of $C^i$. This construction is valid independently of $\theta$; 
as $\theta$ varies A flexes. 


Also solved by Michael Goldberg and the proposers. 


Part (b). The proposers state that they know of only two cases when A as defined in the 
problem is rigid, namely n = 2 (a triangle) and n = 4 (a configuration of 12 squares inside a 
regular 12-gon whose outer edges form the 12-gon). 


ANSWER TO PHOTOS ON PAGE 517 


They are all outstanding mathematicians, and they were all presidents of the A.M.S. Here are 
their names, followed by the years of their presidency. Top row: J. von Neumann (1951-1952), 
R. L. Wilder (1955-1956); bottom row: J. L. Doob (1963-1964), Oscar Zariski (1969-1970). 


THE EULER $\phi$-FUNCTION IN THE GAUSSIAN INTEGERS 


JAMES T. CROSS 
Department of Mathematics, The University of the South, Sewanee, TN 37375 


1. Introduction. The set of all complex numbers a + bi, where a and b are integers, forms an 
integral domain with the usual complex number operations. This domain, the Gaussian Integers, 
which we denote by G, includes Z, the domain of rational (ordinary) integers, and behaves like Z 
in many respects. 

Let $\beta$ denote a nonzero Gaussian integer and $G/(\beta)$ the quotient ring of G (mod $\beta$). It is fun 
to discover that these rings furnish accessible and interesting examples of familiar concepts 
including equivalence classes, fields, cyclic and noncyclic groups, and the Euler $\phi$-function. In this 
section we review the $\phi$-function, extend it to G, and discuss some questions whose answers are 
well known in Z and which we intend to study in G. 

Let n be a nonzero member of Z and let Z/(n) denote the quotient ring of Z (mod n). If 
n > 0, then 

$$Z/(n) = \{[0], [1], [2], \ldots, [n-1]\},$$

where the brackets denote equivalence classes. The rings Z/(n) and Z/(-n) are identical, so that 
we may confine our attention to positive n. The units (those members having multiplicative 
inverses) of this ring form a multiplicative group which is denoted by $\Phi_Z(n)$ and the Euler 
$\phi$-function $\phi_Z(n)$ is defined to be the order of this group. A simple argument in elementary 
number theory proves that the member [k] of Z/(n) is a unit if and only if (k, n) = 1; that is, if 
and only if k and n are relatively prime. Thus for n > 1, $\phi_Z(n)$ is the number of positive integers 
less than n and prime to n. If $\Phi_Z(n)$ is cyclic and [k] generates this group, then k is called a 
primitive root for n. Simple construction of multiplication tables leads to the conclusion that 
primitive roots exist for n = 1, 2, 3, 4, 5, 6, and 7. However no primitive root exists for 8 since 
$\Phi_Z(8) = \{[1], [3], [5], [7]\}$ has members of orders 1 and 2 only. 
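
The small cases quoted above are easy to recompute. The following Python sketch lists the units of Z/(n) and searches for primitive roots by brute force; the helper names units, order, and primitive_roots are illustrative choices of ours, and the search is only meant for small n.

```python
from math import gcd

def units(n):
    """Representatives k (0 <= k < n) of the units [k] of Z/(n)."""
    return [k for k in range(n) if gcd(k, n) == 1]

def order(k, n):
    """Multiplicative order of the unit [k] in Z/(n)."""
    one = 1 % n              # in Z/(1) the classes [0] and [1] coincide
    e, x = 1, k % n
    while x != one:
        x = (x * k) % n
        e += 1
    return e

def primitive_roots(n):
    """All k whose class generates the unit group of Z/(n)."""
    us = units(n)
    return [k for k in us if order(k, n) == len(us)]

for n in range(1, 9):
    print(n, len(units(n)), primitive_roots(n))
```

The printed list of primitive roots is nonempty for n = 1, ..., 7 and empty for n = 8, in agreement with the text.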

The determination of those integers that have primitive roots is a popular undertaking in 
number theory. There it is proved that n has primitive roots if and only if n = 2 or 4, or n is a 
power of an odd prime or twice a power of an odd prime. In fact, a great deal more is established. 
Let the symbol $\cong$ denote isomorphism and $Z_n$ denote the (cyclic) additive group of Z/(n). Then 
for m > 1, the structure of $\Phi_Z(m)$ is given as follows [1, pages 46-51]: 


$\Phi_Z(2) \cong Z_1$, 

$\Phi_Z(4) \cong Z_2$, 

$\Phi_Z(2^n) \cong Z_2 \times Z_{2^{n-2}}$ if n > 2, 

$\Phi_Z(p^n) \cong Z_{p^n - p^{n-1}}$ if p is an odd prime, 


$\Phi_Z$ is multiplicative in the sense that $\Phi_Z(mn) \cong \Phi_Z(m) \times \Phi_Z(n)$ if (m, n) = 1. 
(The structure of $\Phi_Z(m)$ for arbitrary m is gotten through its prime factorization.) 


The Euler $\phi$-function extends naturally to G. Since Z is contained in G and the units of $G/(\beta)$ 
also form a multiplicative group, we let $\Phi_G(\beta)$ denote this group and let $\phi_G(\beta)$ denote its order. 
The following questions arise immediately: 


James T. Cross: I received the Ph.D. from the University of Tennessee where Professor Eckford Cohen directed 
my work. I majored in mathematics at Brown University, took an M.S. in applied mathematics at Harvard, and have 
been at Sewanee since 1955. The problem discussed in this paper occurred to me as a result of some “primitive root” 
studies done by two of my senior honors students. My extracurricular activities are extensive and intensive (too 
much so). I mention two: mini-farming and ping-pong. 
1. If p is an odd rational prime, then is $\Phi_G(p^n)$ still cyclic in this larger setting? 

2. More generally, which Gaussian integers have primitive roots? 

3. Can we determine the structure of $\Phi_G(\beta^n)$, where $\beta$ is prime in G? 

4. If the answer to question 3 is yes, can we extend our results to an arbitrary Gaussian integer 
through its prime factorization? 


Question 1 can be settled quickly in the negative by determining the orders of the members of 
$\Phi_G(3^2)$. As we shall show later in Example 4, a complete and nonrepeating set of representatives 
for the members (equivalence classes) of this group is the set of all a + bi, where a and b range 
independently from 0 through 8 and at least one of a and b is prime to 3. The 72 members of this 
group, and their orders, are tabulated in Fig. 1. This table is taken from an unpublished paper [2], 
written by my student, Michael S. Crowe, who answered questions 1 and 2. 


          [i]       [2i]                [4i]      [5i]                [7i]      [8i]
          4         12                  12        12                  12        4
[1]       [1 + i]   [1 + 2i]  [1 + 3i]  [1 + 4i]  [1 + 5i]  [1 + 6i]  [1 + 7i]  [1 + 8i]
1         24        24        3         24        24        3         24        24
[2]       [2 + i]   [2 + 2i]  [2 + 3i]  [2 + 4i]  [2 + 5i]  [2 + 6i]  [2 + 7i]  [2 + 8i]
6         24        8         6         24        24        6         8         24
          [3 + i]   [3 + 2i]            [3 + 4i]  [3 + 5i]            [3 + 7i]  [3 + 8i]
          12        12                  12        12                  12        12
[4]       [4 + i]   [4 + 2i]  [4 + 3i]  [4 + 4i]  [4 + 5i]  [4 + 6i]  [4 + 7i]  [4 + 8i]
3         24        24        3         24        24        3         24        24
[5]       [5 + i]   [5 + 2i]  [5 + 3i]  [5 + 4i]  [5 + 5i]  [5 + 6i]  [5 + 7i]  [5 + 8i]
6         24        24        6         24        24        6         24        24
          [6 + i]   [6 + 2i]            [6 + 4i]  [6 + 5i]            [6 + 7i]  [6 + 8i]
          12        12                  12        12                  12        12
[7]       [7 + i]   [7 + 2i]  [7 + 3i]  [7 + 4i]  [7 + 5i]  [7 + 6i]  [7 + 7i]  [7 + 8i]
3         24        8         3         24        24        3         8         24
[8]       [8 + i]   [8 + 2i]  [8 + 3i]  [8 + 4i]  [8 + 5i]  [8 + 6i]  [8 + 7i]  [8 + 8i]
2         24        24        6         24        24        6         24        24

FIG. 1. The elements of $\Phi_G(3^2)$ and their orders. 


In the table, for which the computations were done by computer, square brackets denote 
equivalence classes while the numbers without brackets indicate orders. For example, the order of 
[4 + 7i] is 24, the order of [5] is 6, and the order of [1 + 3i] is 3. These orders are easy to 
determine though the calculations are tedious. For example, the order of [1 + 3i] is 3 because 

$$(1 + 3i)^3 = 1 + 9i + 27i^2 + 27i^3 \equiv 1 \pmod 9,$$
while 
$$(1 + 3i)^2 = 1 + 6i + 9i^2 \not\equiv 1 \pmod 9.$$

Since the highest order observed in the table is 24, it is clear that there exist odd primes for which 
the answer to question 1 is no. (We will show in Example 7 below that $\Phi_G(3^2) \cong Z_3 \times Z_3 \times Z_8$.) 
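
The orders in Fig. 1 can be recomputed in a few lines. This is a minimal Python sketch, not the program used for the paper: a pair (a, b) stands for a + bi, arithmetic is done componentwise modulo 9, and the unit test (at least one of a, b prime to 3) is the criterion the paper establishes later as Theorem 2. The names mul, order, and tally are our own.

```python
from collections import Counter

N = 9  # the modulus 3^2

def mul(u, v):
    """Product of a + bi and c + di, reduced componentwise modulo N."""
    (a, b), (c, d) = u, v
    return ((a * c - b * d) % N, (a * d + b * c) % N)

def order(u):
    """Multiplicative order of the unit u; loops forever if u is not a unit."""
    e, x = 1, u
    while x != (1, 0):
        x = mul(x, u)
        e += 1
    return e

# Units: at least one of a and b is prime to 3.
units = [(a, b) for a in range(N) for b in range(N) if a % 3 or b % 3]
orders = {u: order(u) for u in units}
tally = Counter(orders.values())
print(len(units), max(orders.values()), sorted(tally.items()))
```

Since no element has order 72, the group is not cyclic, which is the negative answer to question 1 for p = 3.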

Our objective in this paper is to answer questions 3 and 4, obtaining Crowe’s answer to 
question 2 as a by-product. Our methods are direct and constructive: 


(i). We identify the primes in G. Fortunately, we have to consider only three types. 
(ii). For $\beta$ a prime in G, we find equivalence class representatives of $G/(\beta^n)$ and its units 
group, $\Phi_G(\beta^n)$. 
(iii). For a power of each type of prime, we determine the structure of the corresponding units 
group, answering question 3. 
(iv). We show that $\Phi_G$ is multiplicative. The answer to question 4 will then be at hand. 




2. The Primes in G. Our principal goal in this section is to identify the primes in G; in 
conjunction with this program we also give some relevant facts concerning the Gaussian Integers. 
The reader can find these topics discussed in [1, pp. 82-112] or [3, pp. 246-260]. 

For $\beta = a + bi$ in G, the norm $N(\beta)$ of $\beta$ is defined by $N(\beta) = (a + bi)(a - bi) = a^2 + b^2$. 
Note that $N(\beta)$ is in Z. The norm of a product equals the product of the norms. If $\beta$ divides $\gamma$ in 
G, then $N(\beta)$ divides $N(\gamma)$ in Z. 

If $\rho$ is in G, then $\rho$ is a unit if and only if $N(\rho) = 1$. It follows that the units in G are 1, -1, i, 
and -i. 

If q is a positive prime in Z and $q \equiv 1 \pmod 4$, then $q = \pi\bar{\pi}$ for some $\pi$ in G, where $\bar{\pi}$ is the 
complex conjugate of $\pi$. Here $\pi$ and $\bar{\pi}$ are prime in G and they are not associates. (Members $\beta$ and 
$\gamma$ in G are associates, which we denote by $\beta \sim \gamma$, if $\beta = \rho\gamma$ for some unit $\rho$ in G.) 


EXAMPLE 1. In G, 5 = (1 + 2i)(1 - 2i). If, for example, $1 + 2i = \beta\gamma$ in G, then $N(1 + 2i) = 5 = N(\beta)N(\gamma)$ 
in Z. Then $N(\beta) = 1$ or $N(\gamma) = 1$, so that one of the factors is a unit and the 
factorization is trivial. Thus 1 + 2i is prime; similarly, so is 1 - 2i. To see that they are not 
associates, we need only try to get from one to the other by means of the four units. 

If p is a positive prime in Z and $p \equiv 3 \pmod 4$, then p is prime in G. 


EXAMPLE 2. Let p = 3. If $3 = \beta\gamma$ in G, then $N(3) = 9 = N(\beta)N(\gamma)$ in Z. Then either 
$N(\beta) = N(\gamma) = 3$ or the norm of one of the factors is 1. Since $a^2 + b^2 = 3$ is not solvable in Z, 
the first alternative must be discarded. Then one of $\beta$ and $\gamma$ is a unit, the factorization of 3 is 
trivial, and 3 is prime in G. 

In G, 2 = (1 + i)(1 - i), where these factors are primes. However, they are associates since 
1 - i = -i(1 + i). Then $2 = \rho(1 + i)^2$, where $\rho$ is a unit in G, and we see that 2 is a power of a 
prime in G. If $\alpha$ denotes 1 + i, then $\alpha^2 \sim 2$, $\alpha^3 \sim 2\alpha$, $\alpha^4 \sim 4, \ldots, \alpha^{2m} \sim 2^m$, and $\alpha^{2m+1} \sim 2^m\alpha$. 

The primes described above ($\alpha$ and primes of the types $\pi$ and p) are, together with their 
associates, the only primes in G. 
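
The three types of primes can be told apart mechanically. The sketch below is an illustration under our own naming (two_squares, describe): for a rational prime q with q = 1 (mod 4) it finds the two-square representation by trial, which gives the factors a + bi and a - bi; it assumes its argument is already known to be prime.

```python
from math import isqrt

def two_squares(n):
    """Return (a, b) with 0 < a <= b and a*a + b*b == n, or None if there is none."""
    for a in range(1, isqrt(n) + 1):
        b = isqrt(n - a * a)
        if b >= a and a * a + b * b == n:
            return (a, b)
    return None

def describe(p):
    """How the positive rational prime p factors in G (p is assumed prime)."""
    if p == 2:
        return "2 = -i(1 + i)^2, an associate of the square of alpha = 1 + i"
    if p % 4 == 1:
        a, b = two_squares(p)
        return f"{p} = ({a} + {b}i)({a} - {b}i), two non-associate primes"
    return f"{p} stays prime in G"

for p in (2, 3, 5, 7, 11, 13, 17):
    print(describe(p))
```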

Factorization into primes in G is unique in the following sense: two factorizations may look 
different, but we can get from one to the other by multiplying by units. For example, as we saw 
above, $2 = \alpha\bar{\alpha} = -i\alpha^2$. This phenomenon also occurs in Z: 6 = (3)(2) = (-3)(-2). 


3. The Equivalence Classes in $G/(\beta^n)$ and $\Phi_G(\beta^n)$, Where $\beta$ Is Prime in G. Now that we 
have identified the primes in G, we want to be able to raise them to powers and find 
representatives for the equivalence classes of the corresponding quotient rings and units groups. 
Theorems 1 and 2 below address this problem. Following the theorems we consider some 
examples. 

In the remainder of the paper we will continue with the following notation: q and p will denote 
positive primes in Z satisfying $q \equiv 1 \pmod 4$ and $p \equiv 3 \pmod 4$, $\pi$ will denote a prime factor of q 
in G, and $\alpha$ will denote 1 + i. 


THEOREM 1. The equivalence classes of G modulo a power of a prime are given as follows: 


1. $G/(\pi^n) = \{[a] : 0 \le a \le q^n - 1\}$, 

2. $G/(p^n) = \{[a + bi] : 0 \le a \le p^n - 1 \text{ and } 0 \le b \le p^n - 1\}$, 

3. $G/(\alpha^{2m}) = \{[a + bi] : 0 \le a \le 2^m - 1 \text{ and } 0 \le b \le 2^m - 1\}$, 

4. $G/(\alpha^{2m+1}) = \{[a + bi] : 0 \le a \le 2^{m+1} - 1 \text{ and } 0 \le b \le 2^m - 1\}$. 


In the statements of this theorem, as well as in the examples to be considered below, we intend to 
imply that the given sets of representatives are complete and nonrepeating. 


Proof. First we observe that $G/(\alpha^{2m}) = G/(2^m)$ and $G/(\alpha^{2m+1}) = G/(2^m\alpha)$ because $\alpha^{2m} \sim 2^m$. 
Now if $a + bi \equiv c + di \pmod{\alpha^{2m}}$, then $2^m$ divides both a - c and b - d, so that the classes 
of (3) are distinct. A similar argument applies to the classes in (2). If [a] = [b] in $G/(\pi^n)$, then $\pi^n$ 
divides a - b. Let $\pi^n\gamma = a - b$ for some $\gamma$ in G. Then taking complex conjugates, we get 
$\bar{\pi}^n\bar{\gamma} = \overline{a - b} = a - b$, so that $\bar{\pi}^n$ also divides a - b. Since $\pi$ and $\bar{\pi}$ are not associates, $\pi^n\bar{\pi}^n = q^n$ 
divides a - b implying that the classes in (1) are distinct. If [a + bi] = [c + di] in $G/(\alpha^{2m+1}) = G/(2^m\alpha)$, 
then $2^m\alpha$ divides a - c + (b - d)i. Then $2^m$ divides b - d; since each is less than $2^m$, 
b = d. Then $2^m\alpha$ divides a - c. Let $a - c = 2^m k$, where k is in Z because the only rational 
members of G are integers. Then $\alpha$ divides k so that $N(\alpha) = 2$ divides $N(k) = k^2$. Then 2 divides 
k, so that $2^{m+1}$ divides a - c and the classes in (4) are distinct. 

Now let $\beta = x + yi$ be in G. Reducing x and y by multiples of $2^m$ gets $\beta$ in one of the classes in 
(3). Reducing by multiples of $p^n$, we get $\beta$ in one of the classes of (2). Reducing by multiples of 
$2^{m+1}$ yields $\beta \equiv c + di \pmod{2^{m+1}}$, where each of c and d is nonnegative and less than $2^{m+1}$. If 
$d < 2^m$, we have $\beta$ in one of the classes of (4). If $d \ge 2^m$, then we subtract and add $2^m\alpha$, getting 

$$c + di = (c - 2^m) + (d - 2^m)i + 2^m\alpha.$$
Then 
$$\beta \equiv (c - 2^m) + (d - 2^m)i \pmod{\alpha^{2m+1}},$$

where $0 \le d - 2^m < 2^m$. Then $c - 2^m$ can be reduced by multiples of $2^{m+1}$, getting $\beta$ in one of 
the classes in (4). Now we show that i belongs to one of the classes in (1). Let $\pi^n = a - bi$, so that 
$bi \equiv a \pmod{\pi^n}$. Now q is prime to b, for if q divides b, then both $\pi$ and $\bar{\pi}$ divide b. Then $\pi$ 
divides a. But this implies that q divides a. Then q divides $\pi^n$; this is absurd because $q = \pi\bar{\pi}$. Thus 
(q, b) = 1 and the congruence $zb \equiv 1 \pmod{q^n}$ is solvable in Z. Then the congruence $bi \equiv a \pmod{\pi^n}$ 
yields $i \equiv za \pmod{\pi^n}$. Then za can be reduced by multiples of $q^n$ to find i in one of 
the given classes. Since each of x, y, and i belongs to one of the classes of (1), then so does 
$\beta = x + yi$. □ 


This theorem implies that $G/(\pi^n)$ has $q^n$ members, $G/(p^n)$ has $p^{2n}$ members, and $G/(\alpha^n)$ has 
$2^n$ members. These facts are special cases of the well-known result [4, page 54] that the order of 
$G/(\beta)$ is $N(\beta)$. 
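
Theorem 1 can be spot-checked by brute force for small moduli. In this sketch (all names are ours) a pair (x, y) stands for x + yi, divisibility is tested by multiplying with the conjugate, and completeness is only sampled on a small box of Gaussian integers, so it is a sanity check rather than a proof.

```python
def gmul(u, v):
    (a, b), (c, d) = u, v
    return (a * c - b * d, a * d + b * c)

def gpow(u, e):
    r = (1, 0)
    for _ in range(e):
        r = gmul(r, u)
    return r

def divides(beta, gamma):
    """True if beta divides gamma in G; pairs (x, y) stand for x + yi."""
    (a, b), (c, d) = beta, gamma
    n = a * a + b * b                       # N(beta)
    # gamma * conj(beta) = (ca + db) + (da - cb)i must be divisible by N(beta)
    return (c * a + d * b) % n == 0 and (d * a - c * b) % n == 0

def check(beta, reps, box=6):
    """Reps pairwise incongruent mod beta, and every x + yi in a small box hits one."""
    distinct = all(not divides(beta, (u[0] - v[0], u[1] - v[1]))
                   for i, u in enumerate(reps) for v in reps[:i])
    complete = all(any(divides(beta, (x - u[0], y - u[1])) for u in reps)
                   for x in range(-box, box + 1) for y in range(-box, box + 1))
    return distinct and complete

alpha = (1, 1)
cases = {
    "pi^2, pi = 1 + 2i": (gpow((1, 2), 2), [(a, 0) for a in range(25)]),
    "3": ((3, 0), [(a, b) for a in range(3) for b in range(3)]),
    "alpha^4": (gpow(alpha, 4), [(a, b) for a in range(4) for b in range(4)]),
    "alpha^5": (gpow(alpha, 5), [(a, b) for a in range(8) for b in range(4)]),
}
for name, (beta, reps) in cases.items():
    print(name, len(reps), check(beta, reps))
```

In each case the number of representatives equals the norm of the modulus, as the remark after the theorem says.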


Now we are ready to identify the units of the rings whose members are specified in Theorem 1. 


THEOREM 2. Let [a] be in $G/(\pi^n)$. Then [a] is a unit if and only if (q, a) = 1. Let [a + bi] be in 
$G/(p^n)$. Then [a + bi] is a unit if and only if at least one of a and b is prime to p. Let [a + bi] be in 
$G/(\alpha^n)$. Then [a + bi] is a unit if and only if $a \not\equiv b \pmod 2$. 


Proof. Let $\beta$ and $\gamma$ be in G. Then $[\beta]$ is a unit in $G/(\gamma)$ if and only if $[\beta][\delta] = [1]$ in $G/(\gamma)$, 
for some $\delta$ in G. Then $[\beta]$ is a unit if and only if $\beta\delta \equiv 1 \pmod \gamma$; that is, if and only if 
$\beta\delta + \eta\gamma = 1$ for some $\eta$ in G. Thus $[\beta]$ is a unit in $G/(\gamma)$ if and only if $\beta$ is prime to $\gamma$. It follows 
that [a] in $G/(\pi^n)$ is a unit if and only if $\pi$ does not divide a, but $\pi$ does not divide a if and only 
if q does not divide a. Next, [a + bi] in $G/(p^n)$ is a unit if and only if p does not divide a + bi; p 
does not divide a + bi if and only if p is prime to at least one of a and b. Finally, [a + bi] in 
$G/(\alpha^n)$ is a unit if and only if $\alpha$ does not divide a + bi. But 

$$(a + bi)/\alpha = (a + bi)/(1 + i) = (a + b)/2 + i(b - a)/2.$$

This quotient is a Gaussian integer if and only if $a \equiv b \pmod 2$. Thus $\alpha$ does not divide a + bi if 
and only if $a \not\equiv b \pmod 2$. □ 


EXAMPLE 3. Let $\pi = 1 + 2i$. We know that $\pi$ is prime and $\pi\bar{\pi} = 5$. Theorems 1 and 2 assert 
that 

$$G/(\pi^2) = \{[0], [1], [2], \ldots, [24]\}$$

and [a] in $G/(\pi^2)$ is a unit if and only if 5 does not divide a. It may be instructive to find the 
class to which i belongs. Now $\pi^2 = -3 + 4i$, so that $4i \equiv 3 \pmod{\pi^2}$. Multiplying this 
congruence through by 19, we get $76i \equiv 57 \pmod{\pi^2}$. But since $\pi^2$ divides 25, this congruence 
reduces to $i \equiv 7 \pmod{\pi^2}$. We can check this result directly: $(7 - i)/(-3 + 4i) = -1 - i$, so 
that $\pi^2$ divides 7 - i in G. Thus $i \equiv 7 \pmod{\pi^2}$, and i belongs to [7]. 
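
The computation of the class of i follows the recipe in the proof of Theorem 1: write $\pi^n = a - bi$, invert b modulo $q^n$, and multiply by a. A short Python sketch of that recipe is given below; the function names are ours, and the three-argument pow with exponent -1 assumes Python 3.8 or later.

```python
def gmul(u, v):
    (a, b), (c, d) = u, v
    return (a * c - b * d, a * d + b * c)

def gpow(u, e):
    r = (1, 0)
    for _ in range(e):
        r = gmul(r, u)
    return r

def class_of_i(pi, n, q):
    """Integer t with 0 <= t < q^n and i = t (mod pi^n)."""
    x, y = gpow(pi, n)                  # pi^n = x + yi = a - bi with a = x, b = -y
    a, b = x, -y
    modulus = q ** n
    z = pow(b % modulus, -1, modulus)   # zb = 1 (mod q^n)
    return (z * a) % modulus

def to_class(u, t, modulus):
    """Integer representative of x + yi, obtained by replacing i with t."""
    return (u[0] + u[1] * t) % modulus

t = class_of_i((1, 2), 2, 5)
print(t, (t * t + 1) % 25, to_class((-3, 4), t, 25))
```

Once i is identified with an integer t satisfying $t^2 \equiv -1$, every class in $G/(\pi^n)$ reduces to an ordinary residue modulo $q^n$, which is why this case behaves like Z.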

In this example the reader should convince himself that $\Phi_G((1 + 2i)^2)$ is isomorphic to $\Phi_Z(25)$. 
Since $\Phi_Z(25)$ is cyclic, $(1 + 2i)^2 = -3 + 4i$ has primitive roots. In fact, 2 serves as such a root. 

EXAMPLE 4. By Theorems 1 and 2, $\Phi_G(3^2) = \Phi_G(9)$ = 

$$\{[a + bi] : 0 \le a \le 8,\ 0 \le b \le 8, \text{ and 3 is prime to at least one of } a \text{ and } b\}.$$

Note that $\Phi_Z(9)$ is included isomorphically in $\Phi_G(9)$. Thus $\Phi_Z(9)$ = {[1], [2], [4], [5], [7], [8]} can be 
identified in an obvious way with {[1], [2], [4], [5], [7], [8]} in $\Phi_G(9)$. 

EXAMPLE 5. By Theorems 1 and 2, 

$\Phi_G(\alpha^5) = \Phi_G(4\alpha)$ 
= {[1], [3], [5], [7], [i], [2 + i], [4 + i], [6 + i], [1 + 2i], [3 + 2i], 
[5 + 2i], [7 + 2i], [3i], [2 + 3i], [4 + 3i], [6 + 3i]}. 


Note that $\Phi_Z(8)$ = {[1], [3], [5], [7]} is included isomorphically in $\Phi_G(\alpha^5)$. Since $\Phi_Z(8)$ is not cyclic 
and since any subgroup of a cyclic group is cyclic, $\alpha^5$ has no primitive root. 

Let us be more ambitious and find the structure of $\Phi_G(\alpha^5)$. Let H denote the subgroup 
generated by [1 + 2i], and let K and I denote the subgroups generated by [5] and [i], respectively. 
Then 


H = {[1], [1 + 2i]}, K = {[1], [5]}, 
and 
I = {[1], [i], [-1], [-i]} = {[1], [i], [7], [4 + 3i]}. 


Now $H \times K$ = {[1], [5], [1 + 2i], [5 + 2i]}. Since $H \times K$ intersects I only at the identity and since 
each of $H \times K$ and I has order 4, the order of $H \times K \times I$ is $16 = \phi_G(\alpha^5)$. Then 


$$\Phi_G(\alpha^5) = H \times K \times I \cong Z_2 \times Z_2 \times Z_4.$$
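
Example 5 can be replayed numerically. The sketch below uses the reduction described in the proof of Theorem 1 to obtain the representative a + bi with $0 \le a \le 7$ and $0 \le b \le 3$ modulo $4\alpha$; the function names reduce, mul, and generated are ours.

```python
def reduce(u, m):
    """Representative a + bi of u modulo alpha^(2m+1) ~ 2^m * alpha with
    0 <= a < 2^(m+1) and 0 <= b < 2^m, as in the proof of Theorem 1."""
    M = 2 ** m
    c, d = u[0] % (2 * M), u[1] % (2 * M)
    if d >= M:                       # subtract 2^m * alpha = M + Mi
        c, d = c - M, d - M
    return (c % (2 * M), d)

def mul(u, v, m):
    (a, b), (c, d) = u, v
    return reduce((a * c - b * d, a * d + b * c), m)

def generated(g, m):
    """Cyclic subgroup generated by the unit g, listed as g^0, g^1, ..."""
    g = reduce(g, m)
    elems, x = [(1, 0)], g
    while x != (1, 0):
        elems.append(x)
        x = mul(x, g, m)
    return elems

m = 2                                # alpha^5 ~ 4 * alpha
H, K, I = generated((1, 2), m), generated((5, 0), m), generated((0, 1), m)
products = {mul(mul(h, k, m), t, m) for h in H for k in K for t in I}
units = {(a, b) for a in range(8) for b in range(4) if (a - b) % 2}
print(H, K, I, len(products), products == units)
```

The 2 x 2 x 4 products are pairwise distinct and exhaust the 16 units, which is what the direct-product claim amounts to.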


Now, using Theorems 1 and 2, we can count the members of the units groups and find the 
following results: 


$$\phi_G(\pi^n) = \phi_Z(q^n) = q^n - q^{n-1} = q^{n-1}(q - 1),$$
$$\phi_G(p^n) = p^{2n} - p^{2n-2} = p^{2n-2}(p^2 - 1),$$
$$\phi_G(\alpha^n) = 2^n - 2^{n-1} = 2^{n-1}.$$


The reader is invited to verify these results in Examples 3-5. 
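
For readers who prefer to let a machine do the verification, here is a small Python sketch that counts units directly from the representatives of Theorem 1 and the criteria of Theorem 2, so that the counts can be compared with the closed formulas. The function names are illustrative.

```python
from math import gcd

def phi_pi(q, n):
    """Units of G/(pi^n): classes [a], 0 <= a < q^n, with (q, a) = 1."""
    return sum(1 for a in range(q ** n) if gcd(a, q) == 1)

def phi_p(p, n):
    """Units of G/(p^n): classes [a + bi] with a or b prime to p."""
    N = p ** n
    return sum(1 for a in range(N) for b in range(N) if a % p or b % p)

def phi_alpha(n):
    """Units of G/(alpha^n): classes [a + bi] with a and b of opposite parity."""
    m, odd = divmod(n, 2)            # n = 2m or n = 2m + 1
    return sum(1 for a in range(2 ** (m + odd)) for b in range(2 ** m) if (a - b) % 2)

print(phi_pi(5, 2), phi_p(3, 2), phi_alpha(5))
```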
In our program outlined in (i)—(iv) of Section 1, we have now reached (iii). We are ready to 
find the structures of the units groups that we have been discussing. 


4. The Structure of $\Phi_G(\pi^n)$. We show that Example 3 is typical; that is, $\Phi_G(\pi^n)$ is cyclic. 

THEOREM 3. $\Phi_G(\pi^n) \cong Z_{q^n - q^{n-1}}$. 

Proof. By Theorem 2, 
$$\Phi_G(\pi^n) = \{[a] : 1 \le a < q^n \text{ and } (q, a) = 1\}.$$
Formally, $\Phi_G(\pi^n)$ looks like $\Phi_Z(q^n)$, and, in fact, the map 
$$[a] \text{ in } \Phi_Z(q^n) \to [a] \text{ in } \Phi_G(\pi^n)$$
is an isomorphism onto $\Phi_G(\pi^n)$. Thus, $\Phi_G(\pi^n)$ behaves algebraically like $\Phi_Z(q^n)$. Since 
$\Phi_Z(q^n) \cong Z_{q^n - q^{n-1}}$, the proof is complete. While this theorem may be pleasing, it is not very exciting. 
Fortunately we encounter more interesting ones below. 


5. Some Preliminary Ideas Concerning $\Phi_G(\alpha^n)$ and $\Phi_G(p^n)$. We dealt in Section 4 with one 
type of prime in G. The units groups corresponding to the other two types are generally not cyclic 
as we saw in the Introduction and in Example 5. We showed in Example 5 that $\Phi_G(\alpha^5) = H \times K \times I$. 
Let us see whether we may expect a similar structure when $\alpha$ is raised to an even power. 


EXAMPLE 6. By Theorems 1 and 2, 
$$\Phi_G(\alpha^6) = \Phi_G(8) = \{[a + bi] : 0 \le a \le 7,\ 0 \le b \le 7, \text{ and } a \not\equiv b \pmod 2\}.$$
Again let H, K, and I be generated by [1 + 2i], [5], and [i], respectively. Then 
H = {[1], [1 + 2i], [5 + 4i], [5 + 6i]}, 
K = {[1], [5]}, and I = {[1], [i], [7], [7i]}. 

Then $K \times I$ = {[1], [i], [7], [7i], [5], [5i], [3], [3i]}, so that $(K \times I) \cap H = \{[1]\}$. Then the order of 
$H \times K \times I$ is $32 = \phi_G(\alpha^6)$. Then again $\Phi_G(\alpha^6) = H \times K \times I$. 

Examples 5 and 6 lead us to conjecture that $\Phi_G(\alpha^n) = H \times K \times I$, where these subgroups are 
generated by [1 + 2i], [5], and [i], respectively. We delay checking out this conjecture in order to 
motivate another one concerning $\Phi_G(p^n)$. This conjecture involves a subgroup generated by 
[1 + pi], and we shall study this subgroup and the subgroup H simultaneously. 

Now $\phi_G(3^2) = 72$, $\phi_G(3^3) = 648$, and $\phi_G(7^2) = 2352$; it is clear that ideas concerning the 
structure of $\Phi_G(p^n)$ cannot be gotten by displaying multiplication tables. However, we can write a 
computer program to find the orders of the members of some of these groups and hope to observe 
something significant. The results of this effort are summarized below, where we have tabulated 
the highest orders observed and the order of [1 + pi]: 


Group            Order                      Highest order of an element   Order of [1 + pi]

$\Phi_G(3^2)$    $72 = 3^2(3^2 - 1)$        $24 = 3(3^2 - 1)$             3
$\Phi_G(3^3)$    $648 = 3^4(3^2 - 1)$       $72 = 3^2(3^2 - 1)$           9
$\Phi_G(7^2)$    $2352 = 7^2(7^2 - 1)$      $336 = 7(7^2 - 1)$            7
$\Phi_G(7^3)$    $115248 = 7^4(7^2 - 1)$    $2352 = 7^2(7^2 - 1)$         49
$\Phi_G(11^2)$   $14520 = 11^2(11^2 - 1)$   $1320 = 11(11^2 - 1)$         11

Note that in each case the highest order observed is $p^{n-1}(p^2 - 1)$ and the product of this number 
with $p^{n-1}$ gives $\phi_G(p^n)$. This is a short table; the evidence is skimpy, but we are led to conjecture 
that $\Phi_G(p^n) = L \times K$, where L has order $p^{n-1}(p^2 - 1)$ and K has order $p^{n-1}$. This would imply 
that $\Phi_G(p^n) = H \times K \times R$, where each of H and K has order $p^{n-1}$ and R has order $p^2 - 1$. We 
note also that [1 + pi] appears to have order $p^{n-1}$. 
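
A survey of this kind is easy to reproduce for the smaller groups. The following Python sketch is our own brute-force version, not the author's program: it computes the order of every unit of $G/(p^n)$ by repeated multiplication and reports the group order, the highest order found, and the order of [1 + pi]. Only the three smallest cases are run, because the method is quadratic in the group size.

```python
def survey(p, n):
    """(group order, highest order of an element, order of [1 + pi]) for Phi_G(p^n)."""
    N = p ** n

    def mul(u, v):
        (a, b), (c, d) = u, v
        return ((a * c - b * d) % N, (a * d + b * c) % N)

    def order(u):
        e, x = 1, u
        while x != (1, 0):
            x = mul(x, u)
            e += 1
        return e

    # Units by Theorem 2: at least one of a and b is prime to p.
    units = [(a, b) for a in range(N) for b in range(N) if a % p or b % p]
    return len(units), max(order(u) for u in units), order((1, p))

results = {(p, n): survey(p, n) for p, n in [(3, 2), (3, 3), (7, 2)]}
for key, value in results.items():
    print(key, value)
```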


EXAMPLE 7. Referring to Fig. 1, we let H, K, and R be the subgroups of $\Phi_G(9)$ generated by 
[1 + 3i], [4], and [7 + 2i], respectively. Then 


H = {[1], [1 + 3i], [1 + 6i]}, K = {[1], [4], [7]}, 
and 
R = {[1], [7 + 2i], [i], [7 + 7i], [8], [2 + 7i], [8i], [2 + 2i]}. 
Since $H \cap K = \{[1]\}$, $H \times K$ has order 9, and since 8 is prime to 9, $H \times K$ intersects R only at the 
identity. Then $H \times K \times R$ has order $72 = \phi_G(3^2)$. Then $\Phi_G(3^2) = H \times K \times R$. 
Now we recapitulate. Examples 5—7 and our display of orders have led to two conjectures: 


(i). $\Phi_G(\alpha^n) = H \times K \times I$, where H, K, and I are generated by [1 + 2i], [5], and [i], 
respectively. 

(ii). $\Phi_G(p^n) = H \times K \times R$, where H is generated by [1 + pi], K has order $p^{n-1}$, and R has 
order $p^2 - 1$. 


We prove these conjectures in the next two sections (one of them requires some qualification). 
At this point we study the subgroups denoted by H in (i) and (ii). Our principal tool is the 
following lemma. 


LEMMA 1. Let k denote a positive integer. Then, 


(A) $(1 + pi)^{p^k} = 1 + p^{k+1}i + p^{k+2}\gamma$, where $\gamma$ is in G. 
(B) $(1 + 2i)^{2^k} = 1 + 2^{k+1}(a + i) + 2^{k+2}\gamma$, where $\gamma$ is in G and a is an odd integer. 


Proof. Let $\beta$ be in G, let r denote a prime in Z, let k be a positive integer, and let $\sigma$ denote 
$(1 + \beta r)^{r^k}$. The reader can readily convince himself that it is probably true that 


$$\sigma = 1 + \beta r^{k+1} + (\beta^2/2)(r^k - 1)r^{k+2} + \gamma r^{k+2},$$


where $\gamma$ is some member of G. A proof based on the Binomial Theorem is somewhat tedious but 
not difficult. We omit the details. Now, 


if $r \ne 2$, then $\sigma \equiv 1 + \beta r^{k+1} \pmod{r^{k+2}}$. 
If r = 2, then 
$$\sigma \equiv 1 + 2^{k+1}\beta + \beta^2(2^k - 1)2^{k+1} \pmod{2^{k+2}}$$
$$\equiv 1 + 2^{k+1}((2^k - 1)\beta^2 + \beta) \pmod{2^{k+2}}.$$
The statements of the lemma follow from putting $\beta = i$ in these results. □ 
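
Since the binomial computation is omitted, a numerical check of both parts of Lemma 1 may be reassuring. The sketch below, with names of our choosing, raises 1 + pi and 1 + 2i to the stated powers in exact integer arithmetic and tests the congruences modulo $p^{k+2}$ and $2^{k+2}$; in part (B) the condition that a is odd is equivalent to the real part minus 1 being congruent to $2^{k+1}$ modulo $2^{k+2}$.

```python
def gmul(u, v):
    (a, b), (c, d) = u, v
    return (a * c - b * d, a * d + b * c)

def gpow(u, e):
    """u^e by repeated squaring, in exact integer arithmetic."""
    result, base = (1, 0), u
    while e:
        if e & 1:
            result = gmul(result, base)
        base = gmul(base, base)
        e >>= 1
    return result

def lemma_A(p, k):
    """(1 + pi)^(p^k) = 1 + p^(k+1) i  modulo p^(k+2)."""
    x, y = gpow((1, p), p ** k)
    M = p ** (k + 2)
    return (x - 1) % M == 0 and (y - p ** (k + 1)) % M == 0

def lemma_B(k):
    """(1 + 2i)^(2^k) = 1 + 2^(k+1)(a + i) modulo 2^(k+2), with a odd."""
    x, y = gpow((1, 2), 2 ** k)
    M, half = 2 ** (k + 2), 2 ** (k + 1)
    return (x - 1) % M == half and y % M == half

print(all(lemma_A(p, k) for p in (3, 7, 11) for k in (1, 2, 3)),
      all(lemma_B(k) for k in range(1, 8)))
```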
The following lemma gives the orders of the subgroups denoted by H in conjectures (i) and (ii). 

LEMMA 2. Let each of m and n be a positive integer greater than 1. Then 


A. Let $\rho$ denote 1 + pi. The order of $[\rho]$ in $\Phi_G(p^n)$ is $p^{n-1}$. 
B. Let $\delta$ denote 1 + 2i. The order of $[\delta]$ in $\Phi_G(\alpha^{2m})$ and in $\Phi_G(\alpha^{2m+1})$ is $2^{m-1}$. 


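Proof. Put k = n - 1 in (A) and k = m - 1 in (B) of Lemma 1. Then 
$$\rho^{p^{n-1}} \equiv 1 \pmod{p^n}$$
and 
$$\delta^{2^{m-1}} \equiv 1 + 2^m(a + i) \pmod{2^{m+1}}.$$
But $\alpha$ divides a + i since a is odd, and $\alpha^{2m} \sim 2^m$. Then 
$$\delta^{2^{m-1}} \equiv 1 \pmod{\alpha^{2m+1}}.$$


Then the order of $[\rho]$ in $\Phi_G(p^n)$ is a divisor of $p^{n-1}$, and the order of $[\delta]$ in $\Phi_G(\alpha^{2m})$ and in 
$\Phi_G(\alpha^{2m+1})$ is a divisor of $2^{m-1}$. It suffices to show now that $\rho^{p^{n-2}} \not\equiv 1 \pmod{p^n}$ and 
$\delta^{2^{m-2}} \not\equiv 1 \pmod{2^m}$. Put k = n - 2 in (A) and k = m - 2 in (B) of Lemma 1. Then 


$$\rho^{p^{n-2}} \equiv 1 + p^{n-1}i \not\equiv 1 \pmod{p^n},$$
and 
$$\delta^{2^{m-2}} \equiv 1 + 2^{m-1}(a + i) \pmod{2^m}.$$
Since 2 does not divide a + i, $\delta^{2^{m-2}} \not\equiv 1 \pmod{2^m}$. □ 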
As we implied in the conjectures, we intend to use the subgroups generated by [1 + pi] and 
[1 + 2i] and denoted by H as factors in direct products. For this purpose it will be necessary to 
have the following lemma which asserts that no member, except the identity, of these subgroups 
contains a real number or a pure imaginary number. 


LEMMA 3. Let each of m and n be a positive integer greater than 1. Let $\rho$ denote 1 + pi and $\delta$ 
denote 1 + 2i. No member, except [1], of the subgroup of $\Phi_G(p^n)$ generated by $[\rho]$ and no member, 
except [1], of the subgroup of $\Phi_G(\alpha^{2m})$ or of $\Phi_G(\alpha^{2m+1})$ generated by $[\delta]$ can be represented by a real 
number or a pure imaginary number. 
Proof. Let a complex number c be called special if c is in Z or $c = ic_1$ for some $c_1$ in Z. 
Suppose now that $\delta^b$ is congruent (mod $2^m$) to a special number, for some b satisfying 
$0 < b < 2^{m-1}$. Let B denote the set of all such “bad” b. First, we show that $2^{m-2}$ is not in B. Put 
k = m - 2 in part (B) of Lemma 1. Then 


$$\delta^{2^{m-2}} \equiv 1 + 2^{m-1}(a + i) \pmod{2^m}.$$
Now supposing $\delta^{2^{m-2}}$ congruent (mod $2^m$) to a special number s, we have 
$$1 + 2^{m-1}a - s + 2^{m-1}i \equiv 0 \pmod{2^m}.$$


If s is real, $2^m$ divides $2^{m-1}$. If s is imaginary, $2^{m-1}$ divides 1. These contradictions guarantee that 
$2^{m-2}$ is not in B. Our technique for completing the proof is Fermat’s “method of descent.” Let L 
be the least member of B and let 


$$2^{m-1} = Ld + r,$$
where $0 \le r < L$. If r = 0, then $L = 2^t$ for some t satisfying $0 \le t \le m - 2$. (We showed 
$t \ne m - 2$.) Then 


$$\delta^{2^{m-2}} = (\delta^{2^t})^{2^{m-2-t}}.$$


Since L is in B, $\delta^L$ is congruent (mod $2^m$) to a special number. It follows that $(\delta^{2^t})^{2^{m-2-t}} = \delta^{2^{m-2}}$ is 
also congruent (mod $2^m$) to such a number, which we proved impossible. Then $r \ne 0$. Since the 
order of $[\delta]$ is $2^{m-1}$, 


(1) $[1] = [\delta^{Ld}][\delta^r] = [s][\delta^r]$ in $\Phi_G(2^m)$, 

for some special number s. Let s = x or s = ix for some x in Z. Since [s] is in the units group, x is 
odd. Then let y in Z satisfy [yx] = [1] in $\Phi_Z(2^m)$. Then $2^m$ divides yx - 1 in Z, implying that $2^m$ 
divides yx - 1 in G, so that [yx] = [1] in $\Phi_G(2^m)$. From (1) we get 

(2) $[y] = [ys][\delta^r]$ in $\Phi_G(2^m)$. 

If s = x, then $[\delta^r] = [y]$. If s = ix, we multiply (2) by [-i], getting $[-iy] = [\delta^r]$. In either case, 
$\delta^r$ is congruent (mod $2^m$) to a special number, contradicting the minimality of L. The set B is 
empty. This completes the proof of the lemma as it applies to $\alpha$. Replacing 2 by p and m by n, we 
can use the same argument to prove the lemma as it applies to p. □ 


6. The Structure of $\Phi_G(p^n)$. We conjectured in Section 5 that $\Phi_G(p^n) = H \times K \times R$, where 
H is generated by [1 + pi], K has order $p^{n-1}$, and R has order $p^2 - 1$. We established the useful 
properties of H. Now we go to work on K. 

We saw in Example 4 that $\Phi_Z(9)$ is included in $\Phi_G(9)$ by an obvious isomorphism. More 
generally, the map, 

$$[a] \text{ in } \Phi_Z(p^n) \to [a] \text{ in } \Phi_G(p^n),$$
is an isomorphism. We know that $\Phi_Z(p^n)$ is cyclic and that $\phi_Z(p^n) = p^{n-1}(p - 1)$. It follows 
that some [a] in $\Phi_Z(p^n)$ has order $p^{n-1}$. The isomorphism then implies that [a] has order $p^{n-1}$ in 
$\Phi_G(p^n)$. We let K be the subgroup generated by [a]. Each member of K can be represented by a 
real number, so that $H \cap K = \{[1]\}$ and $H \times K$ has order $p^{2n-2}$. 

Next, we turn to R. Since p is prime in G, G/(p) is a field and $\Phi_G(p)$ is cyclic, being the 
multiplicative group of a finite field. The order of this group is $p^2 - 1$. Let $[\beta]$ generate $\Phi_G(p)$. 
Then $\beta^{p^2-1} \equiv 1 \pmod p$ so that $\beta^{p^2-1} = 1 + \gamma p$ for some $\gamma$ in G. Then from the proof of Lemma 1, 

$$(\beta^{p^2-1})^{p^{n-1}} = 1 + \eta p^n$$

for some $\eta$ in G. Hence, 

$$(\beta^{p^{n-1}})^{p^2-1} \equiv 1 \pmod{p^n},$$
so $[\beta^{p^{n-1}}]$ has order t, where t divides $p^2 - 1$. It follows that $\beta^{tp^{n-1}} \equiv 1 \pmod p$, whence $p^2 - 1$ 
divides $tp^{n-1}$. Then $p^2 - 1$ divides t, so that $t = p^2 - 1$ and $[\beta^{p^{n-1}}]$ has order $p^2 - 1$ in $\Phi_G(p^n)$. 
We let R denote the subgroup of $\Phi_G(p^n)$ generated by $[\beta^{p^{n-1}}]$. 

Now since every member of $H \times K$ has order a power of p, $(H \times K) \cap R = \{[1]\}$, and the 
order of $H \times K \times R$ is $p^{2n-2}(p^2 - 1) = \phi_G(p^n)$. Our conjecture has checked out; $\Phi_G(p^n) = H \times K \times R$ 
and we have proved the following theorem: 


THEOREM 4. $\Phi_G(p^n) \cong Z_{p^{n-1}} \times Z_{p^{n-1}} \times Z_{p^2-1}$. 
Our proof of this theorem is valid only for n > 1, but the theorem holds also for n = 1. In this 
case $Z_{p^{n-1}}$ is trivial. 


REMARK. In Example 7 the subgroups K and R can be gotten without resort to Fig. 1: Just 
examine the 6 members of $\Phi_Z(9)$, finding that [4] has order 3, and let K be generated by [4] in 
$\Phi_G(9)$. Then write out the 8 members of $\Phi_G(3)$, getting [1 + i] to be a generator. Then let R be 
generated by $[(1 + i)^3]$ in $\Phi_G(9)$. 
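
The construction in the proof of Theorem 4 can be carried out mechanically for n > 1. In the Python sketch below the generator of $\Phi_G(p)$ is found by search, and K is generated by [1 + p]; that particular choice is our assumption (the proof only requires some [a] of order $p^{n-1}$), though for p = 3, n = 2 it gives [4] as in the Remark. The function then checks that the products hkr are pairwise distinct.

```python
def decompose(p, n):
    """Sizes of H, K, R and of the set of products h*k*r in Phi_G(p^n), n >= 2."""
    N = p ** n

    def mul(u, v, mod=N):
        (a, b), (c, d) = u, v
        return ((a * c - b * d) % mod, (a * d + b * c) % mod)

    def generated(g, mod=N):
        elems, x = [(1, 0)], g
        while x != (1, 0):
            elems.append(x)
            x = mul(x, g, mod)
        return elems

    # A generator of Phi_G(p), found by search: its order mod p must be p^2 - 1.
    beta = next((a, b) for a in range(p) for b in range(p)
                if (a, b) != (0, 0) and len(generated((a, b), p)) == p * p - 1)
    r = (1, 0)
    for _ in range(p ** (n - 1)):          # r = beta^(p^(n-1)) mod p^n
        r = mul(r, beta)
    H = generated((1, p))
    K = generated((1 + p, 0))              # assumption: [1 + p] has order p^(n-1)
    R = generated(r)
    products = {mul(mul(h, k), t) for h in H for k in K for t in R}
    return len(H), len(K), len(R), len(products)

for p, n in [(3, 2), (3, 3), (7, 2)]:
    print(p, n, decompose(p, n))
```

In each case the number of distinct products equals $p^{2n-2}(p^2 - 1)$, so H, K, and R together account for the whole group.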


7. The Structure of $\Phi_G(\alpha^n)$. We conjectured in Section 5 that $\Phi_G(\alpha^n) = H \times K \times I$, where H 
is generated by [1 + 2i] while K and I are generated by [5] and [i], respectively. However, in 
Lemmas 2 and 3 we assumed m > 1 so that for n < 4 we determine directly the structure of 
$\Phi_G(\alpha^n)$. Moreover, in $\Phi_G(\alpha^4) = \Phi_G(4)$, the subgroup generated by [5] is trivial. Therefore, we give 
the structure of $\Phi_G(\alpha^n)$ in two theorems, the first applicable for n = 1, 2, 3, or 4, while the second 
applies if $n \ge 5$. 

THEOREM 5. $\Phi_G(\alpha) \cong Z_1$, $\Phi_G(\alpha^2) \cong Z_2$, $\Phi_G(\alpha^3) \cong Z_4$, and $\Phi_G(\alpha^4) \cong Z_2 \times Z_4$. 

Proof. By Theorem 2, 

$\Phi_G(\alpha)$ = {[1]}, 
$\Phi_G(\alpha^2)$ = {[1], [i]}, 
$\Phi_G(\alpha^3)$ = {[1], [3], [i], [2 + i]}, 
$\Phi_G(\alpha^4)$ = {[1], [3], [i], [3i], [1 + 2i], [2 + i], [2 + 3i], [3 + 2i]}. 
One can verify that [i] generates $\Phi_G(\alpha^3)$ and that $\Phi_G(\alpha^4)$ is the direct product of the subgroups 
{[1], [1 + 2i]} and {[1], [i], [3], [3i]}. □ 


Now we assume $n \ge 5$ and prove conjecture (i) of Section 5. We investigated the subgroup H. 
The properties of K that are important for us are given in the following lemma. 


LEMMA 4. Let $n \ge 5$. The order of K is $2^{m-2}$ or $2^{m-1}$, according as n = 2m or n = 2m + 1. The 
member [-1] of $\Phi_G(\alpha^n)$ is not in K. 


Proof. The maps 
$$[a] \text{ in } \Phi_Z(2^m) \to [a] \text{ in } \Phi_G(\alpha^{2m})$$
and 
$$[a] \text{ in } \Phi_Z(2^{m+1}) \to [a] \text{ in } \Phi_G(\alpha^{2m+1})$$


are isomorphisms. We noted one of them in Example 5. Now $\Phi_Z(2^m)$ and $\Phi_Z(2^{m+1})$ are direct 
products of the subgroups generated by [-1] and [5]. (See [1, page 46].) The order of the subgroup 
generated by [-1] is 2. Then the order of the subgroup generated by [5] is $\phi_Z(2^m)/2$ or 
$\phi_Z(2^{m+1})/2$; that is, $2^{m-2}$ in $\Phi_Z(2^m)$ and $2^{m-1}$ in $\Phi_Z(2^{m+1})$. Then the given isomorphisms 
establish the truth of the lemma. □ 


Now since [-1] is not in K and since [i] generates I, K intersects I only at the identity. Since 
each member of $K \times I$ can be represented by a real number or a pure imaginary number, 
$H \cap (K \times I) = \{[1]\}$. The order of $H \times K \times I$ is, therefore, 
$$2^{m-1+m-2+2} = 2^{n-1} = \phi_G(\alpha^n) \text{ if } n = 2m,$$
$$2^{m-1+m-1+2} = 2^{n-1} = \phi_G(\alpha^n) \text{ if } n = 2m + 1.$$
Then $\Phi_G(\alpha^n) = H \times K \times I$, and we have proved the following theorem. 
THEOREM 6. Let $n \ge 5$. Then 
$$\Phi_G(\alpha^n) \cong Z_{2^{m-1}} \times Z_{2^{m-2}} \times Z_4 \text{ if } n = 2m,$$
$$\Phi_G(\alpha^n) \cong Z_{2^{m-1}} \times Z_{2^{m-1}} \times Z_4 \text{ if } n = 2m + 1.$$
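
Theorem 6 can be checked numerically for the first few exponents. The sketch below, with helper names of our own, reduces products to the representatives of Theorem 1 (componentwise modulo $2^m$ for n = 2m, and by the subtract-and-reduce step of the proof for n = 2m + 1), then compares the orders of [1 + 2i], [5], and [i] and the number of distinct products with the theorem.

```python
def make_mul(n):
    """Multiplication of canonical representatives modulo alpha^n (alpha = 1 + i)."""
    m, odd = divmod(n, 2)
    M = 2 ** m

    def reduce(u):
        if not odd:                      # alpha^(2m) ~ 2^m
            return (u[0] % M, u[1] % M)
        c, d = u[0] % (2 * M), u[1] % (2 * M)
        if d >= M:                       # alpha^(2m+1) ~ 2^m * alpha
            c, d = c - M, d - M
        return (c % (2 * M), d)

    def mul(u, v):
        (a, b), (c, d) = u, v
        return reduce((a * c - b * d, a * d + b * c))

    return mul

def check_theorem6(n):
    """(|H|, |K|, |I|, number of distinct products hkt) for n >= 5."""
    mul = make_mul(n)

    def generated(g):
        elems, x = [(1, 0)], g
        while x != (1, 0):
            elems.append(x)
            x = mul(x, g)
        return elems

    H, K, I = generated((1, 2)), generated((5, 0)), generated((0, 1))
    products = {mul(mul(h, k), t) for h in H for k in K for t in I}
    return len(H), len(K), len(I), len(products)

for n in range(5, 10):
    print(n, check_theorem6(n))
```

For each n the last entry equals $2^{n-1}$, the order of the whole units group.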


8. $\Phi_G$ Is Multiplicative. Question 3 of the Introduction has been answered by Theorems 3-6. 
We will show now that 


$$\Phi_G(\beta_1\beta_2) \cong \Phi_G(\beta_1) \times \Phi_G(\beta_2)$$


when $\beta_1$ is prime to $\beta_2$, thus answering question 4 in the affirmative. Now it is easy to modify a 
“Z” proof of a version of the Chinese Remainder Theorem so as to establish that if $\beta_1$, $\beta_2$, $\eta_1$, 
and $\eta_2$ are members of G with $\beta_1$ prime to $\beta_2$, then there exists X in G satisfying $X \equiv \eta_1 \pmod{\beta_1}$ 
and $X \equiv \eta_2 \pmod{\beta_2}$. Moreover, X is unique (mod $\beta_1\beta_2$). These facts are helpful in establishing 
the following theorem. 


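THEOREM 7. Let $\beta_1$ and $\beta_2$ be in G with $(\beta_1, \beta_2) = 1$. Let f map $\Phi_G(\beta_1) \times \Phi_G(\beta_2)$ to $G/(\beta_1\beta_2)$ 
such that 


$$f(([\eta_1], [\eta_2])) = [\eta],$$


where $\eta \equiv \eta_i \pmod{\beta_i}$ for i = 1 or 2. Then the image of f is $\Phi_G(\beta_1\beta_2)$ and f is an isomorphism. 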


Proof. The Chinese Remainder Theorem implies that f is well defined. For i = 1 or 2, 
$(\eta, \beta_i) = (\eta_i, \beta_i) = 1$ since $\eta \equiv \eta_i \pmod{\beta_i}$ and $\eta_i$ is prime to $\beta_i$, being a member of $\Phi_G(\beta_i)$. Thus 
$\eta$ is prime to $\beta_1\beta_2$ and f maps to $\Phi_G(\beta_1\beta_2)$. A routine check verifies that f is an isomorphism onto 
$\Phi_G(\beta_1\beta_2)$. □ 


EXAMPLE 8. Assuming that it might be fun to see the function f in action, we apply f to 
$\Phi_G(3) \times \Phi_G(\alpha^3)$. We know that $\Phi_G(3)$ is cyclic with generator [1 + i]. (See the Remark following 
Theorem 4.) From Theorem 5 we have that $\Phi_G(\alpha^3) = \Phi_G(2\alpha)$ is cyclic with generator [i]. In view 
of Examples 4 and 5 and Theorems 1 and 2, we might conjecture that $G/(6\alpha)$ = 


$$\{[a + bi] : 0 \le a \le 11 \text{ and } 0 \le b \le 5\}$$


and that [a + bi] is a unit if $a \not\equiv b \pmod 2$ and (3, a) = 1 or (3, b) = 1. This conjecture does 
check out, and a systematic listing of a set of representatives of $\Phi_G(6\alpha)$ might go as follows: 
i, 5i, 1, 1 + 2i, 1 + 4i, 2 + i, ..., 11 + 4i. This set has 32 members, checking that $\phi_G(6\alpha) = \phi_G(3)\phi_G(\alpha^3) = 8(4)$. 
Now using the notation of Theorem 7 with $\beta_1 = 3$ and $\beta_2 = 2\alpha$, we let 


$$\eta = \eta_1 + (3 - 6i)(\eta_2 - \eta_1).$$
(One who works through a proof of the Chinese Remainder Theorem can see the motivation.) 


Then $\eta \equiv \eta_1 \pmod 3$ and $\eta \equiv \eta_2 \pmod{2\alpha}$. For example, to find f(([1 + i], [3])), we let 
$\eta_1 = 1 + i$ and $\eta_2 = 3$. Then 


$$\eta = 1 + i + (3 - 6i)(2 - i) = 1 - 14i \equiv 7 + 4i \pmod{6\alpha},$$
since $12i \equiv 0 \pmod{6\alpha}$ and $7 + 4i = 1 - 2i + 6\alpha$. Thus f(([1 + i], [3])) is [7 + 4i]. Fig. 2 is a 
display of f in its entirety. In this display we omit the brackets that denote equivalence classes. In 
the upper left corner we isolate the subgroup $\Phi_G(3) \times \{[1]\}$, which is, of course, isomorphic to 
$\Phi_G(3)$; in the upper right we isolate $\{[1]\} \times \Phi_G(2\alpha)$. We denote the images of these subgroups by S 
and T. The reader who wishes to practice his computational skills is urged to show directly that 
[1 + 4i] in $\Phi_G(6\alpha)$ generates S and [10 + 3i] generates T. It is clear that $S \cap T = \{[1]\}$, so that 
$\Phi_G(6\alpha) = S \times T$. 


$f \colon \Phi_G(3) \times \Phi_G(2\alpha) \to \Phi_G(6\alpha)$


                    f                                     f
(1, 1)              1          S      (1, 1)              1          T
(1 + i, 1)          1 + 4i     S      (1, i)              10 + 3i    T
(2i, 1)             3 + 2i     S      (1, 3)              7          T
(1 + 2i, 1)         7 + 2i     S      (1, 2 + i)          4 + 3i     T
(2, 1)              5          S
(2 + 2i, 1)         11 + 2i    S      (2i, 2 + i)         6 + 5i
(i, 1)              9 + 4i     S      (1 + i, 3)          7 + 4i
(2 + i, 1)          5 + 4i     S      (1 + i, i)          4 + i
                                      (1 + i, 2 + i)      10 + i
(2, 3)              11                (1 + 2i, 3)         1 + 2i
(2, i)              2 + 3i            (1 + 2i, i)         4 + 5i
(2, 2 + i)          8 + 3i            (1 + 2i, 2 + i)     10 + 5i
(i, 3)              3 + 4i            (2 + i, 3)          11 + 4i
(i, i)              i                 (2 + i, i)          8 + i
(i, 2 + i)          6 + i             (2 + i, 2 + i)      2 + i
(2i, 3)             9 + 2i            (2 + 2i, 3)         5 + 2i
(2i, i)             5i                (2 + 2i, i)         8 + 5i
                                      (2 + 2i, 2 + i)     2 + 5i


FIG. 2. 
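
The map of Example 8 is easy to tabulate by machine. The following is a minimal sketch, not part of the original paper: Gaussian integers are stored as pairs `(a, b)` meaning $a + bi$, and the helper names (`mul`, `reduce_mod_6alpha`, `f`) are hypothetical. It assumes the multiplier $3 - 6i$ used in the worked computation above and the representatives $0 \le a \le 11$, $0 \le b \le 5$.

```python
# Gaussian integers are stored as pairs (a, b) meaning a + bi.

def mul(z, w):
    """Product of two Gaussian integers."""
    a, b = z
    c, d = w
    return (a * c - b * d, a * d + b * c)

def reduce_mod_6alpha(z):
    """Representative a + bi of z modulo 6(1 + i) with 0 <= a <= 11, 0 <= b <= 5.

    Uses 6 + 6i = 0 to bring b into range, then 12 = (6 + 6i)(1 - i) = 0
    to bring a into range.
    """
    a, b = z
    q = b // 6                      # subtract q copies of 6 + 6i
    a, b = a - 6 * q, b - 6 * q
    return (a % 12, b)

def f(eta1, eta2):
    """Class eta with eta = eta1 (mod 3) and eta = eta2 (mod 2(1 + i))."""
    diff = (eta2[0] - eta1[0], eta2[1] - eta1[1])
    step = mul((3, -6), diff)       # 3 - 6i is 0 mod 3 and 1 mod 2(1 + i)
    return reduce_mod_6alpha((eta1[0] + step[0], eta1[1] + step[1]))

# Representatives of the two unit groups, as listed in Fig. 2.
PHI_3 = [(1, 0), (1, 1), (0, 2), (1, 2), (2, 0), (2, 2), (0, 1), (2, 1)]
PHI_2ALPHA = [(1, 0), (0, 1), (3, 0), (2, 1)]

image = {(e1, e2): f(e1, e2) for e1 in PHI_3 for e2 in PHI_2ALPHA}
```

For instance, `f((1, 1), (3, 0))` reproduces the class $[7 + 4i]$ computed in the text, and the 32 values in `image` are exactly the pairs satisfying the unit criterion conjectured in Example 8.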


9. Primitive Roots. Theorem 3 says that $\Phi_G(\pi^n)$ is cyclic, Theorem 4 implies that $\Phi_G(p^n)$ is
cyclic only if $n = 1$, while Theorems 5 and 6 yield that $\Phi_G(\alpha^n)$ is cyclic only if $n \le 3$. We
summarize:


$\Phi_G(\pi^n)$ is cyclic,
$\Phi_G(p^n)$ is cyclic only if $n = 1$,
$\Phi_G(\alpha^n)$ is cyclic only if $n \le 3$.


Thus $\alpha$, $\alpha^2$, $\alpha^3$, $\pi^n$, and $p$ have primitive roots. The product of cyclic groups is cyclic if and only
if their orders are relatively prime. Then $\alpha\pi^n$ and $\alpha p$ also have primitive roots, but these are the
only additional ones that we get in this manner. We therefore have the following theorem.


THEOREM 8. The Gaussian integers $\alpha$, $\alpha^2$, $\alpha^3$, $\pi^n$, $p$, $\alpha\pi^n$, and $\alpha p$ have primitive roots. These and
their associates are the only Gaussian integers having primitive roots.
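
Small cases of Theorem 8 can be checked by brute force. The sketch below is an illustration only, not the method of the paper: it stores a Gaussian integer $a + bi$ as the pair `(a, b)`, picks a canonical residue modulo $\beta$ by componentwise floor division, and tests whether some unit has order equal to the size of the unit group. All function names are hypothetical.

```python
def mul(z, w):
    """Product of two Gaussian integers given as pairs (a, b) = a + bi."""
    a, b = z
    c, d = w
    return (a * c - b * d, a * d + b * c)

def reduce(z, beta):
    """Canonical representative of z modulo beta (floor-based division)."""
    c, d = beta
    n = c * c + d * d                  # norm of beta
    x, y = mul(z, (c, -d))             # z * conj(beta), so z / beta = (x + yi) / n
    q = (x // n, y // n)               # componentwise floor of z / beta
    qb = mul(q, beta)
    return (z[0] - qb[0], z[1] - qb[1])

def unit_group(beta):
    """All invertible residues modulo beta, found by exhaustive search."""
    n = beta[0] ** 2 + beta[1] ** 2
    # Every class has a representative a + bi with 0 <= a, b < n.
    residues = {reduce((a, b), beta) for a in range(n) for b in range(n)}
    one = reduce((1, 0), beta)
    return [z for z in residues
            if any(reduce(mul(z, w), beta) == one for w in residues)]

def order(z, beta):
    """Multiplicative order of the unit z modulo beta."""
    one = reduce((1, 0), beta)
    k, power = 1, reduce(z, beta)
    while power != one:
        power = reduce(mul(power, z), beta)
        k += 1
    return k

def has_primitive_root(beta):
    """True when the unit group modulo beta is cyclic."""
    units = unit_group(beta)
    return any(order(z, beta) == len(units) for z in units)
```

With $\alpha = 1 + i$, the call `has_primitive_root((-2, 2))` tests $\alpha^3$ and `has_primitive_root((-4, 0))` tests $\alpha^4$; the search is quadratic in the norm of $\beta$, so it is only practical for small moduli.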


As we mentioned in the Introduction, Crowe proves Theorem 8. Since his paper is unpublished,
we comment on his methods. Versions of Theorems 1-3 appear in his work, so that he obtains
that $\Phi_G(\pi^n)$ is cyclic. He then observes that $\Phi_Z(2^m)$ or $\Phi_Z(2^{m+1})$ is included isomorphically in
$\Phi_G(\alpha^n)$, implying that $\alpha^n$ has no primitive root if $n > 3$. Finally, for $n > 1$, he finds two distinct
subgroups of order $p$ in $\Phi_G(p^n)$, implying that $\Phi_G(p^n)$ is not cyclic.
Department of Pure Mathematics, University of Liverpool, Liverpool L69 3BX, England


While it is impossible to say anything about everything, 
one can say almost everything about almost anything. 


Introduction. This article is about smooth curves embedded in the plane. We stick to this 
simplest (but far from trivial) case here, though our methods apply also to higher dimensions. An 
embedded curve C in the plane does not cross itself, and does not have kinks or cusps. (From 
Section 1, we only consider closed curves, without loose ends.) It does have a tangent line at each 
point. Some tangent lines have closer contact than others: the tangent line to $y = f(x)$ at
$(x_0, f(x_0))$ is said to have n-point contact iff the ith derivative, $f^{(i)}(x_0)$, of $f$ at $x_0$ is zero for each $i$
with $2 \le i \le n - 1$ and nonzero for $i = n$. Ordinary or 2-point contact just means $f''(x_0) \ne 0$;
higher contact is also called inflexional contact (Fig. 1). Thus, for example, when $n$ is an integer
$> 1$, the curve $y = x^n$ has exactly n-point contact with its tangent line (the x-axis) at $(0,0)$
(Fig. 2), and for $n \ge 3$ this contact is inflexional.


FIG. 1. Inflexional tangents.


FIG. 2.
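
For a polynomial the definition of n-point contact reduces to finding the first nonzero coefficient of degree at least 2. The following sketch (hypothetical helper names, contact measured at $x_0 = 0$ only) makes this concrete.

```python
def contact_order(coeffs):
    """Order of contact at x = 0 between y = f(x) and its tangent line there.

    coeffs[i] is the coefficient of x**i in the polynomial f, so the i-th
    derivative at 0 is i! * coeffs[i]; it vanishes exactly when coeffs[i] does.
    Returns the least n >= 2 with a nonzero n-th derivative, or None if no
    such n exists (the graph is then a straight line).
    """
    for n in range(2, len(coeffs)):
        if coeffs[n] != 0:
            return n
    return None

def is_inflexional(coeffs):
    """True when the tangent at x = 0 has at least 3-point contact."""
    n = contact_order(coeffs)
    return n is not None and n >= 3
```

For example, `contact_order([0, 0, 0, 0, 1])` describes $y = x^4$, whose tangent at the origin has 4-point contact.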


Sometimes a single line can be a bitangent of C (tangent at two different points of C), or even a 
tritangent (Fig. 3). 


FIG. 3. (Figure labels: tritangent, bitangent.)


Here are two results about tangents to plane curves. 

(a) Suppose that the tangents to C are all inflexional. Then C is a straight line (or several 
separate straight lines). 

(b) Almost all plane curves possess no tangents whose order of contact is $\ge 4$ and no
tritangents.

The results (a) and (b) could hardly be more different. The first has an extremely strong 
hypothesis, and tells us, not surprisingly, that virtually nothing satisfies this hypothesis—only 
straight lines. It is typical of the results of differential geometry of the form “if something (a 
curve) is sufficiently like something else (a line) at every point, then it is something else.” 

Result (b) cannot be accused of having a strong hypothesis; indeed, it may not be entirely
clear whether it has one at all. It asserts that, “in general,” plane curves do not have peculiarities
such as 4-point contact tangents or tritangents. If by accident they do, then these peculiarities can
be removed by a tiny change in the curve: they are unstable (Fig. 4), whereas the properties in
(b) are stable, that is, not removable by small perturbations. Generic Geometry is the study of properties
of smooth curves, surfaces and higher dimensional manifolds in a euclidean space which hold “in
general.” We shall be more precise in § 4.

Although 19th century geometers were interested in results which hold in general, the 
techniques to make these results precise and to prove them were not available until relatively 
recently. Following suggestions by René Thom, some very interesting results were proved by Ian 
Porteous [13] and Eduard Looijenga [11] in the early 1970’s. Since then many mathematicians 
have contributed to this exciting and significant development in differential geometry. 

FIG. 4.


We hope to convey the flavor of the subject in this article. In §§ 1-4, we take the reader
through a proof of part of result (b) which is more direct and geometrical than proofs obtained by
specializing more general theorems in the literature. We use the crucial concept of transversality, 
introduced in § 2, and a device which we call the Taylor map, introduced in § 1. After that, in 
§§ 5-6, we interpret result (b) as a theorem about duals and evolutes, using another keystone of 
modern singularity theory, versal unfoldings, which have here a simple and striking application. 
We cannot include all the details, of course, but we hope that the main themes are clear. 

We have no space for the more challenging results of Generic Geometry in higher dimensions, 
but recommend to the reader references [2], [3], [5], [11], [13], [14]. In fact [3], [14] are good general 
references. Many of the ideas sketched here are presented in more detail in [7]. A good 
introduction to singularity theory is [4], while the underlying theory of smooth manifolds is 
presented in [12], [10]. 


1. The Taylor Map. Throughout this article we consider smooth (infinitely differentiable)
functions and curves.

Let $e: S^1 \to \mathbf{R}^2$ be a smooth embedding of the unit circle $S^1$ into the plane $\mathbf{R}^2$. (Thus using,
say, the angle coordinate $t$ on $S^1$ and writing $e(t) = (X(t), Y(t))$, the functions $X$ and $Y$ are
smooth of period $2\pi$, the derivatives $X'(t)$ and $Y'(t)$ are never both zero for the same $t$, and
distinct values of $t$ map to distinct points of $\mathbf{R}^2$.) We call $e$ or its image $e(S^1) = C$ an embedded
curve in the plane. These embedded curves are the object of our study.

Let $V_k$ be the set of polynomials in a single variable (call it $\xi$, but for the most part it can
remain anonymous) of the form


$a_2\xi^2 + a_3\xi^3 + \cdots + a_k\xi^k$


where the $a_i$ are real numbers. Thus, $V_k$ is a real $(k - 1)$-dimensional vector space, which we can
think of as $\mathbf{R}^{k-1}$ with coordinates $(a_2, a_3, \ldots, a_k)$.

One of our main tools will be what we call the Taylor map

$\gamma: C \to V_k$,

defined as follows. Choose a point $p = e(t)$ of $C$. At $p$ there is a unit tangent vector $T(t)$ in the
direction $(X'(t), Y'(t))$ and a unit normal vector $N(t)$ in the direction $(-Y'(t), X'(t))$. This gives
us two axes at $p$, in the $T$ and $N$ directions, and we can write the equation of $C$, close to $p$ at any
rate, as, say, $\eta = f(\xi)$ relative to those axes. (Thus, $\xi$ is along the $T$ direction, $\eta$ along $N$.) See Fig.
5. Since the $\xi$-axis is tangent to $C$ at $p$, we have $f(0) = f'(0) = 0$ where $'$ here means $d/d\xi$. Of
course, $f$ depends on $p$, but for a given $p$, $f$ is uniquely determined. (Clearly it is also unchanged if
we reparametrize the curve $C$.) We can form the Taylor series (Maclaurin series) of $f$ at $\xi = 0$,
carried as far as the term $\xi^k$. This gives us the polynomial $\gamma(p)$ which we seek.

As a simple example, if $C$ is a circle of radius $r$, then for $k = 3$, $a_2 = 1/2r$ and $a_3$ is zero, no
matter what $p$ is. Thus, $\gamma$ has image consisting of the single point $(1/2r, 0) \in V_3$ in this case. Other
examples are rather harder to work out explicitly! See Fig. 6 below, and (2.2).


FIG. 5. Setting up the Taylor map of C.
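
The circle example can be verified directly. As a sketch, assume the circle is traversed counterclockwise, so that the normal $N$ points toward the centre; relative to the tangent and normal axes at any point $p$ the circle is then the graph below, expanded by the binomial series.

```latex
\[
  \eta \;=\; r - \sqrt{r^{2} - \xi^{2}}
       \;=\; \frac{\xi^{2}}{2r} + \frac{\xi^{4}}{8r^{3}} + O(\xi^{6}),
  \qquad\text{so}\qquad
  a_2 = \frac{1}{2r}, \quad a_3 = 0, \quad a_4 = \frac{1}{8r^{3}} = a_2^{3}.
\]
```

The expansion does not depend on $p$, which is why the image of the Taylor map collapses to a single point.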


The coefficients (or coordinates) $a_i$ have interpretations in terms of the curvature of $C$ at $p$, and
its derivatives. (See, for example, [9].) In fact, using the local equation $\eta = f(\xi)$, i.e., the
parametrization $\xi \mapsto (\xi, f(\xi))$, the curvature at $(\xi, f(\xi))$ of $C$ is


$\kappa(\xi) = f''(\xi) / (1 + (f'(\xi))^2)^{3/2}$.


It follows quite easily that $\kappa(0) = 2a_2$, $\kappa'(0) = 6a_3$ and $\kappa''(0) = 24(a_4 - a_2^3)$. (Remember that $'$
means $d/d\xi$. Actually, for the first two derivatives of $\kappa$, the derivative with respect to arc-length
on $C$ gives the same answers.) Thus, $p$ is an inflexion of $C$ iff $a_2 = 0$; a higher (or degenerate)
inflexion, where the tangent line has at least 4-point contact, iff $a_2 = a_3 = 0$; a vertex of $C$ (point
of stationary curvature) iff $a_3 = 0$; and a higher (or degenerate) vertex iff $a_3 = a_4 - a_2^3 = 0$. An
inflexion or vertex which is not higher is called ordinary.
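
The three curvature relations follow from a short expansion. The sketch below assumes only the truncated local form $f(\xi) = a_2\xi^2 + a_3\xi^3 + a_4\xi^4 + O(\xi^5)$ and the curvature formula for a graph.

```latex
\begin{align*}
  f'(\xi)  &= 2a_2\xi + O(\xi^{2}), \qquad
  f''(\xi)  = 2a_2 + 6a_3\xi + 12a_4\xi^{2} + O(\xi^{3}), \\
  \bigl(1 + f'(\xi)^{2}\bigr)^{-3/2} &= 1 - 6a_2^{2}\xi^{2} + O(\xi^{3}), \\
  \kappa(\xi) &= 2a_2 + 6a_3\xi + 12\,(a_4 - a_2^{3})\,\xi^{2} + O(\xi^{3}).
\end{align*}
```

Reading off the coefficients gives $\kappa(0) = 2a_2$, $\kappa'(0) = 6a_3$ and $\kappa''(0) = 24(a_4 - a_2^3)$.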

Let us take $k = 3$ and consider our map $\gamma: C \to V_3$. We picture $V_3$ as a plane with coordinates
$(a_2, a_3)$. The image of $\gamma$ will be a curve $\gamma(C)$ in $V_3$, or it might collapse as in the case of $C$ a
circle. If $\gamma(C)$ passes through the origin, then $C$ has a higher inflexion; if $\gamma(C)$ crosses the $a_2$-axis,
then $C$ has a vertex, and so on. Fig. 6 shows the curve $\gamma(C)$ for an ellipse and for a limacon.
(For the ellipse, $\gamma(C)$ is traversed twice: there are four vertices.)


FIG. 6. Image of Taylor map of ellipse. Image of Taylor map of limacon.


Suppose $\gamma(C)$ does pass through the origin. Then, we would expect that by slightly deforming
$C$ to $C'$, say, $\gamma(C')$ could be made to miss the origin: it should be “accidental” that $\gamma(C)$ passes
through the selected point $(0,0)$. Thus, we expect that curves with higher inflexions will be
exceptional.

Likewise with $k = 4$, using $\gamma: C \to V_4$, we obtain a curve $\gamma(C)$ in the 3-space with coordinates
$(a_2, a_3, a_4)$. (Again $\gamma(C)$ might collapse, of course.) The set with equations $a_3 = a_4 - a_2^3 = 0$ in
$V_4$ is a curve (see Fig. 7) and “in general” we would expect $\gamma(C)$ to miss this curve: if $\gamma(C)$ does
happen to intersect it, then we would expect that by changing $C$ slightly to $C'$, we could arrange
for $\gamma(C')$ to miss the curve. Thus, we would expect a “general” $C$ to have no higher vertices. The
set $a_3 = 0$ is a plane in $V_4$, and if $\gamma(C)$ meets this plane (but does not touch it), we would expect
nearby curves $\gamma(C')$ to continue to meet the plane. (See Fig. 8.) This suggests that ordinary
vertices persist under small changes in $C$.


FIG. 7. The higher vertex set $a_3 = a_4 - a_2^3 = 0$.


FIG. 8. (Figure labels: $\gamma(C)$, $\gamma(C')$.)


To sum up, it seems reasonable that the following properties will hold “in general” for curves 
C in the plane: 


(1.1) (i) C has no higher inflexions, 
(ii) C has no higher vertices. 

It is pretty clear that there are objections to this kind of argument: apart from its general air of
vagueness, it is far from obvious how changes in $C$ are going to affect the curve $\gamma(C)$. There are
indeed hidden constraints on the curve $\gamma(C)$: for instance, when $k = 3$, $\gamma(C)$ cannot touch the
axis $a_2 = 0$ except at the origin, and a famous theorem called the four vertex theorem [9, p. 36]
implies that at least four points of $C$ must map to the axis $a_3 = 0$.

In the following sections, we try to show how modern singularity theory overcomes these 
obscurities. We hope that an occasional lack of detail in what follows (necessitated by pressure of 
space) will not be mistaken for a more complicated form of obscurity. 


2. Transversality. Two embedded curves $C_1$ and $C_2$ in the plane are said to be transverse at an
intersection point $q$ if their tangent lines at $q$ are different. This is the same as saying that the
tangent vectors $T_1$ and $T_2$ to the curves at $q$ span all the “tangents to the plane” at $q$: every vector
based at $q$ is a linear combination $\alpha T_1 + \beta T_2$ of $T_1$ and $T_2$ with real coefficients $\alpha$, $\beta$. See Fig. 9.


FIG. 9. Transverse curves.


More generally in a Euclidean space $V = \mathbf{R}^2$, $\mathbf{R}^3$ or $V_k$ (regarded as $\mathbf{R}^{k-1}$), two smooth
submanifolds $Q_1$ and $Q_2$ (think of each as a point, a curve or a surface) are transverse at an
intersection point $q \in V$ provided every vector based at $q$ is a linear combination of a tangent
vector to $Q_1$ and a tangent vector to $Q_2$. If $Q_1$ is a single point $q$, then it has no nonzero tangent
vectors, so when $q \in Q_2$, $Q_2$ is transverse to $q$ provided every vector based at $q$ is tangent to $Q_2$.
In that case $Q_2$ contains a neighborhood of $q$.

We call $Q_1$ and $Q_2$ transverse if they are transverse at every $q \in Q_1 \cap Q_2$. This has the usual
mathematical quirk that if $Q_1$ and $Q_2$ fail to intersect, then they are automatically transverse, for
there cannot possibly be a $q$ at which they fail to be transverse!

Two surfaces in $\mathbf{R}^3$, or a curve and a surface in $\mathbf{R}^3$ are transverse at a common point $q$
provided they do not just touch at $q$: one cuts cleanly through the other. Two curves in $\mathbf{R}^3$ can
never be transverse at a common point, for the dimensional reason that two vectors are not
enough to span a three-dimensional space. Thus if we can prove that two curves in $\mathbf{R}^3$ are
transverse, then the curves cannot intersect. The same goes for a curve and a point in $\mathbf{R}^2$ (or $\mathbf{R}^3$):
if they are transverse, then the point cannot lie on the curve.

For the Taylor map $\gamma: C \to V_k$ as in §1, we want to say that $\gamma$ is transverse to a submanifold
(point, curve, surface, ...) $Q$ in $V_k$ if $Q$ and $\gamma(C)$ are transverse. Because $C$ may get collapsed by
$\gamma$, we have to spell this out more carefully.

Let $p_0 \in C$ have parameter say $t_0 \in S^1$ so that $p_0 = e(t_0)$. Then as the real variable $s$ moves
away from zero, $t_0 + s$ gives us a path in $C$, consisting of points $p_s = e(t_0 + s)$. Using $\gamma$, we get a
path in $V_k$ consisting of points $\gamma(p_s)$. The tangent vector to $C$ at $p_0$ is taken, by definition, to the
vector


(2.1)   $v = \lim_{s \to 0} \dfrac{\gamma(p_s) - \gamma(p_0)}{s}$

based at $\gamma(p_0)$ in $V_k$. (Thus $v$ is the derivative of $\gamma \circ e$ at $t_0$.) See Fig. 10.
If $\gamma(C)$ is a genuine embedded curve in $V_k$, then the vector $v$ is simply its tangent vector at
$\gamma(p_0)$, but the definition works in any case. For example, if $C$ is a circle, then $v$ is always zero.


FIG. 10.


We say that $\gamma$ is transverse to $Q$ provided the following holds for every $p_0 \in C$ with
$\gamma(p_0) = q_0 \in Q$: every vector based at $q_0$ in $V_k$ is a linear combination of the above vector $v$ and
tangent vectors to $Q$ at $q_0$.


(2.2) Example. Suppose that $C$ is given, close to $(0,0)$, by the equation $y = x^2$, which we can
parametrize by $e(t) = (t, t^2)$. Take $p_0 = (0,0)$, $p_s = (s, s^2)$. Let $k = 3$ and take for $Q$ the $a_2$-axis
in $V_3$, which has tangent vector $(1,0)$ (along the $a_2$-axis) at $q_0 = (1,0) \in V_3$. (See Fig. 11.) A
straightforward calculation shows that


$\gamma(p_s) = \bigl((1 + 4s^2)^{-3/2},\; -4s(1 + 4s^2)^{-3}\bigr)$


so that the vector $v$ defined in (2.1) works out at $(0, -4)$. Since every vector in $\mathbf{R}^2$ is a linear
combination of $(1,0)$ and $(0, -4)$, $\gamma$ is indeed transverse to the $a_2$-axis at $q_0$.


FIG. 11. Example (2.2).
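
The “straightforward calculation” of Example (2.2) can be checked numerically. The sketch below is an illustration under stated assumptions: it uses the relations $a_2 = \kappa/2$ and $a_3 = \kappa'/6$ from § 1 (with the derivative taken with respect to arc length), the standard curvature $2/(1 + 4x^2)^{3/2}$ of the parabola, and finite differences. The function names are hypothetical.

```python
import math

def kappa(x):
    """Curvature of the parabola y = x**2 at the point (x, x**2)."""
    return 2.0 / (1.0 + 4.0 * x * x) ** 1.5

def taylor_map(s, h=1e-5):
    """Numerical (a_2, a_3) at p_s = (s, s**2), using a_2 = kappa/2, a_3 = kappa'/6.

    kappa' is the derivative with respect to arc length, obtained from a
    central difference in x divided by the speed sqrt(1 + 4 x**2).
    """
    dkappa_dx = (kappa(s + h) - kappa(s - h)) / (2.0 * h)
    speed = math.sqrt(1.0 + 4.0 * s * s)
    return (kappa(s) / 2.0, dkappa_dx / speed / 6.0)

def closed_form(s):
    """The formula for gamma(p_s) quoted in Example (2.2)."""
    u = 1.0 + 4.0 * s * s
    return (u ** -1.5, -4.0 * s * u ** -3)

def tangent_vector(h=1e-4):
    """Central-difference estimate of v = d/ds gamma(p_s) at s = 0."""
    plus, minus = closed_form(h), closed_form(-h)
    return ((plus[0] - minus[0]) / (2 * h), (plus[1] - minus[1]) / (2 * h))
```

The numerical values of `taylor_map(s)` agree with `closed_form(s)` to within the finite-difference error, and `tangent_vector()` is close to $(0, -4)$.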


One of our objectives is to show that, for “most” curves $C$, $\gamma$ is transverse to the origin for
$k = 3$ and to the curve with equations $a_3 = a_4 - a_2^3 = 0$ for $k = 4$. From these and the definition
of transversality it will follow that $\gamma(C)$ does not meet these sets at all, and this implies (i) and (ii)
of (1.1). Actually we shall prove a much more general transversality statement.


3. Families of Curves. Starting with an embedded curve $C$ in $\mathbf{R}^2$, we can deform $C$ by means
of a collection $U$ of smooth maps $\mathbf{R}^2 \to \mathbf{R}^2$. Thus if $\psi \in U$ and $C$ is given by $e: S^1 \to \mathbf{R}^2$, then
$\psi(C)$ is given by $\psi \circ e$, which takes $t \in S^1$ to $\psi(e(t)) \in \mathbf{R}^2$. We shall always take $U$ to be an open
set of some euclidean space $\mathbf{R}^N$. For example, $U$ could consist of all translations of $\mathbf{R}^2$


($\psi(x, y) = (x + c_1, y + c_2)$, $U = \mathbf{R}^2 = \{(c_1, c_2)\}$)


or all affine maps


($\psi(x, y) = (c_1 + c_2 x + c_3 y,\; c_4 + c_5 x + c_6 y)$, $U = \mathbf{R}^6 = \{(c_1, \ldots, c_6)\}$).

Suppose $\psi$ is such that $\psi(C)$ is still an embedded curve. Then there is a corresponding map

$\gamma_\psi: \psi(C) \to V_k$.

All of these can be fitted into a family

$\Gamma: C \times U \to V_k$

defined by $\Gamma(p, \psi) = \gamma_\psi(\psi(p))$. In other words, given $p \in C$ and a transformation $\psi$ we work out
the Taylor expansion of the transformed curve $\psi(C)$ at the point $\psi(p)$ and write down its terms
up to degree $k$ (or just the coefficients of those terms).


As an example, let $C$ be the circle given by $e(t) = (\cos t, \sin t)$, and let $\psi(x, y) = (x, (\lambda + 1)y)$
where $\lambda \in \mathbf{R}$, $\lambda \ne -1$. Then $\psi$ takes $C$ to the ellipse $\psi(C)$ consisting of points $(\cos t, (\lambda + 1)\sin t)$.
As $\lambda$ moves away from zero, the image $\gamma_\psi(\psi(C))$ in $V_3$ changes from a point $(\tfrac{1}{2}, 0)$ to a genuine
curve, as illustrated in Fig. 12.


FIG. 12. Images of the Taylor map for a family of ellipses.


Just as we can speak of $\gamma: C \to V_k$ being transverse to a submanifold $Q$ of $V_k$, we can speak of
the whole family $\Gamma$ being transverse to $Q$. Here is a simple example.


(3.1) Let $C$ be as in (2.2). Let $U$ be the set of maps $\mathbf{R}^2 \to \mathbf{R}^2$ of the form $\psi_\lambda(x, y) =
(x, (\lambda + 1)y)$ where $\lambda$ is so close to 0 that $\psi_\lambda(C)$ is still an embedded curve. Let us examine the
map


$\Gamma: C \times U \to V_3$,


which takes $(p, \psi_\lambda)$ to the coefficients $(a_2, a_3)$ in the Taylor expansion of $\psi_\lambda(C)$ at $\psi_\lambda(p)$. Note
that $\Gamma(p_0, \psi_0) = (1,0) = q_0$, say. We shall suppose for simplicity that no other $(p, \psi)$ goes by $\Gamma$
to $q_0$. The image of $\Gamma$ is a whole system of curves in $V_3$ parametrized by $\lambda$. Taking the path
$s \mapsto (p_0, \psi_s)$ in $C \times U$, we obtain from $\Gamma$ the path $s \mapsto \Gamma(p_0, \psi_s) = (s + 1, 0)$ in $V_3$. See Fig. 13.
(For $\psi_s(C)$ is the curve $y = (s + 1)x^2$ close to $(0,0)$, and at $\psi_s(p_0) = p_0$ the Taylor expansion
$(s + 1)x^2 + 0x^3$ gives $a_2 = s + 1$, $a_3 = 0$.) Hence, the path in $V_3$ gives us the vector


$\lim_{s \to 0} \dfrac{(s + 1, 0) - (1, 0)}{s} = (1,0)$


in $V_3$ based at $q_0$. This is the image under $\Gamma$ of a tangent vector to $C \times U$ at $(p_0, \psi_0)$. We already
know from (2.2) that the path $s \mapsto (p_s, \psi_0)$ in $C \times U$ gives us the vector $(0, -4)$ at $q_0$. Because
$(1,0)$ and $(0, -4)$ span all vectors in $\mathbf{R}^2$ this shows that the whole family $\Gamma$ is transverse to the
point $q_0$ in $V_3$.

This is the kind of result we shall need. The reason is the following crucial lemma. This lemma
does not assert something about all members of the set $U$ which parametrizes our family of curves,
but about almost all members. This phrase has a precise technical meaning, which we say
something about below, but the reader will have the right idea if he thinks of “all but a tiny
minority of elements of $U$.”


FIG. 13. Example of a family of curves. (Figure label: $q_0 = (1,0)$.)


(3.2) TRANSVERSALITY LEMMA OF SARD AND THOM. Suppose that $\Gamma: C \times U \to V$ is transverse
to the submanifold $Q$ of $V$. Then for almost all $\psi \in U$ the map $\Gamma_\psi: C \to V$ defined by $\Gamma_\psi(p) = \Gamma(p, \psi)$
is transverse to $Q$.


Note that $\Gamma_\psi(p) = \gamma_\psi(\psi(p))$ where $\gamma_\psi$ is the Taylor map of the curve $\psi(C)$.

By “almost all” we mean all $\psi \in U$ except those of a set of measure zero in $U$. The precise
technical meaning of this standard term from the theory of Lebesgue measure need not concern
us. A finite or countable set of points in a line has measure zero in the line; a finite or countable
set of points and lines has measure zero in the plane, as has any smooth curve $C$. We do need to
know that if $\Omega$ has measure zero in $U$, then $U - \Omega$ is dense in $U$: every point of $U$ has points of
$U - \Omega$ arbitrarily close to it. Thus, a “bad” $\psi \in \Omega$ is arbitrarily close to “good” $\psi' \in U - \Omega$. We
also need to know that if $\Omega_1, \ldots, \Omega_l$ have measure zero, then so does their union: almost all $\psi$ fail
to belong to any of the $\Omega_i$.


FIG. 14. Pictorial proof of (3.2). (Figure labels: $U$, $\Gamma^{-1}(Q)$, $C \times \{\psi\}$.)


Here is a pictorial indication of why (3.2) is true; actually the formal proof ([14], p. 739) is
surprisingly similar. The fact that $\Gamma$ is transverse to $Q$ actually makes $\Gamma^{-1}(Q)$ a smooth manifold
in $C \times U$ (see Fig. 14). It turns out that $\Gamma_\psi$ is transverse to $Q$ if and only if $C \times \{\psi\}$ intersects
$\Gamma^{-1}(Q)$ transversely. As is plausible from the picture, most $\psi$ will give a “horizontal line” $C \times \{\psi\}$
which is transverse to, i.e., fails to touch, $\Gamma^{-1}(Q)$. This last is actually a consequence of Sard’s
theorem (see, for example, [12]) on critical values. Thom’s crucial idea was to turn Sard’s theorem
into a transversality statement.

In the example (3.1) above, with $Q = \{q_0\}$, the only map $\Gamma_\psi$ whose image passes through $Q$ at
all is with $\psi = \psi_0$. Now $\Gamma_{\psi_0}$ is not transverse to $Q$ but every other $\Gamma_\psi$ is transverse to $Q$ (because
there is no point where transversality has to be checked). Thus, the “bad” $\psi$ make up one point
$\lambda = 0$ out of a whole interval of values of $\lambda$, and indeed a point has measure zero in an interval.

If we want transversality to several different submanifolds $Q_1, \ldots, Q_l$ say, then we can apply
Thom’s lemma to each of the $Q_i$ separately, obtaining $l$ sets of measure zero in $U$. The union $\Omega$ of
these $l$ sets will still be of measure zero and for $\psi \in U - \Omega$ we shall have $\Gamma_\psi$ transverse to all the
$Q_i$. Thus, in order to show that, for almost all $\psi$, $\Gamma_\psi$ is transverse to all the $Q_i$, we need only prove
$\Gamma$ is transverse to all the $Q_i$.

Let us return to the case of a single $Q$. The way in which one proves that $\Gamma$ is transverse to $Q$ is
to take paths in $C \times U$ through $(p_0, \psi_0) \in \Gamma^{-1}(Q)$, as in (3.1), and to calculate the corresponding
vectors at $\Gamma(p_0, \psi_0) \in V_k$. In fact, we shall always find, in the important examples below, that
it is enough to take paths of the form $s \mapsto (p_0, \psi_s)$ where only $\psi_s$ varies.


4. Generic Properties of Curves. Let $C$ be a curve embedded in $\mathbf{R}^2$ and let $P_k$ be the set of
maps $\psi: \mathbf{R}^2 \to \mathbf{R}^2$ of the form


$\psi(x, y) = (\psi_1(x, y), \psi_2(x, y))$


where each $\psi_i$ is a polynomial in $x$ and $y$ whose degree is $\le k$. Thus an element $\psi \in P_k$ is
determined by the coefficients of the various monomials $x^i y^j$ occurring in $\psi_1$ and $\psi_2$. There are
altogether $1 + 2 + \cdots + (k + 1) = \tfrac{1}{2}(k + 1)(k + 2)$ monomials of degree $\le k$ so $P_k$ can be
thought of as a euclidean space $\mathbf{R}^N$ where $N = (k + 1)(k + 2)$. Two elements $\psi$, $\psi'$ of $P_k$ are close
together provided the coefficients of $\psi_1$ are close to the corresponding coefficients of $\psi_1'$ and
likewise for $\psi_2$, $\psi_2'$.
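
The dimension count can be confirmed by listing the monomials explicitly. This is a small sketch with hypothetical function names; it simply enumerates the exponent pairs $(i, j)$ with $i + j \le k$ and doubles the count for the two components of a map in $P_k$.

```python
def monomials_up_to(k):
    """Exponent pairs (i, j) of the monomials x**i * y**j with i + j <= k."""
    return [(i, j) for i in range(k + 1) for j in range(k + 1 - i)]

def dimension_of_P(k):
    """Number of real coefficients of a pair of polynomials of degree <= k."""
    return 2 * len(monomials_up_to(k))
```

For $k = 3$ there are 10 monomials, so $P_3$ is identified with a euclidean space of dimension 20.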
The identity map id: $\mathbf{R}^2 \to \mathbf{R}^2$, $\mathrm{id}(x, y) = (x, y)$ is of course an element of $P_k$ provided $k \ge 1$
(we shall always have $k \ge 2$ in fact). Let $U_k$ consist of those $\psi \in P_k$ which are so close to the
identity that $\psi(C)$ is still an embedded curve in $\mathbf{R}^2$. (Using the compactness of $C$ ($C$ is closed and
bounded in $\mathbf{R}^2$) it actually follows that all elements of $P_k$ sufficiently near id will be in $U_k$.) Thus,


$\psi_1(x, y) = x + \phi_1(x, y)$, $\psi_2(x, y) = y + \phi_2(x, y)$


say, where all the coefficients in $\phi_1$ and $\phi_2$ are sufficiently small.
Let $Q$ be a submanifold of $V_k$. We propose to prove:


(4.1) For some open set $U$ of $U_k$, containing the identity map, $\Gamma: C \times U \to V_k$ (see §3) is
transverse to $Q$.


In fact, $Q$ does not enter the argument at all, for we show that we can obtain every vector
based at $q = \Gamma(p, \psi)$ simply from tangent vectors to $C \times U$; tangent vectors to $Q$ are not needed.
($\Gamma$ is then called a submersion.)

First, a technicality. What we actually do is to take points $(p, \mathrm{id})$ and show that every vector at
$\Gamma(p, \mathrm{id})$ can be obtained from tangent vectors to $C \times U_k$. Using the compactness of $C$, a standard
technical argument then shows that the same holds for $(p, \psi)$ for all $\psi$ in a sufficiently small
neighborhood of the identity. (Alternatively a variant of the argument given shows that the same
holds for any $(p, \psi)$ with $\psi \in U_k$, which is better still.) The idea behind the proof of (4.1) is best
illustrated by suppressing this part of the argument and sticking to points $(p, \mathrm{id})$.

By choosing our coordinates, we can in fact assume that $p = (0,0) \in \mathbf{R}^2$ and that close to $p$ the
curve $C$ is given by the equation $y = f(x)$, where $f(0) = f'(0) = 0$. Thus, $\Gamma(p, \mathrm{id})$ = terms of
degree $\le k$ in the Taylor series of $f$ at 0. (To see this, let us do a rigid motion $\delta$ of $\mathbf{R}^2$ to bring $C$
into standard position, with $p$ moved to $(0,0)$ and the tangent line moved to the x-axis. This will
not affect the result because referring our transformations $\psi$ to the new coordinates leaves them of
the same degree, and also the Taylor expansion of $C$ at $p$ is the same as that of $\delta(C)$ at $\delta(p)$. The
result about $\Gamma(p, \mathrm{id})$ now follows from the definition of $\Gamma$.)

Let $g(x)$ be a polynomial in $x$ of degree $\ge 2$ and $\le k$, and let $\psi_s(x, y) = (x, y + sg(x))$
where $s \in \mathbf{R}$. Then, for all small $s$, $\psi_s \in U_k$ and we can use the path $s \mapsto (p, \psi_s)$ in $C \times U_k$. Now
$\psi_s(t, f(t)) = (t, f(t) + sg(t))$ so $\psi_s(C)$ is still in the form where we can read off the Taylor
expansion; this is why we chose $g$ to have degree $\ge 2$. Namely, $\Gamma(p, \psi_s)$ = terms of degree $\le k$
in the Taylor expansion of $f + sg$. The corresponding vector in $V_k$ based at $\Gamma(p, \mathrm{id})$ is


$\lim_{s \to 0} \dfrac{(\text{terms of degree} \le k \text{ in } f + sg) - (\text{terms of degree} \le k \text{ in } f)}{s}$
= terms of degree $\le k$ in $g$.


By choosing $g$, we can therefore obtain every vector at $\Gamma(p, \mathrm{id})$ from paths in $C \times U$ and
therefore the result is proved. □
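
To see concretely why the choice of $g$ is free enough, one can take $g$ to be a single monomial. In the sketch below, $j_k$ is a notation introduced here (not in the original) for “terms of degree $\le k$ of the Taylor expansion at 0”.

```latex
\[
  g(x) = x^{j} \ (2 \le j \le k)
  \quad\Longrightarrow\quad
  \lim_{s \to 0} \frac{j_k(f + s\,x^{j}) - j_k(f)}{s} \;=\; x^{j},
\]
\[
  \text{i.e. the vector } (0, \ldots, 0, 1, 0, \ldots, 0)
  \text{ with the } 1 \text{ in the } a_j \text{ slot of } (a_2, a_3, \ldots, a_k).
\]
```

These $k - 1$ vectors form the standard basis of $V_k$, so paths of this kind alone already span every vector at $\Gamma(p, \mathrm{id})$.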

Now we apply (4.1) and the Sard-Thom lemma (3.2) to some examples. (For some harder 
examples, see [6].) 


(4.2) Take $k = 3$ and $Q = \{(0,0)\} \subset V_3$. Thus, the Taylor map $\gamma: C \to V_3$ takes $C$ to a curve
through $Q$ if and only if $C$ has a higher inflexion (compare § 1). Put $\gamma$ in a family $\Gamma$ as in (4.1), $\Gamma$
being transverse to $Q$. Applying (3.2), we have that for a dense set of $\psi \in U$, $\Gamma_\psi: C \to V_3$ is
transverse to $Q$, and hence $\Gamma_\psi(C)$ misses $Q$: for dimensional reasons transversality is impossible if
they intersect. Hence, there are arbitrarily small deformations of $C$, given by such $\psi$ arbitrarily
close to the identity, which have no higher inflexions. The same result could be obtained by taking
$k = 4$ and the line $a_2 = a_3 = 0$ as $Q$.


(4.3) Take $k = 4$ and let $Q = \{(a_2, a_3, a_4) \in V_4 : a_3 = a_4 - a_2^3 = 0\}$. Thus, the Taylor map $\gamma$:
$C \to V_4$ has $\gamma(C) \cap Q$ nonempty if and only if $C$ has a higher vertex. Reasoning as before, we
find that there are arbitrarily small deformations of $C$ which have no higher vertices.


(4.4) As remarked in § 3, we can work with several submanifolds $Q_i$ at once. Taking $k = 4$,

$Q_1 = \{(a_2, a_3, a_4) : a_2 = a_3 = 0\}$

and

$Q_2 = \{(a_2, a_3, a_4) : a_3 = a_4 - a_2^3 = 0\}$,

we know that the $\Gamma$ of (4.1) will be transverse to both $Q_i$, so applying (3.2) shows that there are
arbitrarily small deformations of $C$ which have neither higher inflexions nor higher vertices.


(4.5) This is marginally more technical. Take $k = 3$ and let $Q_1$ be the $a_2$-axis, $Q_2$ the $a_3$-axis.
Then there exist $\psi$ arbitrarily close to the identity for which $\gamma_\psi: \psi(C) \to V_3$ is transverse to $Q_1$ and
$Q_2$. We can deduce from that (once again using the compactness of $C$) that $\gamma_\psi^{-1}(Q_1)$ and
$\gamma_\psi^{-1}(Q_2)$ are finite sets. Hence, $\psi(C)$ will have only a finite number of ordinary inflexions and
ordinary vertices ($a_2 = 0$ gives an inflexion and $a_3 = 0$ a vertex). There is no upper limit to the
number of inflexions or vertices, of course. The 4-vertex theorem ([9], p. 36) asserts that there will
be at least 4 vertices, but there could be no inflexions, as with a convex curve.

When we suggested in (1.1) that certain properties were “generic” for plane curves, we had in
mind a specific meaning for “generic.” Consider some property $P$ of embedded curves $e$:
$S^1 \to \mathbf{R}^2$, such as “possessing no higher inflexion” or “having a finite number of vertices.”
Generic geometers are interested only in geometrical properties, so we shall always take a $P$ which
is invariant under changes of parametrization and under rigid motions in $\mathbf{R}^2$.


(4.6) The property $P$ is said to be dense (or, if you prefer, curves with property $P$ are said to be
dense in “the set of all curves”) if every non-$P$ curve can be taken by an arbitrarily small
deformation into a $P$-curve. More precisely, for each curve $e$, we can find a family

$E: S^1 \times U \to \mathbf{R}^2$

where $U$ is a neighborhood of a point $u_0$ in some euclidean space (thought of as $\mathbf{R}^N$ for some $N$),
such that

(i) $E(t, u_0) = e(t)$ for all $t \in S^1$, and

(ii) every neighborhood of $u_0$ in $U$ contains a point $u$ for which the curve given by $E_u$:
$S^1 \to \mathbf{R}^2$ ($E_u(t) = E(t, u)$) has property $P$. (In the definition, the deformation does not have to
be effected by a transformation $\mathbf{R}^2 \to \mathbf{R}^2$, but in all our examples that is how it is done.)

Thus, we have shown above that the properties of having no higher inflexions, no higher
vertices (also finitely many inflexions or vertices) are dense. The space $U$ consisted of polynomial
transformations $\psi$, and $E(t, \psi) = \psi(e(t))$.


(4.7) The property $P$ is said to be open if all curves close to a $P$-curve in a family are also
$P$-curves. More precisely, for any embedded curve $C$, given by $e$ as above, having the property $P$,
and any family


$E: S^1 \times U \to \mathbf{R}^2$


with $U$ as before and $E(t, u_0) = e(t)$ for all $t \in S^1$, the curves given by $E_u$ for all $u$ sufficiently
close to $u_0$ also have property $P$.

For the properties we are considering, which can be described by transversality to certain
closed submanifolds $Q$ of $V_k$, openness is actually automatic, using (again) the compactness of $C$.
Roughly speaking, if $\gamma_e$ is transverse to $Q$, so is $\gamma_{e'}$ for any $e'$ close to $e$. In fact, all the properties
considered here are “open” in the slightly stronger sense that if $C$ has one of these properties, then
so does any curve given by an embedding “sufficiently close to $e$.” We shall not go into this more
technical matter here.


(4.8) A property $P$ is called generic, or is said to hold generically, iff $P$ is both open and dense.
Note that if $P_1, \ldots, P_n$ are all generic, then so is the property $P$ = ($P_1$ and $P_2$ and ... and $P_n$).


(4.9) The following properties of plane curves are all generic:
(i) there are no higher inflexions;
(ii) there are no higher vertices;
(iii) there are finitely many (or no) ordinary inflexions;
(iv) there are finitely many (or no) ordinary vertices. □

Notice that it is specific properties, and not specific curves, which are generic. A given curve
may have some generic properties and some nongeneric ones (a circle has property (iii) above, but
is highly nongeneric in its possession of vertices!). Also the list of generic properties can be
extended indefinitely, and no curve could be so general as to possess all these properties! (Can
you see why?) The important thing is that curves which have a specific property, known to be
generic, constitute “most” curves in the open and dense sense explained above.


5. Duals and Evolutes. We now introduce some ideas from singularity theory to show how it is 
possible to deduce, from the genericity result (4.9), properties of duals and evolutes of plane 
curves. We give more detail for the former. 

Let $e: S^1 \to \mathbf{R}^2$ give an embedded curve $C = e(S^1)$. We consider a family of functions on C
parametrized by another space W, that is, we consider a map $F: S^1 \times W \to \mathbf{R}$.


(5.1) Height functions. Let W be the set of directions in $\mathbf{R}^2$, which we think of as given by unit
vectors, i.e., $W = S^1$, so our family is of the form

$H: S^1 \times S^1 \to \mathbf{R}$.

Let $H(t, w) = e(t) \cdot w$ = “height” of e(t) above the line through (0,0) perpendicular to w. See
Fig. 15.

Fig. 15. Height function.


Consider the condition $\partial^i H/\partial t^i(t_0, w_0) = 0$ for $i = 1, \ldots, m$; $\neq 0$ for $i = m + 1$. When this
holds we say that the function $h(t) = H(t, w_0)$, for the fixed $w_0$, has “type $A_m$ at $t_0$.”

To see what this means take $w_0 = (0, 1)$ (“vertical”) and e(t) = (t, f(t)) so that C has equation
y = f(x). Then $H(t, w_0) = f(t)$ so the above condition just says that the tangent to C at
$(t_0, f(t_0))$ is horizontal (parallel to the x-axis) and has (m + 1)-point contact with C. The same
goes in general and we have, writing $N(t_0)$ for the normal to e at $e(t_0)$,


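(5.2) m = 1 (type $A_1$) $\Leftrightarrow$ $w_0$ parallel to $N(t_0)$ but $e(t_0)$ not an inflexion of C
      m = 2 (type $A_2$) $\Leftrightarrow$ $w_0$ parallel to $N(t_0)$ and $e(t_0)$ an ordinary inflexion of C
      m $\ge$ 3 (type $A_{\ge 3}$) $\Leftrightarrow$ $w_0$ parallel to $N(t_0)$ and $e(t_0)$ a higher inflexion of C.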


The set

$CV(H) = \{(r, w) \in \mathbf{R} \times S^1 : \text{for some } t,\ (\partial H/\partial t)(t, w) = 0 \text{ and } H(t, w) = r\}$

has an interesting interpretation. (CV(H) is also the set of critical values of the map $S^1 \times S^1 \to \mathbf{R} \times S^1$
defined by $(t, w) \mapsto (H(t, w), w)$.) Let T(t) be the unit tangent vector to the curve C at
e(t), so that T(t) is a nonzero multiple of the vector de/dt.

Then $(\partial H/\partial t)(t, w) = 0 \Leftrightarrow T(t) \cdot w = 0 \Leftrightarrow w = \pm N(t)$ where N(t) is the unit normal vector
to C at e(t), defined by turning T(t) anticlockwise through 90°.

Oriented (directed) lines in $\mathbf{R}^2$ are in one-to-one correspondence with points $(r, w) \in \mathbf{R} \times S^1$,
as follows. Given a line, let v be a unit vector parallel to the line, and define w by rotating v
clockwise through 90°. Define $r = x \cdot w$ where x is any point of the line (Fig. 16). With this
interpretation, CV(H) falls into two parts: one consists of the tangent lines to C oriented via the
embedding e (so that the tangent line at e(t) has direction de/dt) and the other consists of
the same lines oriented the other way (direction −de/dt). Choosing one of the two parts (say the
first) we get the dual of C.

The dual thus lies in $\mathbf{R} \times S^1$ which is a cylinder. But locally we can flatten the cylinder on the
plane, i.e., use the angle coordinate θ on $S^1$ so that a neighborhood of $(r_0, w_0)$ in $\mathbf{R} \times S^1$ becomes
a neighborhood of $(r_0, \theta_0)$ in $\mathbf{R} \times \mathbf{R}$, where $w_0 = (\cos\theta_0, \sin\theta_0)$.

We can also get a global picture of most of CV(H) in $\mathbf{R}^2$ by projecting $\mathbf{R} \times S^1 \to \mathbf{R}^2$,
$(r, w) \mapsto rw$ (this is like using polar coordinates). Points $(0, w) \in CV(H)$, which are occasioned
by tangents to C passing through (0,0), now all go to the origin, so the projection is not an
accurate picture of CV(H) at such points. A more minor matter is that (r, w) and (−r, −w) both
project to the same point of $\mathbf{R}^2$: the projection identifies the two parts of CV(H) to one curve in
$\mathbf{R}^2$. It is a pleasant exercise to verify that the projection of CV(H) is the pedal curve of C relative
to (0,0): the locus of the foot of the perpendicular from (0,0) to the tangent to C. Here are two
examples where the tangent to C never passes through (0,0), so the pedal curve drawn is an
accurate picture of the dual. See Fig. 17.

Fig. 16. Parametrizing oriented lines.

More about CV(H) in § 6.


Fig. 17. Limaçon with two inflexions, and its pedal curve; limaçon with a higher inflexion, and its pedal curve.


(5.3) Distance-squared functions. Other families of functions are connected with other geometrical
objects. For example let

$D: S^1 \times \mathbf{R}^2 \to \mathbf{R}$

be given by $D(t, w) = \|e(t) - w\|^2$: the distance-squared from $w \in \mathbf{R}^2$ to $e(t) \in \mathbf{R}^2$. We do not
have space for all the details here (see [3] or [7]) but just mention the following. (i) For curves
satisfying the generic property 4.9(ii) only $A_1, A_2, A_3$ singularities of $f(t) = D(t, w_0)$ occur, that
is, we do not get $\partial^i D/\partial t^i(t_0, w_0) = 0$ for i = 1,2,3,4 except when $e(t_0)$ is a higher vertex of C
(and $w_0$ is the center of curvature of C at $e(t_0)$). (ii) CV(D) is the surface in $\mathbf{R} \times \mathbf{R}^2 = \mathbf{R}^3$
(coordinates (r, w)) whose section r = constant > 0 is the parallels to C at distance $\sqrt{r}$ from C.
(iii) The “bifurcation set”

$B(D) = \{w \in \mathbf{R}^2 : \text{for some } t,\ \partial D/\partial t = \partial^2 D/\partial t^2 = 0 \text{ at } (t, w)\}$

is the evolute of C, the locus of centers of curvature. The proofs of these facts just use the standard
formulae for plane curves connecting tangent, normal and curvature; see for example [9].
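
The description of the bifurcation set B(D) as the evolute can be checked numerically. The sketch below is only an illustration under our own assumptions: the ellipse e(t) = (a cos t, b sin t) with a = 2, b = 1 and all function names are hypothetical choices, not taken from the text. It computes the center of curvature w(t) from the standard formulae and confirms that both $\partial D/\partial t$ and $\partial^2 D/\partial t^2$ vanish at (t, w(t)).

```python
import math

def ellipse(t, a=2.0, b=1.0):
    """Return e(t), e'(t), e''(t) for the (assumed) ellipse (a cos t, b sin t)."""
    e = (a * math.cos(t), b * math.sin(t))
    d1 = (-a * math.sin(t), b * math.cos(t))
    d2 = (-a * math.cos(t), -b * math.sin(t))
    return e, d1, d2

def centre_of_curvature(t):
    """e(t) + N(t)/kappa(t), with N = (-y', x')/|e'| and kappa = (x'y'' - y'x'')/|e'|^3."""
    e, d1, d2 = ellipse(t)
    speed2 = d1[0] ** 2 + d1[1] ** 2
    cross = d1[0] * d2[1] - d1[1] * d2[0]
    scale = speed2 / cross  # equals 1 / (kappa * |e'|)
    return (e[0] - d1[1] * scale, e[1] + d1[0] * scale)

def d_derivatives(t, w):
    """First and second t-derivatives of D(t, w) = |e(t) - w|^2."""
    e, d1, d2 = ellipse(t)
    diff = (e[0] - w[0], e[1] - w[1])
    dt = 2 * (diff[0] * d1[0] + diff[1] * d1[1])
    dtt = 2 * (d1[0] ** 2 + d1[1] ** 2) + 2 * (diff[0] * d2[0] + diff[1] * d2[1])
    return dt, dtt

if __name__ == "__main__":
    for k in range(6):
        t = 0.5 * k
        print(t, d_derivatives(t, centre_of_curvature(t)))
```

Both derivatives should print as zero up to rounding, which is the defining condition for w(t) to lie in B(D).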


6. Versal Unfoldings. The family H of height functions on a curve satisfying the generic
property 4.9(i) is itself generic in a sense to be explained below. The same goes for the family D of
(5.3), assuming 4.9(ii). In the language of singularity theory H or D is called a versal unfolding and
this has interesting geometrical consequences, which we consider first. To cover both H and D
(not to mention other examples) we write our family as $F: S^1 \times W \to \mathbf{R}$ so that $W = S^1$ for
F = H; $W = \mathbf{R}^2$ for F = D. Consider a specific point $(t_0, w_0) \in S^1 \times W$ and let $f(t) = F(t, w_0)$.
As above, we shall say f has “type $A_m$ at $t_0$” iff $f^{(i)}(t_0)$ is zero for i = 1, ..., m and nonzero for
i = m + 1.

For F we define CV(F) and B(F) exactly as in the examples of § 5. 


(6.1) Suppose that F satisfies the genericity condition of being a “versal unfolding of f at $t_0$” (see
below). Then the sets CV(F) and B(F) are determined locally, up to a smooth change of coordinates
in $\mathbf{R} \times W$ and W respectively, by the number m.

By “locally” we mean that t, w and r are to be close to $t_0$, $w_0$ and $F(t_0, w_0)$ respectively. This
amounts to saying that we consider for H (resp. D) only the part of the dual curve (resp. evolute)
corresponding to points of C near $e(t_0)$.

The good thing about (6.1) is that we obtain an accurate picture of CV(F) or B(F), distorted 
only by smooth changes of coordinates, by considering any other versal unfolding with the same 
W and m. In practice we can, and do, choose very simple ones. It is clearly necessary to be able to 
recognize a versal unfolding when we see one; fortunately this is very easy, using the following 
standard result of singularity theory. 

Write $w_1, \ldots, w_n$ for local coordinates on W near $w_0$. Thus for height functions, n = 1 and
$w_1$ = angle coordinate θ, so we regard $S^1$ near $w_0 = (\cos\theta_0, \sin\theta_0)$ as the same as $\mathbf{R}$ near $\theta_0$. For
distance-squared functions we use ordinary cartesian coordinates $(w_1, w_2)$.

Given F, we can form $\partial F/\partial w_i(t, w_0)$ for i = 1, ..., n. This is a function of t only, and we can
write down its Taylor series at $t_0$, say

$g_i = a_{i1}(t - t_0) + a_{i2}(t - t_0)^2 + \cdots + a_{i,m-1}(t - t_0)^{m-1}$

where we omit the constant term and stop after m − 1 terms. There is the usual formula
$a_{ij} = (1/j!)(\partial^{j+1}F/\partial t^j \partial w_i)(t_0, w_0)$.


(6.2) F is a versal unfolding* of f at $t_0$ iff the n × (m − 1) matrix of coefficients $a_{ij}$ has rank
m − 1.


Notes. (1) If m = 1, then the condition is vacuous and automatically satisfied. 

(2) If n ≥ m − 1, then we would expect “almost any” F to be a versal unfolding, for matrices
have maximal rank unless the entries are unlucky enough to satisfy certain relations. Being a
versal unfolding is a generic property, so long as n ≥ m − 1.

(3) If n < m — 1, then F can never be versal. 
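
As a rough illustration of the rank criterion (6.2) in the simplest nontrivial case n = 1, m = 2, the matrix is the single number $a_{11} = (\partial^2 F/\partial t\,\partial\theta)(t_0, \theta_0)$. The sketch below estimates it by central differences. The curve e(t) = (t, t^3), the second family t^3 + θt^2, and every function name are our own hypothetical examples, not taken from the text.

```python
import math

def height(t, theta):
    """H(t, theta) = e(t) . (cos theta, sin theta) for the assumed curve e(t) = (t, t^3)."""
    return t * math.cos(theta) + t ** 3 * math.sin(theta)

def degenerate_family(t, theta):
    """A made-up family t^3 + theta * t^2 whose 1 x 1 matrix turns out to be (0)."""
    return t ** 3 + theta * t ** 2

def mixed_partial(F, t0, theta0, h=1e-3):
    """Central-difference estimate of d^2 F / (dt dtheta) at (t0, theta0)."""
    return (F(t0 + h, theta0 + h) - F(t0 + h, theta0 - h)
            - F(t0 - h, theta0 + h) + F(t0 - h, theta0 - h)) / (4 * h * h)

# Height functions on y = x^3 at the inflexion, with w_0 = (0, 1), i.e. theta_0 = pi/2.
a11_height = mixed_partial(height, 0.0, math.pi / 2)
# The made-up family at t_0 = theta_0 = 0.
a11_degenerate = mixed_partial(degenerate_family, 0.0, 0.0)

if __name__ == "__main__":
    print("height family:     a11 =", a11_height, "rank 1:", abs(a11_height) > 1e-6)
    print("degenerate family: a11 =", a11_degenerate, "rank 1:", abs(a11_degenerate) > 1e-6)
```

For the height family the coefficient is close to −1, so the rank is m − 1 = 1. For the made-up family it vanishes, so that family fails the criterion even though n = m − 1.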


(6.3) Let $e: S^1 \to \mathbf{R}^2$ satisfy the generic property (i) (resp. (ii)) of (4.9). Then the family H of
height functions (resp. D of distance-squared functions) is always a versal unfolding.


Proof. We consider only the case of H and leave the reader to try the (slightly harder!) D. By
(5.2) we need only consider m = 1 and m = 2, and by note (1) we can stick to m = 2. Write

$w = (\cos\theta, \sin\theta), \quad e(t) = (X(t), Y(t))$

so that

$H(t, \theta) = X(t)\cos\theta + Y(t)\sin\theta.$

Thus

$\partial H/\partial\theta = -X(t)\sin\theta + Y(t)\cos\theta$

and

$g_1 = (-X'(t_0)\sin\theta_0 + Y'(t_0)\cos\theta_0)(t - t_0).$

We just need to know that the coefficient of $(t - t_0)$ is nonzero. Now from m = 2 we know
$\partial H/\partial t(t_0, \theta_0) = 0$ (also $\partial^2 H/\partial t^2 = 0$, but we don’t need to use that), so

$X'(t_0)\cos\theta_0 + Y'(t_0)\sin\theta_0 = 0.$

Since $X'(t_0)$ and $Y'(t_0)$ won’t both be zero ($e'(t)$ is never 0 for an embedding e) this easily
implies that the required coefficient is nonzero: otherwise $(X'(t_0), Y'(t_0))$ would be orthogonal to
both of the perpendicular unit vectors $(\cos\theta_0, \sin\theta_0)$ and $(-\sin\theta_0, \cos\theta_0)$, and hence zero.

*Our family F is a family of “potential functions,” that is, we ignore additive constants, as is appropriate for
F = H or F = D. There is a slightly different result when constants are important.

Consider the family $F(t, \theta) = t^2$, for which n = 1, and for this family take $t_0 = 0$, $\theta_0 = 0$ (so
$w_0 = (1,0)$). Then f(t) = F(t,0) has type $A_1$ (m = 1) at 0 and this is automatically a versal
unfolding. Further

$CV(F) = \{(r, \theta) : \text{for some } t,\ 2t = 0 \text{ and } t^2 = r\}$

and close to (0,0) this is just a little interval of the θ-axis: the local picture of CV(F) is a piece of
straight line in a plane. A smooth change of coordinates in the plane will take this to a piece of
smooth curve (Fig. 18).


Fig. 18. Critical value set, type $A_1$: smooth. (Panels: CV(H) and CV(F).)


For the family H of height functions on a curve C, the condition m = 1 means that C does not
have an inflexion at the relevant point $e(t_0)$ (see (5.2)). Thus the part of C close to $e(t_0)$ has dual
which is a piece of smooth curve, by (6.1).

More interesting is the case m = 2. For this consider $F(t, \theta) = t^3 + \theta t$, where again we take
$t_0 = \theta_0 = 0$ for this F. Certainly $f(t) = F(t, 0) = t^3$ has type $A_2$ (m = 2) at 0; also $\partial F/\partial\theta = t$, so
the matrix of (6.2) is the 1 × 1 matrix (1). Hence F is versal. Further

$CV(F) = \{(r, \theta) : 3t^2 + \theta = 0,\ t^3 + \theta t = r \text{ for some } t\}$

which comes to $\{(r, \theta) : 27r^2 + 4\theta^3 = 0\}$. Any smooth change of coordinates takes this to what is
called an ordinary cusp in the plane (Fig. 19). Since, for height functions on C, m = 2 means that
the curve C has an ordinary inflexion at the relevant point $e(t_0)$ (see (5.2)) we have:
Fig. 19. Critical value set, type $A_2$: cusp. (Panels: CV(H) and CV(F).)


(6.4) For a plane curve C satisfying the generic property 4.9(i) the dual (or the pedal away from
(0,0)) is a smooth curve except for the presence of ordinary cusps corresponding with ordinary
inflexions of C.
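
The cusp equation used for the model family $F(t, \theta) = t^3 + \theta t$ can be confirmed by eliminating t exactly. The following sketch (function name ours) parametrizes CV(F) by t, using $\theta = -3t^2$ from $\partial F/\partial t = 0$, and checks $27r^2 + 4\theta^3 = 0$ in rational arithmetic.

```python
from fractions import Fraction

def cv_point(t):
    """Point (r, theta) of CV(F) for F = t^3 + theta*t, parametrized by t."""
    theta = -3 * t ** 2          # solves dF/dt = 3t^2 + theta = 0
    r = t ** 3 + theta * t       # critical value F(t, theta) = -2 t^3
    return r, theta

def cusp_residual(t):
    """Left side of 27 r^2 + 4 theta^3 = 0 at the point with parameter t."""
    r, theta = cv_point(Fraction(t))
    return 27 * r ** 2 + 4 * theta ** 3

if __name__ == "__main__":
    for t in (Fraction(-2), Fraction(-1, 3), Fraction(0), Fraction(5, 7)):
        print(t, cv_point(t), cusp_residual(t))
```

Since r = −2t^3 changes sign with t while θ = −3t^2 does not, the two branches meet at the origin with a common tangent, which is the cusp shape.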


Compare Fig. 17, noting that there may be self-crossings on the dual, corresponding to double
tangents to C. These are not part of the local structure: each branch at the crossing is smooth.

A similar analysis using the distance-squared functions D and the bifurcation set B(D) shows
the following.


(6.5) For a plane curve C satisfying the generic property 4.9(ii) the evolute is a smooth curve
except for the presence of ordinary cusps corresponding with ordinary vertices of C.


Adaptations of the methods above show that the property 4.9(i) together with the absence of 
tritangents is generic (but not the latter by itself—can you see why?). This implies that duals 
generically have no triple self-crossings. Double self-crossings, as in Fig. 17, are stable under small 
perturbations of C, but triple crossings split up into three double crossings. 

Other families of functions are connected with the study of envelopes, and one can study 
singular points (e.g., cusps) of these envelopes by analogous methods. We give a good deal of 
detail on this in [7]. 
The following results of the forty-third William Lowell Putnam Mathematical Competition,
held on December 4, 1982, have been determined in accordance with the governing regulations. 
This annual contest is supported by the William Lowell Putnam Prize Fund for the Promotion of 
Scholarship, left by Mrs. Putnam in memory of her husband, and is held under the auspices of the 
Mathematical Association of America. 

The first prize, five thousand dollars, was awarded to the Department of Mathematics of 
Harvard University, Cambridge, Massachusetts. The members of its winning team were: Benji N. 
Fisher, Michael J. Larsen, and Michael Raship; each was awarded a prize of two hundred fifty 
dollars. 

The second prize, two thousand five hundred dollars, was awarded to the Department of 
Mathematics of the University of Waterloo, Waterloo, Ontario, Canada. The members of its team 
were: David W. Ash, W. Ross Brown, and Herbert J. Fichtner; each was awarded a prize of two 
hundred dollars. 

The third prize, one thousand five hundred dollars, was awarded to the Department of 
Mathematics of the California Institute of Technology, Pasadena, California. The members of its 
team were: Bradley W. Brock, Scott R. Johnson, and Zinovy B. Reichstein; each was awarded a 
prize of one hundred fifty dollars. 

The fourth prize, one thousand dollars, was awarded to the Department of Mathematics of 
Yale University, New Haven, Connecticut. The members of its team were Alan S. Edelman, Paul 
N. Feldman, and Nathaniel E. Glasser; each was awarded a prize of one hundred dollars. 

The fifth prize, five hundred dollars, was awarded to the Department of Mathematics of 
Princeton University, Princeton, New Jersey. The members of its team were Gregg N. Patruno, 
David P. Roberts, and Daniel J. Scales; each was awarded a prize of fifty dollars. 

The five highest-ranking individual contestants, in alphabetical order, were David W. Ash, 
University of Waterloo; Eric D. Carlson, Michigan State University; Noam D. Elkies, Columbia 
University; Brian R. Hunt, University of Maryland, College Park; and Edward A. Shpiz, 
Washington University, St. Louis. Each of these students was designated a Putnam Fellow by the 
Mathematical Association of America and awarded a prize of five hundred dollars by the Putnam 
Prize Fund. 

The next six highest-ranking individuals, in alphabetical order, were Joel Friedman, Harvard 
University; Irwin L. Jungreis, Cornell University; Michael J. Larsen, Harvard University; Gregg 
N. Patruno, Princeton University; Robin A. Pemantle, University of California, Berkeley; and 
Mark G. Pleszkoch, University of Virginia. Each of these students was awarded a prize of two 
hundred fifty dollars. 

The following teams, named in alphabetical order, received honorable mention: Brown University,
with team members Stephen A. DiPippo, Edward F. Grove, and Erik R. Paulsen; University
of California, Berkeley, with team members John W. Jones, Robin A. Pemantle, and Richard W.
Webb; Michigan State University, with team members Eric D. Carlson, Erin J. Schram, and Frank
J. Sottile; University of Toronto, with team members John J. Chew, John J. Im, and Alastair M.
Rucklidge; and Washington University, St. Louis, with team members Bard Bloom, Edward A.
Shpiz, and Richard A. Stong.
Honorable mention was achieved by the following thirty-three individuals, named in alphabetical
order: Gary M. Bernstein, Princeton University; Bradley W. Brock, California Institute of
Technology; W. Ross Brown, University of Waterloo; Bev I. Cope, University of Waterloo; 
Charles J. Cuny, California Institute of Technology; Michael J. Dawson, Massachusetts Institute 
of Technology; David L. Desjardins, Massachusetts Institute of Technology; Stephen A. DiPippo, 
Brown University; Paul N. Feldman, Yale University; Michael V. Finn, Harvard University; 
Zachary M. Franco, Harvard University; Tom Ilmanen, Haverford College; John J. Im, Univer- 
sity of Toronto; Scott R. Johnson, California Institute of Technology; Kin Y. Li, University of 
Washington; F. Miller Maley, Amherst College; Yair N. Minsky, Princeton University; Evan W. 
Morton, Harvard University; Alan G. Murray, California Institute of Technology; Laurence E. 
Penn, Harvard University; Jeremy D. Primer, Princeton University; Michael Raship, Harvard 
University; David P. Roberts, Princeton University; James R. Roche, University of Notre Dame; 
Richard A. Shapiro, Massachusetts Institute of Technology; Carlos T. Simpson, Harvard Univer- 
sity; Arthur P. Smith, Memorial University of Newfoundland; Thomas R. Stevenson, University 
of British Columbia; Richard A. Stong, Washington University, St. Louis; John M. Sullivan, 
Harvard University; Jerome V. Walsh, University of Illinois at Urbana-Champaign; Dwight S. 
Wilson, Johns Hopkins University; and David Wolland, Harvard University. 

The other individuals who achieved ranks among the top 100, in alphabetical order of their 
schools, were: University of Alberta, Arthur B. Baragar, Robert P. Morewood; University of 
Arizona, Jan M. O. Soderkvist; Biola University, Mark M. Shimozono; University of British 
Columbia, Edmond D. Chow; Brown University, Erik R. Paulsen; California Institute of Technol- 
ogy, Pang-Chieh Chen, R. Sekhar Chivukula, James T. Lin, Vladimir S. Matijasevic, Zinovy B. 
Reichstein; University of California at Santa Barbara, John R. Kelly; Carleton University, Mike 
D. Dixon; Case Western Reserve University, Kevin E. Kelso; University of Chicago, Keith A. 
Ramsay, Yongbum Park, Michael P. Spertus, David S. Yuen; University of Colorado, Boulder, 
Boris Lerner; Harvard University, Frederick R. Adler, Bruce W. K. Brandt, Benji N. Fisher, 
Howard M. Pollack, Gregory B. Sorkin; Haverford College, Samuel R. Evens; Iowa State
University, William R. Somsky; Université Laval, Line Baribeau; University of Maryland, College 
Park, Dougin A. Walker; Massachusetts Institute of Technology, David E. Brahm, Andrew E. 
Gelman, Daniel S. Lewart, Tomasz S. Mrowka; Michigan State University, Frank J. Sottile; 
University of Michigan, Ann Arbor, David A. Short; University of Michigan, Dearborn, Gregory 
T. Parker; University of New Brunswick, Christian Friesen; North Carolina State University, 
Samuel P. White; Pomona College, Alan M. Nadel; Princeton University, Kazuko Suzuki, Burt J. 
Totaro, Kevin M. Walker; Queen’s College of the City University of New York, Boris Aronov; 
Reed College, Paul S. Hsieh; Rensselaer Polytechnic Institute, Benjamin J. Patz, William J. 
Harte; Rose-Hulman Institute of Technology, Randy L. Ekl; Rutgers University, Scott E. 
Axelrod; Simon Fraser University, David G. Wagner; Sioux Falls College, William H. Paulsen; 
Stanford University, Cris G. Poor; University of Toronto, John J. Chew, Peter A. Miegom;
Washington University, St. Louis, Patrick J. Abegg, Paul H. Burchard; University of Waterloo,
Herbert J. Fichtner, Charles R. Smith, Charles S. A. Timar; University of Wisconsin, Jon G.
Udell; Yale University, Alan S. Edelman, Nathaniel E. Glasser. 

There were 2024 individual contestants from 348 colleges and universities in Canada and the 
United States in the competition of December 4, 1982. Teams were entered by 249 institutions. 

The Questions Committee for the forty-third competition consisted of William J. Firey 
(Chairman), Douglas A. Hensley, and Melvin Hochster; they composed the problems listed below 
and were most prominent among those suggesting solutions. 


PROBLEMS 
Problem A-1 


Let V be the region in the cartesian plane consisting of all points (x, y) satisfying the simultaneous conditions 


$|x| \le y \le |x| + 3$ and $y \le 4$.


548 L. F. KLOSINSKI, G. L. ALEXANDERSON, AND A. P. HILLMAN [October 


Find the centroid $(\bar{x}, \bar{y})$ of V.
Problem A-2 
For positive real x, let

$B_n(x) = 1^x + 2^x + 3^x + \cdots + n^x.$

Prove or disprove the convergence of

$\sum_{n=2}^{\infty} \frac{B_n(\log_n 2)}{(n \log_2 n)^2}.$


Problem A-3 
Evaluate

$\int_0^{\infty} \frac{\operatorname{Arctan}(\pi x) - \operatorname{Arctan} x}{x}\, dx.$

Problem A-4 


Assume that the system of simultaneous differential equations 


$y' = -z^3, \qquad z' = y^3$


with the initial conditions y(0) = 1, z(0) = 0 has a unique solution y = f(x), z = g(x) defined for all real x. Prove 
that there exists a positive constant L such that for all real x, 


$f(x + L) = f(x), \qquad g(x + L) = g(x).$


Problem A-5 


Let a, b, c, and d be positive integers and

$r = 1 - \frac{a}{b} - \frac{c}{d}.$

Given that $a + c \le 1982$ and r > 0, prove that

$r > \frac{1}{1983^3}.$


Problem A-6 


Let σ be a bijection of the positive integers, that is, a one-to-one function from {1,2,3,...} onto itself. Let
$x_1, x_2, x_3, \ldots$ be a sequence of real numbers with the following three properties:

(i) $|x_n|$ is a strictly decreasing function of n;
(ii) $|\sigma(n) - n| \cdot |x_n| \to 0$ as $n \to \infty$;
(iii) $\lim_{n \to \infty} \sum_{k=1}^{n} x_k = 1$.

Prove or disprove that these conditions imply that

$\lim_{n \to \infty} \sum_{k=1}^{n} x_{\sigma(k)} = 1.$


Problem B-1 


Let M be the midpoint of side BC of a general △ABC. Using the smallest possible n, describe a method for cutting
△AMB into n triangles which can be reassembled to form a triangle congruent to △AMC.


Problem B-2 


Let A(x, y) denote the number of points (m, n) in the plane with integer coordinates m and n satisfying
$m^2 + n^2 \le x^2 + y^2$. Let $g = \sum_{k=0}^{\infty} e^{-k^2}$. Express

$\int_{-\infty}^{\infty} \int_{-\infty}^{\infty} A(x, y)\, e^{-x^2 - y^2}\, dx\, dy$

as a polynomial in g.


Problem B-3 


Let $p_n$ be the probability that c + d is a perfect square when the integers c and d are selected independently at
random from the set {1,2,3,..., n}. Show that $\lim_{n \to \infty} (p_n \sqrt{n})$ exists and express this limit in the form $r(\sqrt{s} - t)$,
where s and t are integers and r is a rational number.


Problem B-4 
Let $n_1, n_2, \ldots, n_s$ be distinct integers such that

$(n_1 + k)(n_2 + k) \cdots (n_s + k)$

is an integral multiple of $n_1 n_2 \cdots n_s$ for every integer k. For each of the following assertions, give a proof or a
counterexample:
(a) $|n_i| = 1$ for some i.
(b) If further all $n_i$ are positive, then
$\{n_1, n_2, \ldots, n_s\} = \{1, 2, \ldots, s\}$.
Problem B-5 


For each $x > e^e$ define a sequence $S_x = u_0, u_1, u_2, \ldots$ recursively as follows: $u_0 = e$, while for $n \ge 0$, $u_{n+1}$ is the
logarithm of x to the base $u_n$. Prove that $S_x$ converges to a number g(x) and that the function g defined in this way
is continuous for $x > e^e$.


Problem B-6 


Let K(x, y, z) denote the area of a triangle whose sides have lengths x, y, and z. For any two triangles with sides 
a, b, c and a’, b’, c’, respectively, prove that 


$\sqrt{K(a, b, c)} + \sqrt{K(a', b', c')} \le \sqrt{K(a + a', b + b', c + c')}$


and determine the cases of equality. 


SOLUTIONS 


In the 12-tuples $(n_{10}, n_9, \ldots, n_0, n_{-1})$ following each problem number below, $n_i$ for $10 \ge i \ge 0$
is the number of students among the top 201 contestants achieving i points for the problem and
$n_{-1}$ is the number of those not submitting solutions.


A-1. (146, 9, 2, 5,0, 0,0, 5, 2, 9, 22, 1) 


Let T consist of the points inside or on the triangle with vertices at (0, 3), (−1, 4), (1, 4) and let U
be the set of points inside or on the triangle with vertices at (0, 0), (−4, 4), (4, 4). Then T and V
overlap only on boundary points and their union is U. The centroids of T and U are (0, 11/3) and
(0, 8/3), respectively. The areas of T, V, and U are 1, 15, and 16, respectively. Using weighted
averages with the areas as weights, one has

$1 \cdot 0 + 15\bar{x} = 16 \cdot 0, \qquad 1 \cdot \tfrac{11}{3} + 15\bar{y} = 16 \cdot \tfrac{8}{3}.$

It follows that $\bar{x} = 0$, $\bar{y} = 13/5$.
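
The weighted-average step can be reproduced in exact arithmetic. This is a small sketch with helper names of our own choosing; it recomputes the areas and centroids of T and U from their vertices and then solves for the centroid of V.

```python
from fractions import Fraction

def triangle_area_and_centroid(p, q, r):
    """Area (shoelace formula) and centroid of the triangle pqr, computed exactly."""
    twice_area = abs((q[0] - p[0]) * (r[1] - p[1]) - (r[0] - p[0]) * (q[1] - p[1]))
    area = Fraction(twice_area, 2)
    centroid = (Fraction(p[0] + q[0] + r[0], 3), Fraction(p[1] + q[1] + r[1], 3))
    return area, centroid

area_T, c_T = triangle_area_and_centroid((0, 3), (-1, 4), (1, 4))
area_U, c_U = triangle_area_and_centroid((0, 0), (-4, 4), (4, 4))

# V is U with T removed, so area_T * c_T + area_V * c_V = area_U * c_U.
area_V = area_U - area_T
centroid_V = tuple((area_U * u - area_T * t) / area_V for u, t in zip(c_U, c_T))

if __name__ == "__main__":
    print(area_T, area_U, area_V, centroid_V)
```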
A-2. (68,17, 18, 7,0, 3, 1,0, 10,2, 37, 38) 
Since $x = \log_n 2 > 0$, $B_n(x) = 1^x + 2^x + \cdots + n^x \le n \cdot n^x$ and

$0 < \frac{B_n(\log_n 2)}{(n \log_2 n)^2} \le \frac{n \cdot n^{\log_n 2}}{(n \log_2 n)^2} = \frac{2}{n (\log_2 n)^2}.$

As $\sum_{n=2}^{\infty} [2/n(\log_2 n)^2]$ converges by the integral test, the given series converges by the comparison
test.


A-3. (14,19, 11,5,0,0,0,2, 5, 0, 37, 108) 


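$\int_0^{\infty} \frac{\operatorname{Arctan}(\pi x) - \operatorname{Arctan} x}{x}\, dx = \int_0^{\infty} \frac{1}{x} \Big[\operatorname{Arctan}(ux)\Big]_{u=1}^{u=\pi} dx$

$= \int_0^{\infty} \int_1^{\pi} \frac{1}{1 + (xu)^2}\, du\, dx = \int_1^{\pi} \int_0^{\infty} \frac{1}{1 + (xu)^2}\, dx\, du$

$= \int_1^{\pi} \frac{1}{u} \cdot \frac{\pi}{2}\, du = \frac{\pi}{2} \ln \pi.$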
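
As a sanity check on the value $(\pi/2)\ln\pi$, the integral can be approximated numerically. The sketch below is our own illustration, not part of the published solution: it substitutes $x = e^s$, so that $dx/x = ds$, and applies the midpoint rule on a truncated interval; the truncation limits and step count are arbitrary assumptions.

```python
import math

def a3_integral(lo=-40.0, hi=40.0, steps=8000):
    """Midpoint-rule estimate of the A-3 integral after the substitution x = exp(s)."""
    h = (hi - lo) / steps
    total = 0.0
    for k in range(steps):
        x = math.exp(lo + (k + 0.5) * h)
        # The integrand (Arctan(pi x) - Arctan x)/x dx becomes this expression times ds.
        total += math.atan(math.pi * x) - math.atan(x)
    return total * h

if __name__ == "__main__":
    print(a3_integral(), 0.5 * math.pi * math.log(math.pi))
```

The integrand decays exponentially in s in both directions, so the truncated interval loses a negligible amount.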


A-4. (0,2,4,5,0,0,0, 11,9, 22, 33, 115) 
The differential equations imply that

$y^3 y' + z^3 z' = z'y' - y'z' = 0$

and hence that $y^4 + z^4$ is constant. This and the initial conditions give $y^4 + z^4 = 1$. Thinking of x
as a time variable and (y, z) as the coordinates of a point in a plane, this point moves on the
curve $y^4 + z^4 = 1$ with speed

$[(y')^2 + (z')^2]^{1/2} = (z^6 + y^6)^{1/2}.$

At any time, either $y^4$ or $z^4$ is at least 1/2 and so the speed is at least $\{[(1/2)^{1/4}]^6\}^{1/2}$. Hence the point
will go completely around the finite curve in some time L. As the speed depends only on y and z
(and not on x), the motion is periodic with period L.
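
The conserved quantity $y^4 + z^4$ can be observed numerically. The following sketch integrates the system with a classical fourth-order Runge-Kutta step; the integrator, step size, and function names are our own choices and are not part of the published solution.

```python
def rhs(y, z):
    """Right-hand side of the system y' = -z^3, z' = y^3."""
    return -z ** 3, y ** 3

def rk4_step(y, z, h):
    """One classical Runge-Kutta step of size h."""
    k1y, k1z = rhs(y, z)
    k2y, k2z = rhs(y + 0.5 * h * k1y, z + 0.5 * h * k1z)
    k3y, k3z = rhs(y + 0.5 * h * k2y, z + 0.5 * h * k2z)
    k4y, k4z = rhs(y + h * k3y, z + h * k3z)
    return (y + h * (k1y + 2 * k2y + 2 * k3y + k4y) / 6,
            z + h * (k1z + 2 * k2z + 2 * k3z + k4z) / 6)

def integrate(x_end, h=1e-3):
    """Integrate from x = 0 with y(0) = 1, z(0) = 0 up to x_end."""
    y, z = 1.0, 0.0
    for _ in range(round(x_end / h)):
        y, z = rk4_step(y, z, h)
    return y, z

if __name__ == "__main__":
    y, z = integrate(10.0)
    print(y, z, y ** 4 + z ** 4)
```

The printed value of $y^4 + z^4$ should stay very close to 1, in line with the first step of the argument.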


A-5. (22,7,0, 1, 0,0, 0,0, 5, 4, 48, 114) 
We are given that

$r = 1 - \frac{a}{b} - \frac{c}{d} = \frac{bd - ad - bc}{bd} > 0.$

Thus bd − ad − bc is a positive integer and so $r \ge 1/bd$. We may assume without loss of
generality that $b \le d$. If $b \le d \le 1983$, $r \ge 1983^{-2} > 1983^{-3}$. Since $a + c \le 1982$, if $1983 \le b \le d$,
one has

$r \ge 1 - \frac{a}{1983} - \frac{c}{1983} \ge \frac{1}{1983} > \frac{1}{1983^3}.$

The remaining case is that with b < 1983 < d. Then the d that minimizes r for fixed a, b, c is
1 + [bc/(b − a)], where [x] is the greatest integer in x. This d is at most 1983b since $b - a \ge 1$
and c < 1982 and thus

$r \ge \frac{1}{bd} \ge \frac{1}{1983\, b^2} > \frac{1}{1983^3}.$

Hence we have the desired inequality in all cases.
A-6. (0,0,0,0, 4, 1,2, 0, 5, 4,50, 135) 


We disprove the assertion. Let $y_n = 1/((n + 1)\ln(n + 1))$. Then $\sum_{n=1}^{\infty} (-1)^{n+1} y_n$ converges to some
g > 0 since $y_n \to 0$ as $n \to \infty$ and $y_1 > y_2 > \cdots$. Let $x_n = (-1)^{n+1} y_n / g$. Then conditions (i) and
(iii) are satisfied. Let $a_0, a_1, \ldots$ be positive integers to be made more definite later. Let $b_0 = 0$ and
$b_{i+1} = b_i + 4a_i$ for i = 0, 1, .... The bijection σ is defined as follows:

$\sigma(n) = 2n - 1 - b_i$ for $b_i < n \le b_i + 2a_i$,

$\sigma(n) = 2n - b_{i+1}$ for $b_i + 2a_i < n \le b_{i+1}$.

Then $0 < \sigma(n) < 2n$ and hence $|\sigma(n) - n| < n$. Thus

$|\sigma(n) - n| \cdot |x_n| < \frac{n}{g (n + 1) \ln(n + 1)}$

which implies condition (ii). Let $C(n) = \sum_{i=1}^{n} x_i$ and $D(n) = \sum_{i=1}^{n} x_{\sigma(i)}$. Then

(A) $D(b_i + 2a_i) - C(b_i + 2a_i) = \frac{1}{g} \sum_{j=1}^{a_i} y_{b_i + 2j} + \frac{1}{g} \sum_{k=1}^{a_i} y_{b_i + 2a_i + 2k - 1}.$

Since $y_2 + y_4 + y_6 + \cdots$ diverges to +∞ by the integral test, the $a_i$ can be chosen large enough
so that the first sum in (A) exceeds 1 for each i. Then D(n) > 1 + C(n) for an unbounded
sequence of n’s. Hence D(n) and C(n) cannot converge to the same limit.
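
The block structure of σ and identity (A) can be checked on a single block. The sketch below is only an illustration: the function names are ours, the block parameters passed in are arbitrary, and the positive constant g is dropped, so it compares g times each side of (A). It assumes b is even, as every $b_i$ is a multiple of 4.

```python
import math

def y(n):
    """y_n = 1 / ((n + 1) ln(n + 1)), as in the solution."""
    return 1.0 / ((n + 1) * math.log(n + 1))

def x_scaled(n):
    """g * x_n = (-1)^(n+1) y_n; the constant g > 0 is left out."""
    return y(n) if n % 2 == 1 else -y(n)

def sigma_block(b, a):
    """sigma restricted to the block b < n <= b + 4a, returned as a dict."""
    sigma = {}
    for n in range(b + 1, b + 2 * a + 1):
        sigma[n] = 2 * n - 1 - b
    for n in range(b + 2 * a + 1, b + 4 * a + 1):
        sigma[n] = 2 * n - (b + 4 * a)
    return sigma

def identity_A(b, a):
    """Return g * (left side) and g * (right side) of (A) for the block starting at b."""
    sigma = sigma_block(b, a)
    # Earlier blocks are permuted within themselves, so they cancel in D - C.
    left = sum(x_scaled(sigma[n]) - x_scaled(n) for n in range(b + 1, b + 2 * a + 1))
    right = (sum(y(b + 2 * j) for j in range(1, a + 1))
             + sum(y(b + 2 * a + 2 * k - 1) for k in range(1, a + 1)))
    return left, right

if __name__ == "__main__":
    print(sorted(sigma_block(8, 3).values()))
    print(identity_A(8, 3))
```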

B-1. (56, 46, 23,0, 0,0, 0, 0, 18, 5,5, 48) 


The smallest n is 2. Let D be the midpoint of side AB. Cut △AMB along DM. Then △BMD can be
placed alongside △ADM, with side BD atop side AD, so as to form a triangle congruent to
△AMC. Since △AMB need not be congruent to △AMC in a general △ABC, there is no method
with n = 1.


B-2. (5, 28, 16, 3, 2, 0, 0, 0, 4, 8, 32, 103) 


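Let $r = \sqrt{x^2 + y^2}$, $R(m, n) = \{(x, y) : m^2 + n^2 \le x^2 + y^2\}$, and

$I = \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} A(x, y)\, e^{-x^2 - y^2}\, dx\, dy.$

Let $\Sigma$ and $\Sigma'$ denote sums over all integers m and over all integers n, respectively. Then

$I = \Sigma \Sigma' \iint_{R(m, n)} e^{-x^2 - y^2}\, dx\, dy$

$= \Sigma \Sigma' \int_0^{2\pi} \int_{\sqrt{m^2 + n^2}}^{\infty} e^{-r^2}\, r\, dr\, d\theta$

$= \Sigma \Sigma' \int_0^{2\pi} \Big[ -\tfrac{1}{2} e^{-r^2} \Big]_{\sqrt{m^2 + n^2}}^{\infty} d\theta$

$= \Sigma \Sigma' \int_0^{2\pi} \tfrac{1}{2} e^{-m^2 - n^2}\, d\theta$

$= \Sigma \Sigma' \pi e^{-m^2 - n^2} = \pi \big( \Sigma e^{-m^2} \big) \big( \Sigma' e^{-n^2} \big) = \pi (2g - 1)^2.$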


B-3. (82, 18, 5,0,0,0, 0,3, 8, 6, 40, 39) 
Let $a(n) = [\sqrt{n + 1}]$ and $b(n) = [\sqrt{2n}]$, where [x] is the greatest integer in x. For t in
{1, 2, ..., a(n)}, there are $t^2 - 1$ ordered pairs (c, d) with c and d in X = {1, 2, ..., n} and
$c + d = t^2$. For t in {1 + a(n), 2 + a(n), ..., b(n)}, there are $2n + 1 - t^2$ ordered pairs (c, d)
with c and d in X and $c + d = t^2$. Hence the total number F(n) of favorable (c, d) is

$F(n) = \sum_{t=1}^{a(n)} (t^2 - 1) + \sum_{t=1+a(n)}^{b(n)} (2n + 1 - t^2)$

$= 2 \Big[ \sum_{t=1}^{a(n)} t^2 \Big] - \Big[ \sum_{t=1}^{b(n)} t^2 \Big] - a(n) + [b(n) - a(n)](2n + 1)$

$= \frac{2a(n)[1 + a(n)][1 + 2a(n)]}{6} - \frac{b(n)[1 + b(n)][1 + 2b(n)]}{6} - 2(n + 1)a(n) + (2n + 1)b(n).$

Since $p_n = F(n)/n^2$,

$\lim_{n \to \infty} (p_n \sqrt{n}) = \lim_{n \to \infty} F(n)/n^{3/2}$

$= \frac{4}{6} \lim_{n \to \infty} \Big( \frac{a(n)}{\sqrt{n}} \Big)^3 - \frac{2}{6} \lim_{n \to \infty} \Big( \frac{b(n)}{\sqrt{n}} \Big)^3 - 2 \lim_{n \to \infty} \frac{a(n)}{\sqrt{n}} + 2 \lim_{n \to \infty} \frac{b(n)}{\sqrt{n}}$

$= \frac{2}{3} - \frac{2\sqrt{2}}{3} - 2 + 2\sqrt{2} = \frac{4}{3}(\sqrt{2} - 1).$
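
The closed form for F(n) can be compared with a direct count, and the limit can be estimated for a large n. This is a brief sketch with function names of our own; `math.isqrt` plays the role of the greatest-integer square roots a(n) and b(n).

```python
import math

def favourable_brute(n):
    """Count ordered pairs (c, d) in {1..n}^2 with c + d a perfect square."""
    squares = {t * t for t in range(1, math.isqrt(2 * n) + 1)}
    return sum(1 for c in range(1, n + 1) for d in range(1, n + 1) if c + d in squares)

def favourable_formula(n):
    """Closed form for F(n) from the solution, with a = [sqrt(n+1)], b = [sqrt(2n)]."""
    a = math.isqrt(n + 1)
    b = math.isqrt(2 * n)
    sums = (2 * a * (1 + a) * (1 + 2 * a) - b * (1 + b) * (1 + 2 * b)) // 6
    return sums - 2 * (n + 1) * a + (2 * n + 1) * b

if __name__ == "__main__":
    print([favourable_brute(n) == favourable_formula(n) for n in range(1, 11)])
    n = 10 ** 6
    print(favourable_formula(n) / n ** 1.5, 4 / 3 * (math.sqrt(2) - 1))
```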
B-4.  (2,7,5, 1, 13, 3,3, 1,0, 2, 36, 128) 
Let $P_k = (n_1 + k)(n_2 + k) \cdots (n_s + k)$. We are given that $P_0 \mid P_k$ for all integers k.

(a) $P_0 \mid P_{-1}$ and $P_0 \mid P_1$ together imply $P_0^2 \mid (P_{-1}P_1)$ or
$(n_1^2 n_2^2 \cdots n_s^2) \mid [(n_1^2 - 1)(n_2^2 - 1) \cdots (n_s^2 - 1)]$.
No $n_i$ can be zero since $P_k \ne 0$ for k sufficiently large. Thus, for each i, $n_i^2 \ge 1$ and $n_i^2 > n_i^2 - 1 \ge 0$.
Hence $P_0^2 > P_{-1}P_1 \ge 0$. This and $P_0^2 \mid (P_{-1}P_1)$ imply $P_{-1}P_1 = 0$. Then for some i, $|n_i| = 1$.

(b) $P_k$ is a polynomial in k of degree s. Since $P_0$ divides each $P_i$, $P_0$ also divides the sth
difference

$\sum_{i=0}^{s} (-1)^i \binom{s}{i} P_i = (-1)^s s!.$

Since $P_0 > 0$, this means that $P_0 \le s!$. As $P_0$ is a product of s distinct positive integers, it
follows that
$\{n_1, n_2, \ldots, n_s\} = \{1, 2, \ldots, s\}$.
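
The key fact in part (b), that the sth difference of the monic degree-s polynomial $P_k$ has absolute value s!, is easy to test. The sketch below uses a function name and sample integer lists of our own choosing.

```python
from math import comb, factorial

def sth_difference(ns):
    """Sum over i = 0..s of (-1)^i C(s, i) P_i, where P_k is the product of (n + k) over ns."""
    s = len(ns)

    def P(k):
        product = 1
        for n in ns:
            product *= n + k
        return product

    return sum((-1) ** i * comb(s, i) * P(i) for i in range(s + 1))

if __name__ == "__main__":
    for ns in ([1, 2, 3], [2, 5, 7, 11], [-4, 9]):
        print(ns, sth_difference(ns), factorial(len(ns)))
```

The value depends only on s, not on the particular integers, which is why $P_0$ must divide s!.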
B-5. (2,3, 1,2, 3, 1,0, 4, 9, 23, 44, 109) 
Since the derivative of $x^{1/x}$ is negative for x > e,

(1) $a^b > b^a$ when $e \le a < b$.

The u’s are defined so that $u_0 = e$ and

(2) $x = (u_0)^{u_1} = (u_1)^{u_2} = (u_2)^{u_3} = \cdots$.

Hence

(3) $u_{n+1} = (u_n \ln u_{n-1})/\ln u_n$.

As $x > e^e$, $u_1 = \ln x > e = u_0$. Now $u_1 > u_0$ implies $\ln u_1 > \ln u_0$ and then (3) with n = 1
implies $u_2 < u_1$. Also, (2) and (1) imply $(u_1)^{u_2} = (u_0)^{u_1} > (u_1)^{u_0}$, which gives us $u_2 > u_0$. Now
$u_2 < u_1$ and (3) with n = 2 imply $u_3 > u_2$. Also (2) and (1) imply $(u_2)^{u_3} = (u_1)^{u_2} < (u_2)^{u_1}$ and
hence $u_3 < u_1$. Similarly, $u_2 < u_4 < u_3$. Then an easy induction shows that

$e < u_{2n} < u_{2n+2} < u_{2n+1} < u_{2n-1}$ for n = 1, 2, ....

Thus the monotonic bounded sequences $u_0, u_2, u_4, \ldots$ and $u_1, u_3, u_5, \ldots$ have limits a and b,
respectively, with $e < a \le b$. Also

$a^b = \lim_{n \to \infty} (u_{2n})^{u_{2n+1}} = x = \lim_{n \to \infty} (u_{2n-1})^{u_{2n}} = b^a.$

Then $a^b = b^a$, $e < a \le b$, and (1) imply a = b. Hence $\lim_{n \to \infty} u_n$ exists and is the unique real
number g = g(x) with g > e and $g^g = x$. Since $f(y) = y^y$ is continuous and strictly increasing for
y > e, its inverse function g(x) is also continuous.
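
The recursion can be run directly to watch the two interlaced subsequences close in on g(x). In this sketch the function names, the test value x = 100, and the iteration count are our own assumptions; the logarithm of x to the base u is computed as ln x / ln u.

```python
import math

def sequence(x, terms):
    """First `terms` values u_0, u_1, ... with u_0 = e and u_{n+1} = log base u_n of x."""
    u = [math.e]
    for _ in range(terms - 1):
        u.append(math.log(x) / math.log(u[-1]))
    return u

def g_limit(x, terms=500):
    """Approximate g(x), the limit of the sequence, for x > e^e."""
    return sequence(x, terms)[-1]

if __name__ == "__main__":
    u = sequence(100.0, 8)
    print(u)                      # even-indexed terms rise, odd-indexed terms fall
    g = g_limit(100.0)
    print(g, g ** g)
```

The final line should show $g^g$ close to 100, matching the characterization of g(x) as the inverse of $y^y$.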


B-6. (10,3, 7,0, 0,0, 0, 1, 8, 10, 29, 133) 


Let s = (a + b + c)/2, t = s − a, u = s − b, v = s − c and similarly for the primed letters.
Using Heron’s Formula, the inequality to be proved will follow from

(A) $\sqrt[4]{stuv} + \sqrt[4]{s't'u'v'} \le \sqrt[4]{(s + s')(t + t')(u + u')(v + v')}$

for positive s, t, u, v, s’, t’, u’, v’. A simpler analogous inequality that might be helpful is

(B) $\sqrt{xy} + \sqrt{x'y'} \le \sqrt{(x + x')(y + y')}$ for x, y, x’, y’ positive.

First we note that (B) follows from the Cauchy Inequality applied to the vectors $(\sqrt{x}, \sqrt{x'})$ and $(\sqrt{y}, \sqrt{y'})$
[and also follows from $(\sqrt{xy'} - \sqrt{x'y})^2 \ge 0$ or from the Inequality on the Means applied to
xy’ and x’y]. Using (B) with $x = \sqrt{st}$, $x' = \sqrt{s't'}$, $y = \sqrt{uv}$, $y' = \sqrt{u'v'}$ and reapplying (B) to the
new right side, one has

$\sqrt[4]{stuv} + \sqrt[4]{s't'u'v'} \le \sqrt{(\sqrt{st} + \sqrt{s't'})(\sqrt{uv} + \sqrt{u'v'})}$

$\le \sqrt{\sqrt{(s + s')(t + t')}\,\sqrt{(u + u')(v + v')}}.$

Since here the rightmost part equals the right side of (A), we have proved (A).

Equality holds in (B) if and only if $\sqrt{x} : \sqrt{x'} = \sqrt{y} : \sqrt{y'}$ and this holds if and only if
x : x’ = y : y’. Hence equality occurs in (A) if and only if s : t : u : v = s’ : t’ : u’ : v’. It follows
that equality occurs in the original inequality if and only if a, b, c are proportional to a’, b’, c’.
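
A quick randomized check of the inequality and of the equality case is sketched below. The sampling scheme and all names are our own: sides are generated as a = u + v, b = t + v, c = t + u from positive t, u, v, which guarantees the triangle inequalities and matches the substitution in the proof.

```python
import math
import random

def area(x, y, z):
    """Heron's formula for the area K(x, y, z)."""
    s = (x + y + z) / 2
    return math.sqrt(max(s * (s - x) * (s - y) * (s - z), 0.0))

def random_triangle(rng):
    """Sides (u + v, t + v, t + u) for random positive t, u, v."""
    t, u, v = (rng.uniform(0.1, 10.0) for _ in range(3))
    return (u + v, t + v, t + u)

def gap(tri1, tri2):
    """sqrt K(sum of sides) - sqrt K(tri1) - sqrt K(tri2); the claim is gap >= 0."""
    summed = tuple(p + q for p, q in zip(tri1, tri2))
    return math.sqrt(area(*summed)) - math.sqrt(area(*tri1)) - math.sqrt(area(*tri2))

if __name__ == "__main__":
    rng = random.Random(0)
    print(min(gap(random_triangle(rng), random_triangle(rng)) for _ in range(1000)))
    print(gap((3, 4, 5), (6, 8, 10)))   # proportional sides: equality up to rounding
```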


PROGRESS REPORTS 
EDITED BY THOMAS BANCHOFF AND RICHARD MILLMAN 


It is easy to be too busy to pay attention to what anyone else is doing, but not good. All of us should know, and 
want to know, what has been discovered since our formal education ended, but new words, and relations between 
them, are growing too fast to keep up. It is possible for a person to learn of the title of a recent work and of the key 
words used in it and still not have the faintest idea of what the subject is. 

Progress Reports is to be an almost periodic column intended to increase everyone’s mathematical information 
about what others have been up to. Each column will report one step forward in the mathematics of our time. The 
purpose is to inform, more than to instruct: what is the name of the subject, what are some of the words it uses, what 
is a typical question, what is the answer, who found it. The emphasis will be on concrete questions and answers 
(theorems), and not on general contexts and techniques (theories). References will be kept minimal: usually they will 
include only one of the earliest papers in which the answer appears and a more recent exposition of the discovery, 
whenever one is easily available. 

Everyone is invited to nominate subjects to be reported on and authors to prepare the reports. The ground rules 
are that the principal theorem should be old enough to have been published in the usual sense of that word (and not 
just circulated by word of mouth or in preprints); it should be of interest to more than just a few specialists; and it 
should be new enough to have an effect on the mathematical life of the present and near future. In practice most 
reports will probably be on progress achieved somewhere between 5 and 15 years ago. 


MANIFOLDS WITH THE SAME SPECTRUM 


RICHARD S. MILLMAN 
Department of Mathematical & Computer Sciences, Michigan Technological University, 
Houghton, MI 49931 


Many extremely important results in differential geometry concern the effect of the geometry 
of an object upon its topology. In addition, over the past forty years there has been a great deal of 
interest in the relationship between analytic quantities and the geometry and topology of 

Telegraphic Reviews 


Edited by Lynn Arthur Steen, with the assistance of the Mathematics Departments 
of Carleton, Macalester, and St. Olaf Colleges. Books submitted for review 
should be sent to Book Review Editor, American Mathematical Monthly, St. Olaf 
College, Northfield, Minnesota 55057. 


Telegraphic reviews are designed to give prompt notice of all new books in the mathematical 
sciences. Certain of these books will be selected for more extensive review in the Reviews section 
of the Monthly. 
T: Textbook 13-18: Grade Level 

S: Supplementary Reading 1-4: Time in Semesters 
General, P, L. A Handbook of Game Design. Henry Ellington, Eric Addinall, Fred Percival. Nichols 
Pub, 1982, 156 pp, $25. [ISBN: 0-89397-134-0] A systematic design strategy for developing card, 
board, computer and other types of games. Most of the examples discussed either are educational or 
mimic real life situations (war, business, politics, etc.), with no discussion of mathematical game 
theory, winning strategies, etc. Bibliography, glossary, extended case study, tips on holding com- 
petitions. GHM 


Precalculus, T(13: 1). Algebra and Trigonometry, Third Edition. Margaret L. Lial, Charles D. 
Miller. Scott Foresman, 1983, 632 pp, $23.95. [ISBN: 0-673-15794-6] Standard topics in precalculus. 
Changes from Second Edition include more thorough review of algebra, and new sections on complex 
numbers and polar coordinates. (TR, First Edition, October 1978; Second Edition, December 1980.) 
JRG 


Precalculus, T(13: 1). Essentials of Trigonometry. Karl J. Smith. Brooks/Cole Pub, 1983, xi + 292 
pp, $19.95. [ISBN: 0-534-01224-8] In response to the "back-to-basics" movement the development here 
begins with right triangles but recognizes modern technology with extensive use of hand-held calcu- 
lators. Standard topics, routine treatment; includes complex numbers, polar coordinates, loga- 
rithms, tables. JS 


Education, S(15-16). Courseware in the Classroom: Selecting, Organizing, and Using Educational 
Software. Ann Lathrop, Bobby Goodson. Addison-Wesley, 1983, viii + 187 pp, $10 (P). [ISBN: 0- 
20007-4] Practical tips for the neophyte. Includes ideas on evaluating, sources of software and 
software reviews, and a courseware directory (to be updated annually). Illustrated with photos of 
actual programs on-screen. Examples are primarily elementary level in all subject areas. Good sup- 
plement for a course on using computers to teach mathematics. MW 


Education, P, L*. The Agenda in Action: 1983 Yearbook. Gwen Shufelt, James R. Smart. NCTM, 1983, 
x + 245 pp, $14.50. [ISBN: 0-87353-201-5] 27 papers dealing with specific strategies for implement- 
ing eight of the recommendations in NCTM’s 1980 pamphlet Agenda for Action (TR, December 1980). 
Topics range from problem solving to statistics, from computers to basic skills. A rich source of 
the best current thinking on mathematics education. LAS 


History, P. Mathematical Perspectives: Essays on Mathematics and Its Historical Development. Ed: 
Joseph W. Dauben. Academic Pr, 1981, xv + 272 pp, $34. [ISBN: 0-12-204050-3] 13 scholarly essays in 
German (7), English (5) and French (1) on the history of mathematics, mostly on topics or people of 
the 17th, 18th and 19th centuries: Gauss, Kummer, Crelle, analytic functions, isoperimetric figures. 
A supplement to Historia Mathematica dedicated to Kurt-Reinhard Biermann on his 60th birthday. LAS 


History, P, L*. P.R. Halmos: Selecta Expository Writing. Ed: Donald E. Sarason, Leonard Gillman. 
Springer-Verlag, 1983, xix + 304 pp, $19.80. [ISBN: 0-387-90756-4] Second volume of Halmos’ 
Selecta, this one contains 27 expository gems ranging from Monthly-level articles like "What does 
the spectral theory say?" to popular pieces ("Mathematics as a creative art" and "The thrills of 
abstraction"). LAS 


History, T*(14-16: 1), S*, L*. An Introduction to the History of Mathematics, Fifth Edition. Howard 
Eves. Saunders Coll Pub, 1983, xviii + 593 pp, $34.95. [ISBN: 0-03-062064-3] Additions include 
sections on the metric system, the "new math" and on Bourbaki. Expanded treatment of more recent 
developments in mathematics. More anecdotes, human interest stories and biographical data on later 
mathematicians. More figures and illustrations. Expanded Problem Studies. Lists of essay topics 
for student papers have been added. Convenient mathematical periods chart on inside front cover. 
Extended chronological table in text. Numerous particular and general references. (TR, Third Edi- 
tion, October 1969; Fourth Edition, August-September 1976.) JK 
Foundations, T(15-16: 1), S, L. The Logical Foundations of Mathematics. William S. Hatcher. Per- 
gamon Pr, 1982, x + 320 pp, $38. [ISBN: 0-08-025800-X] Text covering major nonconstructive founda- 
tional systems of the last century including Frege’s system, type theory, ZF set theory, Hilbert’s 
program, Gödel’s incompleteness theorem, systems of Quine, and categorical algebra. Presents basic 
ideas of each system plus criticism and comparison of systems. Expanded, revised, updated version 
of author’s 1968 book (TR, January 1970; Extended Review, March 1974). KS 


Foundations, S. Der Zahlbegriff. Benno Artmann. Vandenhoeck & Ruprecht, 1983, viii + 265 pp, DM 
34 (P). [ISBN: 3-525-40544-8] A modern and comprehensive treatment of the number concept. Treats 
real and complex numbers, quaternions, infinite cardinals, nonstandard numbers. Ends with 
Pontrjagin’s topological characterization of R, C and quaternions. JD-B 


Combinatorics, P. Algebraic and Geometric Combinatorics. Ed: Eric Mendelsohn. Math. Stud., No. 
65. Elsevier Sci Pub, 1982, xiii + 378 pp, $88.50 (P). [ISBN: 0-444-86365-6] Collection of papers 
dedicated to Nathan Mendelsohn on his 65th birthday. Note price. JRG 


Number Theory, T(15), S, L. Introduction to the Theory of Numbers. Harold N. Shapiro. Wiley, 
1983, xii + 459 pp, $39.95. [ISBN: 0-471-86737-3] An interesting text appropriate for seniors and 
beginning graduate students. Topics include divisibility, unique factorization, arithmetic func- 
tions, congruences, quadratic residues, combinatorial considerations, and the prime number theorem. 
Contains many exercises, references, and bibliographical notes. SG 


Linear Algebra, T(14: 1), S, L. Linear Algebra with Applications to Differential Equations. P.G. 
Kumpel, J.A. Thorpe. Saunders Coll Pub, 1983, xii + 353 pp, $23.95. [ISBN: 0-03-060556-3] Unusual 
introductory text which uses linear differential and algebraic equations to motivate and illustrate 
topics. Greater emphasis on function spaces than normal. Optional final chapter covers power 
series solutions, Laplace transform. Good range of computational and theoretical exercises. KS 


Linear Algebra, T(14-15: 1), L. Linear Algebra with Applications, Second Edition. Jeanne Agnew, 
Robert C. Knapp. Brooks/Cole Pub, 1983, xiii + 361 pp, $22.95. [ISBN: 0-534-01364-3] Written 
(apparently successfully) to be accessible to sophomores with limited backgrounds, the text begins 
with matrix algebra, and works through linear systems, determinants, and eigenvalues before develop- 
ing the formalism of linear spaces, transformations, and quadratic forms. Each chapter has at least 
one significant application included in the text and the last chapter is devoted to linear program- 
ming. Several computer programs, in Basic, are included. (TR, First Edition, January 1979.) JS 


Linear Algebra, T(13-14: 1), S. Linear Algebra with Computer Applications. Ronald I. Rothenberg. 
Wiley, 1983, x + 387 pp, $12.95 (P). [ISBN: 0-471-09652-0] Self-instructional format. Covers stan- 
dard linear algebra topics, plus applications (Markov processes, population growth) and use of com- 
puters and programmable calculators to do matrix calculations. Does not assume calculus. JRG 


Finite Mathematics, T(13-14: 1, 2). Finite Mathematics with Applications to Business, Life Sci- 
ences, and Social Sciences. Stanley I. Grossman. Wadsworth Pub, 1983, xiv + 512 pp. [ISBN: 0-534- 
01246-9] Topics include introductions to linear equations and matrices, linear programming, proba- 
bility, statistics, game theory, and business topics. Tells students the what and the why (not just 
the how) in an engaging style. Wide variety of applications. A promising text. GHM 


Calculus, T(13-14: 1, 2). Elements of Calculus. G. Don Allen, Charles Chui, Bill Perry. 
Brooks/Cole Pub, 1983, xiii + 460 pp, $24.95. [ISBN: 0-534-01188-8] Applied calculus for business, 
social and biological science students. Single and multivariable calculus, some differential equa- 
tions and optimization. Contains many problems, index of applications. RM 


Calculus, T*(13-14: 1, 2), S. Mathematics for the Biosciences. Michael R. Cullen. Prindle, Weber 
& Schmidt, 1983, xvii + 751 pp. [ISBN: 0-87150-352-2] Emphasis on calculus and its applications. 
Coverage includes vectors and matrices, discrete probability, differential and difference equations. 
For freshmen and sophomores with high school algebra background. Intuitive rather than rigorous 
treatment. Wealth of prerequisite-free applications drawn from journals and more advanced texts. 
Adaptable to a variety of quarter, semester, and two-semester courses. Nicely done, especially the 
applications in the biosciences. JK 


Calculus, T(13: 2). Differential and Integral Calculus. Ya. S. Bugrov, S.M. Nikolsky. Transl: 
Leonid Levant. MIR Pub, 1982, 464 pp. Brief theoretical treatment, little attention paid to motiva- 
tion. Essentially no problems or applications. JRG 


Complex Analysis, S(18), P. Aspects of Contemporary Complex Analysis. Ed: D.A. Brannan, J.G. 
Clunie. Academic Pr, 1980, xiii + 572 pp, $108. [ISBN: 0-12-125950-1] Proceedings of a 1979 NATO 
Advanced Study Institute at Durham, England. In three parts: notes for 15 short courses, 19 lec- 
tures, progress reports on old and new problems. Topics, nearly all one-variable, include BMO, 
quasiconformal mappings, interpolation theory, approximation. PZ 


Differential Equations, T*(14-15: 1, 2), L. Differential Equations and Their Applications: An 
Introduction to Applied Mathematics, Third Edition. M. Braun. Appl. Math. Sci., V. 15. Springer- 
Verlag, 1983, 546 pp, $24. [ISBN: 0-387-90806-4] Two significant changes have been made in this edi- 
tion: the section dealing with singular solutions of differential equations has been revised and 
expanded, and a new section on bifurcation theory has been added. (First Edition, TR, December 
1975, Extended Review, October 1977; Second Edition, TR, June-July 1978.) AO 
Differential Equations, P*. Asymptotic Methods in Nonlinear Wave Theory. Alan Jeffrey, Takuji 
Kawahara. Pitman Pub, 1982, x + 256 pp, $39.95. [ISBN: 0-273-08509-3] Identifies concepts which 
form the basis of perturbation techniques currently in use to obtain an asymptotic description of 
nonlinear dispersive waves. Emphasis on formal development rather than theory. Unifying theme is 
application of various techniques to a single equation--the Boussinesq equation, a simple but typi- 
cal and nontrivial nonlinear dispersive equation. Very extensive bibliography. Of special interest 
to researchers in applied mathematics, physics and geophysics. JK 


Numerical Analysis, P. Lecture Notes in Mathematics-973: Matrix Pencils. Ed: B. Kågström, A. 
Ruhe. Springer-Verlag, 1983, xi + 293 pp, $17 (P). [ISBN: 0-387-11983-3] A selection of contribu- 
tions to a March 1982 conference sponsored by the University of Umeå at Pite Havsbad, Sweden. Major 
focus is on algorithms for numerical analysis of the matrix pencil A - λB. LAS 


Functional Analysis, P. Transformations Intégrales et Calcul Opérationnel. V. Ditkine, A. Proudni- 
kov. MIR, 1982, 435 pp. Chapters on Fourier, Laplace, Bessel and other transforms, and on the 
operational calculus, together with an extensive table of transforms. JD-B 


Analysis, P. Handbook of Integral Transforms of Higher Transcendental Functions: Theory and Algo- 
rithmic Tables. O.I. Marichev. Transl: L.W. Longdon. Ser. in Math. & Its Appl. Ellis Horwood, 
1983, 336 pp, $79.95. [ISBN: 0-85312-538-7] Modern theory of hypergeometric functions. Useful 
reference for evaluating a variety of definite integrals, with emphasis on the use of Mellin 
transforms. Systematic account including relevant background information. Requires some knowledge 
of analysis, including basics of complex variables. JK 


Algebraic Geometry, P. Tata Lectures on Theta I. David Mumford. Progress in Math., V. 28. Bir- 
khäuser Boston, 1983, xiii + 235 pp, $20. [ISBN: 3-7643-3109-7] The first half of a survey of the 
theory of theta functions. The author describes motivating problems and classical applications, the 
heat equation, projective embeddings of tori, modular forms, sums of squares, and the zeta func- 
tion. He concludes with an introduction to theta functions of several variables. SG 


Algebraic Geometry, P. Topics in the Theory of Algebraic Groups. James B. Carrell, et al. Notre 
Dame Math. Lect., No. 10. U of Notre Dame Pr, 1982, ix + 181 pp, $9.95 (P). [ISBN: 0-268-01843-X] A 
series of five survey lectures (by Carrell, Curtis, Humphreys, Parshall, and Weisfeiler) on aspects 
of the theory of algebraic groups. SG 


Differential Geometry, P. The Method of Iterated Tangents with Applications in Local Riemannian 
Geometry. J. Enrico White. Pitman Pub, 1982, xx + 252 pp, $43.95. [ISBN: 0-273-08515-8] A study of 
Riemannian manifolds from the point of view of iterated tangent bundles, a generalization of tensor 
analysis. The author develops the theory and presents applications to Gaussian and Riemannian cur- 
vature. SG 


Operations Research, P. Mathematical Models of Renewable Resources. Ed: Roland Lamberson. Hum- 
boldt State University (Mathematical Modeling Group), 1982, 107 pp, $5 (P). Proceedings of First 
Pacific Coast Conference on Mathematical Models of Renewable Resources, April 1982. JRG 


Probability, P*. Extremes and Related Properties of Random Sequences and Processes. M.R. Lead- 
better, Georg Lindgren, Holger Rootzén. Ser. in Stat. Springer-Verlag, 1983, xii + 336 pp, $36. 
[ISBN: 0-387-90731-9] Divided into four parts: classical extreme value theory (dealing with 
sequences of independent, identically distributed random variables), extensions to dependent 
sequences, extensions to continuous parameter stationary processes, and applications. Good set of 
references. RSK 


Statistics, P. The Jackknife, the Bootstrap and Other Resampling Plans. Bradley Efron. SIAM, 
1982, vii + 92 pp, $12.50 (P). [ISBN: 0-89871-179-7] Lectures given at an NSF-sponsored CBMS 
Regional Conference at Bowling Green State University in June, 1980. Concerned with nonparametric 
estimation of bias, variance, and more general measures of error, using these relatively new tech- 
niques (which are simple but require extensive computations). RSK 


Statistics, T(15-17: 1), P*. Sampling for Health Professionals. Paul S. Levy, Stanley Lemeshow. 
Lifetime Learning Pub, 1980, xv + 320 pp, $27. [ISBN: 0-534-97986-6] Elementary introduction to sam- 
ple survey methodology, designed primarily for practicing statisticians in health-related fields, 
particularly those with minimal formal training in statistics. Thorough treatment of simple random, 
systematic, stratified and cluster sampling. RSK 


Statistics, S(15-17), P. The Beginning Forecaster: The Forecasting Process Through Data Analysis. 
Hans Levenbach, James P. Cleary. Lifetime Learning Pub, 1981, xiii + 372 pp, $31.50. [ISBN: 0-534- 
97975-0] First of a two-volume work (see TR of The Professional Forecaster below), this book 
describes a number of basic forecasting methods. Divided into three parts: approaching a forecast- 
ing problem, solving a forecasting problem, and managing the forecasting function. RSK 


Statistics, S(16-18), P. The Professional Forecaster: The Forecasting Process Through Data 
Analysis. James P. Cleary, Hans Levenbach. Lifetime Learning Pub, 1982, xiii + 402 pp, $31.50. 
[ISBN: 0-534-97960-2] Describes up-to-date methods for the experienced forecaster. Robust/resistant 
methods, regression analysis, explanatory data analysis, the Box-Jenkins methods for time series, 
analysis of residues. FLW 
Statistics, T(17: 1), P. Analyzing Experimental Data by Regression. David M. Allen, Foster B. 
Cady. Lifetime Learning Pub, 1982, xvi + 394 pp, $29.95. [ISBN: 0-534-97963-7] Unified approach to 
classical regression, experimental design, analysis of covariance and some advanced topics, using 
regression techniques with appropriate design matrices. Computational algorithms are emphasized and 
amply illustrated on small data sets. Analyzes many real data sets. RSK 


Statistics, P. Lecture Notes in Statistics-16: Specifying Statistical Models: From Parametric to 
Non-Parametric, Using Bayesian or Non-Bayesian Approaches. Ed: J.P. Florens, et al. Springer- 
Verlag, 1983, xii + 204 pp, $14 (P). [ISBN: 0-387-90809-9] Twelve papers presented at the Second 
Franco-Belgian Meeting of Statisticians held in Louvain-la-Neuve (Belgium) in October, 1981. Clas- 
sified according to four types of methodological considerations: specification, model approximation, 
robustness, and adaptivity. RSK 


Statistics, P. Nonlinear Renewal Theory in Sequential Analysis. Michael Woodroofe. SIAM, 1982, v 
+ 119 pp, $14.50 (P). [ISBN: 0-89871-180-0] Based on lectures given at an NSF-supported CBMS 
Regional Conference at Oklahoma State University in July, 1980. Deals with repeated significance 
tests, developing the mathematical techniques for determining their properties. RSK 


Statistics, T(13-16: 1, 2). Introduction to Statistical Thinking. E.A. Maxwell. Prentice-Hall, 
1983, xii + 574 pp, $25.95. [ISBN: 0-13-498105-7] Traditional topics for a general introduction to 
statistical methods for students without a strong mathematics background (no calculus is used). 
Emphasis on inferential statistics. Examples from a wide range of disciplines. GHM 


Statistics, S(18), P. The Advanced Theory of Statistics, Volume 3: Design and Analysis, and Time 
Series, Fourth Edition. Sir Maurice Kendall, Alan Stuart, J. Keith Ord. Macmillan Pub, 1983, x + 
780 pp, $65. [ISBN: 0-02-847860-6] Revision of the final volume of this classic treatise, which 
deals with the analysis of variance, design of experiments, sample survey theory, multivariate 
analysis, and time-series. Major changes are in the latter two areas, reflecting recent develop- 
ments. RSK 


Statistics, S(13-18), P, L. Statistics in Medical Research: Methods and Issues, with Applications 
in Cancer Research. Ed: Valerie Miké, Kenneth E. Stanley. Wiley, 1982, xxi + 551 pp, $34.95. 
[ISBN: 0-471-86911-2] Papers from a 1981 conference on biostatistics in clinical oncology. Provides 
an overview of biostatistical methods as well as practical aspects of medical research. FLW 


Statistics, T(13-14: 1). A First Course in Statistics. James T. McClave, Frank H. Dietrich, II. 
Dellen Pub, xii + 433 pp, $24.95. [ISBN: 0-89517-050-7] Presupposes no college mathematics. The 
usual topics presented in a straightforward manner. Includes some interesting case studies. FLW 


Statistics, P. Lecture Notes in Medical Informatics-20: Manual for the Planning and Implementation 
of Therapeutic Studies. Sibylle Biefang, Wolfgang Köpcke, Martin A. Schreiber. Springer-Verlag, 
1983, 100 pp, $12 (P). [ISBN: 0-387-11979-5] Translation of a 1979 German guidebook that compiled, 
from a variety of sources, strategies and recommendations for the effective conduct of clinical stu- 
dies as a prerequisite to statistical studies. "Applied statistical methods still appear to many 
doctors to constitute an estrangement of medicine in thought and action." LAS 


Statistics, T(15-18: 1, 2), S, P, L. Applied Linear Statistical Methods. Donald F. Morrison. 
Prentice-Hall, 1983, xiv + 562 pp, $38.95. [ISBN: 0-13-041020-9] Presupposes a basic statistics 
course. Matrix algebra is presented in an appendix. Regression analysis, time series, stepwise 
regression, analysis of variance and covariance. Underlying assumptions emphasized but proofs are 
usually omitted. Real data used in examples. FLW 


Statistics, T(16-18: 1), S, L. A Course in Linear Models. Anant M. Kshirsagar. Statistics, V. 45. 
Dekker, 1983, xvi + 422 pp, $32.75. [ISBN: 0-8247-1585-3] Presupposes matrix theory and a course in 
mathematical statistics. Supplies the theory behind regression analysis, analysis of variance and 
covariance, generalized least squares, and variance components analysis. FLW 


Statistics, T(16-18: 1, 2), S, P, L. An Introduction to Applied Multivariate Statistics. M.S. 
Srivastava, E.M. Carter. Elsevier North-Holland, 1983, x + 394 pp, $34.50. [ISBN: 0-444-00621-4] 
Presupposes some statistics. Supplies a chapter on matrix algebra. Multivariate normal distribu- 
tions, analysis of variance, and regression. Growth curves. Profile, discriminant, principal com- 
ponent, and factor analysis. Inference on covariance matrices. FLW 


Statistics, S(15-18), P. Reliability in the Acquisitions Process. Ed: Douglas J. DePriest, Robert 
L. Launer. Lecture Notes in Stat., V. 4. Dekker, 1983, viii + 199 pp, $35 (P). [ISBN: 0-8247- 
1792-9] Papers from a 1981 workshop held to "review recent developments in reliability research and 
to provide a forum to facilitate communications between DOD personnel and academic researchers." 
Note the high price for a paperback. FLW 


Computer Programming, T*(13-14: 1), S*, L. Oh! Pascal! Doug Cooper, Michael Clancy. WW Norton, 
1982, xix + 476 pp, $15.95 (P). [ISBN: 0-393-95205-3] Detailed and readable introduction to program- 
ming and Pascal. Includes comments on problem-solving techniques. Section on debugging, self-test 
and exercises at end of each chapter. Appropriate for self-study or classroom use and with batch or 
interactive system. KS 
Computer Programming, P. Programming the TI-59 & the HP-41 Calculators. Paul Garrison. TAB Books, 
1982, vi + 294 pp, $12.95 (P). [ISBN: 0-8306-1442-7] A very detailed introduction to the TI-59 and 
HP-41 for the complete calculator novice. A great number of examples. But what novice will have 
the patience to learn these two quite different programming systems simultaneously before choosing 
one or the other? GHM 


Computer Programming, S*(13-16), P*, L**. The Programmer’s Book of Rules. George Ledin, Jr., Vic- 
tor Ledin. Lifetime Learning Pub, 1979, 248 pp, $12.95 (P). [ISBN: 0-534-97993-9] Two-hundred fifty 
Do’s and Don’ts, with brief explanations and examples. These "state-of-the-art" maxims are sup- 
ported by numerous references to recent literature on programming methodology. GHM 


Computer Programming, P, L. The Preparation of Programs for an Electronic Digital Computer, with 
special reference to the EDSAC and the use of a library of subroutines. Maurice V. Wilkes, David J. 
Wheeler, Stanley Gill. Tomash Pub, 1982, xxxi + 170 pp, $30. [ISBN: 0-938228-03-X] Photographic 
reproduction of the "first programming textbook" ever published (1951). Introduction by Martin 
Campbell-Kelly outlines the origins, innovations and influence of this detailed guide to the cele- 
brated EDSAC computing machine which became operational at Cambridge in 1949. GHM 


Computer Programming, T(13: 1), S. Programming in BASIC. Tom Logsdon. Anaheim Pub, 1981, ix + 258 
pp, $15.95 (P) [ISBN: 0-88236-179-1]; Instructor’s Guide and Answer Manual, 101 pp, (P). Slow-paced 
introduction to BASIC. Covers only one-dimensional arrays. Exercises and programming problems at 
end of each chapter. Assumes student is using a terminal. Some remarks, pictures and language may 
be regarded as sexist. Instructor’s Guide includes listings and runs for programming problems. KS 


Computer Programming, T(13-14: 1). Structured Fortran 77 Programming. Seymour V. Pollack. Boyd & 
Fraser Pub, 1982, xvi + 496 pp, $17.95 (P). [ISBN: 0-87835-095-0] A structured introduction to the 
programming language Fortran 77. The author spends the first two chapters introducing the concepts 
of an algorithm and program design. The book then introduces the complete language specifications 
in Chapters 3-17. The examples all appear to be well structured, within the limits of what is 
available on Fortran 77. The order of presentation seems quite reasonable. This book seems compar- 
able to a number of good Fortran 77 books on the marketplace. MS 


Computer Programming, P. Z80 Assembly Language Subroutines. Lance A. Leventhal, Winthrop Saville. 
Osborne/McGraw-Hill, 1983, xi + 497 pp, $15.95 (P). [ISBN: 0-931988-91-8] This is a text for 
experienced assembly language programmers who must learn the specific details of the assembly 
language of the Z80 processor. It contains only a one-chapter introduction to the concepts of 
assembly level programs while Chapters 2-11 are a description of the features available on the Z80. 
In addition, it contains many complete subroutines for common system level operators such as code 
conversion, bit manipulation, string manipulation, and interrupt processing. This is mainly an 
advanced reference book rather than a text. MS 


Computer Programming, T(14: 1), S. The Programmer’s Craft: Program Construction, Computer Architec- 
ture, and Data Management. Richard J. Weiland, Charles R. Bauer. Reston Pub, 1983, xv + 160 pp. 
[ISBN: 0-8359-5645-8] A mix of structured programming (language independent), computer architecture, 
and data structures, with a data processing orientation. Probably too sparse for a text. RM 


Software Systems, S(15-16), P, L. Language Translators. John Zarrella. Microprocessor Software 
Eng. Concepts Ser. Microcomputer Applic, 1982, ix + 189 pp, $12.95 (P). [ISBN: 0-935230-06-8] Not 
geared to any particular language or machine, this book gives a fine overview of the general princi- 
ples of assemblers, compilers and interpreters for anyone interested in how programs are converted 
into machine-executable form. Presumes a slight familiarity with machine and assembly language pro- 
gramming. GHM 


Software Systems, S(16-18), P. Distributed Systems--Architecture and Implementation, An Advanced 
Course. Ed: B.W. Lampson, M. Paul, H.J. Siegert. Springer-Verlag, 1983, xiii + 510 pp, $17 (P). 
[ISBN: 0-387-10571-9] Originally published as Volume 105 in the Lecture Notes in Computer Science 
series. Contains the texts of twenty papers presented at the Technische Universität München, March 
4-13, 1980. The papers cover various aspects of the design and implementation of distributed sys- 
tems. AO 


Computer Science, T(13: 1). Mathematics for Data Processing, Second Edition. Frank J. Clark. Res- 
ton Pub, 1983, x + 326 pp, $21.95. [ISBN: 0-8359-4263-5] Intended for students seeking entry level 
occupations in data processing. Covers numeration systems, Boolean algebra, functions, matrix alge- 
bra, introductory material in linear programming and statistics. (First Edition, TR, January 1975.) 
JRG 


Computer Science, P. Lecture Notes in Computer Science-148: Logics of Programs and Their Applica- 
tions. Ed: A. Salwicki. Springer-Verlag, 1983, vi + 324 pp, $14.50 (P). [ISBN: 0-387-11981-7] 23 
selected contributions to an August 1980 conference in Poznan, Poland dealing with program logics 
(including methods from nonstandard logic and model theory), abstract data structures, parallel com- 
putations and the University of Warsaw’s programming language LOGLAN. LAS 


Computer Science, T(14-15: 1). The Logic Design of Computers, An Introduction. M. Paul Chinitz. 
Howard W. Sams, 1981, 413 pp, $15.95 (P). [ISBN: 0-672-21800-3] An introduction to the principles of 
computer organization and logic design intended for students who have had an introduction to pro- 
gramming languages and are now interested in looking at how these ideas are actually carried out 
within a computer system. It looks at both the functional organization and the logic design level 
of a typical digital computer. MS 


Computer Science, P. Third Caltech Conference on Very Large Scale Integration. Ed: Randal Bryant. 
Computer Sci Pr, 1983, xii + 430 pp, $36.95. [ISBN: 0-914894-86-2] This is a collection of 23 
papers, all of which were presented at the Third Conference on VLSI, Pasadena, California, March 
1983. The papers review what has occurred in the area of design, verification, and testing of VLSI 
circuits over the past ten years. The papers are quite technical and theoretical in nature and 
would probably be appropriate for graduate students in electrical engineering, physics, or computer 
science. MS 


Computer Science, T(16-17: 1), L. Principles of Programming Languages: Design, Evaluation, and 
Implementation. Bruce J. MacLennan. Holt, Rinehart & Winston, 1983, xvi + 544 pp. [ISBN: 0-03- 
061711-1] Principles of programming language design. Organization treats important languages (For- 
tran, Algol, Pascal, Ada, Lisp, Smalltalk, Prolog) as wholes to emphasize interrelationships between 
the parts, in addition to comparisons between the languages. Emphasis on historical context, as 
well as special features of the languages. RM 


Control Theory, P. Lecture Notes in Control and Information Sciences-47: Adaptive Systems with 
Reduced Models. P.A. Ioannou, P.V. Kokotovic. Springer-Verlag, 1983, v + 164 pp, $10 (P). [ISBN: 
0-387-12150-1] A research monograph on the robustness properties of model reference schemes (a class 
of adaptive systems). AO 


Applications (Biology), S(15-17), P. An Introduction to Mathematical Taxonomy. G. Dunn, B.S. Ever- 
itt. Stud. in Math. Biology, V. 5. Cambridge U Pr, 1982, xi + 152 pp, $29.95; $11.95 (P). [ISBN: 
0-521-23979-6; 0-521-28388-4] Philosophy and methodology behind mathematical techniques of classifi- 
cation in biology, including principal components analysis, multidimensional scaling, and cluster 
analysis. Requires only minimal background in elementary statistics and matrix algebra. Good set 
of references. RSK 


Applications (Health and Public Affairs), T(17: 1), P*. Public Program Analysis: A New Categorical 
Data Approach. Ron N. Forthofer, Robert G. Lehnen. Lifetime Learning Pub, 1981, xx + 294 pp, 
$30.95. [ISBN: 0-534-97974-2] Describes and illustrates the weighted least squares approach for 
analyzing categorical data in the areas of public health and public affairs. Includes both single 
and multiple-response applications. Presumes background in matrix algebra and analysis of variance. 
RSK 


Applications (Information Science), P. Lecture Notes in Computer Science-146: Research and Develop- 
ment in Information Retrieval. Ed: Gerard Salton, Hans-Jochen Schneider. Springer-Verlag, 1983, ix 
+ 311 pp, $14.50 (P). [ISBN: 0-387-11978-7] Proceedings of a May 1982 conference in Berlin of the 
ACM’s special interest group on information retrieval (SIGIR). A remarkably broad interdisciplinary 
study, linking library science, computer science and linguistics. Sample result: text compression 
in English using prefixes and suffixes is NP-hard. LAS 


Applications (Physics), P. Geometric Quantization in Action: Applications of Harmonic Analysis in 
Quantum Statistical Mechanics and Quantum Field Theory. Norman E. Hurt. D Reidel Pub, 1983, xiv + 
336 pp, $49.50. [ISBN: 90-277-1426-6] Quantum statistical mechanics and quantum field theory are 
used to illustrate the use of geometric quantization techniques. AO 


Applications (Physics), T(17-18: 1, 2), P. Mathematical Foundations of Elasticity. Jerrold E. 
Marsden, Thomas J.R. Hughes. Prentice-Hall, 1983, xviii + 566 pp, $42.95. [ISBN: 0-13-561076-1] A 
well-written introduction to the mathematical theory of three-dimensional elasticity using modern 
differential geometry and functional analysis. AO 


Applications (Simulation), T(15-16: 1). Introduction to Simulation: Programming Techniques and 
Methods of Analysis. James A. Payne. McGraw-Hill, 1982, xii + 324 pp, $33.95. [ISBN: 0-07- 
048945-9] A survey of techniques for performing simulation on a digital computer. It assumes that 
the reader has a knowledge of programming in a high-level language as well as a fundamental back- 
ground in statistics. It concentrates most heavily on the techniques of discrete-event simulation 
and introduces the special purpose languages SIMSCRIPT and GPSS for implementing discrete event 
simulation models. MS 


Reviewers 


RJA: Richard J. Allen, St. Olaf; PB: Peder Bolstad, St. Olaf; RB: Richard Brown, Carleton; JNC: 
Judith N. Cederberg, St. Olaf; CEC: Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Carleton; 
JRG: Jennifer R. Galovich, St. Olaf; SG: Steven Galovich, Carleton; JG: Jack Goldfeather, Carleton; 
PH: Paul Humke, St. Olaf; RBK: Roger B. Kirchner, Carleton; RSK: Richard S. Kleber, St. Olaf; JK: 
Joseph Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; GHM: George H. Mills, Carleton; RM: 
Richard Molnar, Macalester; RWN: Richard W. Nau, Carleton; AO: Arnold Ostebee, St. Olaf; AWR: A. 
Wayne Roberts, Macalester; MS: Michael Schneider, Macalester; JS: John Schue, Macalester; SS: Sey- 
mour Schuster, Carleton; JAS: J. Arthur Seebach, Jr., St. Olaf; KS: Kay Smith, St. Olaf; LAS: Lynn 
Arthur Steen, St. Olaf; TAV: Theodore A. Vessey, St. Olaf; MW: Martha Wallace, St. Olaf; FLW: Frank 
L. Wolf, Carleton; PZ: Paul Zorn, St. Olaf. 
Section Reports 


An asterisk (*) by the title of a paper indicates that copies of the paper are available from 
the author. Papers presented under special sponsorship as part of joint meetings are so noted in 
parentheses. 


Southeastern Section 


The sixty-second annual spring meeting of the Southeastern Section was held at The Citadel, Charleston, 
South Carolina, on April 15-16, 1983. A total of 255 persons attended the meeting. 


Invited Lectures: 


"Recent Progress on Combinatorial Problems in Ramsey Theory and Discrete Geometry," by William T. 
Trotter, Jr., University of South Carolina. 

"Beyond the Binomial," by John Neff, Georgia Institute of Technology. 

"Paradoxes About Rationals and Irrationals," by Ivan Niven, University of Oregon. 


Special Sessions: 


"Domination Number in Graphs," by Michael Johnson, University of Louisville. 

"Strongly Regular Graphs," by Robert Roth, Emory University. 

"Cycles in Hypergraphs," by Richard Duke, Georgia Institute of Technology. 

"Hamiltonian Properties in Graphs," by Ronald Gould, Emory University. 

"On Hamiltonian Cycles in Cayley Color Graphs," by Joseph B. Klerlein, Western Carolina Univer- 
sity. 

"Algorithms for Shortest Paths," by Douglas R. Shier, Clemson University. 

"Neighborhood Relations in Finite Graphs," by David Sumner, University of South Carolina. 

"Some Algebras Associated with Graphs," by Trevor Evans, Emory University. 


Short Presentations: 


"On Invertible Linear Combinations of Matrices," by Peter M. Gibson, University of Alabama in 
Huntsville. 

"Shuffling Cards, Imprimitivity, and Correcting Errors," by Stephen L. Davis, Davidson College. 

"On the Computer Algorithm of Generalized Euclidean Algorithm in the Ring of Integral Matrices," 
by Jau-shyong Shiue, Gardner Webb College. 

"Infinitesimal Deformations of Coordinate Axes," by Charles G. Fleming, The Citadel. 

"Affine G-Mappings," by Irl C. Bivens, Davidson College. 

"An Algorithm for Monitoring Global Error in Numerical Solution of a System of Linear Differential 
Equations," by James C. Pleasant, East Tennessee State University. 

"An Ad Hoc Method for Identifying Absolute Extrema in Lagrange Multiplier Problems," by John Bax- 
ley, Wake Forest University. 

"Transformations of Circles in E2 as Point Transformations in E3," by Ray Wylie, Furman Univer- 
sity. 

"On Convergence and Regularity of Regular Ritt Series," by G.R. Viswanath, South Carolina State 
College. 

"A Factorization Theorem for an H*(P)Space," by Nancy Lee Shell, Furman University. 

"Teaching Remedial Math in College: Is It Possible? A Survey of Developmental Mathematics in Col- 
lege," by Ping-Tung Chang, Augusta College. 

"The Arithmetic-Geometric Mean Inequality: Optimization Without Calculus," by Larry Riddle, Emory 
University. 

"The Sloan Program in the New Liberal Arts at Davidson," by R. Bruce Jackson, Jr., Davidson Col- 
lege. 

"Optimizing Boxes and Disks," by Thomas A. Hern, University of North Carolina at Chapel 
Hill/Bowling Green State University. 

"A Monte Carlo Method for Integration in Polar Coordinates," by Jerry E. Bolic, Lenoir-Rhyne 
College. 

"Some Elementary Calculus Examples from Statistics," by Richard G. Vinson, University of South 
Alabama. 

"A Matter of Life and Death: A Markovian Process," by Subhash C. Saxena, Coastal Carolina Col- 
lege. 


"Carroll’s Obtuse Triangle Problem," by Douglas H. Frank, Valdosta, Georgia. 

"The Use of the Stem and Leaf Plot in Constructing a Frequency Table," by Lloyd B. Smith, Jr., 
Lenoir-Rhyne College. 

"Modern Cryptography: The Mathematics of Secrecy," by Michael Willett, University of North Caro- 
lina at Greensboro. 

"Teaching Mathematical Induction Via Computing," by Paul R. Patten, North Georgia College. 

"Using the Microcomputer to Discover Patterns in the Graph of Polar Equations," by Linda H. Boyd 
and Charles R. Stone, Dekalb Community College. 

"A Study of Students Who Took Basic Math, Fall 1980," by Sandria N. Kerr, Winston-Salem State 
University. 

"Representation of a Set Whose Derived Set is Countable," by Arthur G. Sparks, Georgia Southern 
College. 

"Alternative Rings with Commuting Nilpotent Elements," by Tae-il Suh, East Tennessee State 
University. 

"A Lagrange Multiplier Problem with Multiple Critical Points," by Ted Gentry, Wake Forest Univer- 
sity. 

"Some Relations Between Groups and Graphs," by Terri England, Emory University. 

"A Computer Study of Amniocentesis Data," by Cynthia Priest, Emory University. 

"Implementation of a Spelling Checker and Corrector on a Microcomputer," by Kevin Pepe, Emory 
University. 

"Almost Periodic Functions and Semigroups," by Arnold Goldstein, Savannah, Georgia. 

"Weighted Associated Stirling Numbers," by Fred T. Howard, Wake Forest University. 

"Some Variations on the Problem of Generating Pythagorean Triples," by Kenneth E. Whipple, Georgia 
State University. 

"Point Unstable Graphs," by Ted R. Monroe, Converse College. 

"Color Algebras," by Steven D. Comer, The Citadel. 

"Which Triangular Numbers are Square?" by David R. Stone, Georgia Southern College. 

"Schedules for Testing Individuals in Tournaments for Pairs," by Gerald Huff, University of Geor- 
gia. 

"Weak Openness and Almost Openness," by David A. Rose, Francis Marion College. 

"A Division Game or How Far Can You Trust Mathematical Induction," by William H. Ruckle, Clemson 
University. 

"Polygons in Ordered Planes," by R.B. Killgrove, University of South Carolina at Aiken. 

"Casting Shadows on Eggs," by Robert E. Jamison, Clemson University. 


Florida Section 


The sixteenth annual spring meeting of the Florida Section was held on March 4-5, 1983 at 
Florida State University, Tallahassee. There were 128 registrants. 


Invited Addresses: 


"Pure and Applied Topology in 2, 3, and 4-Space," by T. Benny Rushing, University of Utah and 
Institute for Advanced Study. 

* "The Mathematical Sciences K-12: What Is Still Fundamental and What Is Not," by Marcia P. Sward, 
Associate Director, MAA. 

"Signs Sum Square Summable Sequences," by James R. Retherford, Louisiana State University. 

"Mathematicians in Operations Research in the U.S. Army Air Force in World War II," by Charles W. 
McArthur, Florida State University. 

"Motivation in Mathematics: Believe It or Not, It Is a Record," by Ignacio D. Bello, Hillsborough 
Community College. 

"A Math-Science Program for Gifted Girls," by Paul E. McDougle, University of Miami. 

"Antoine’s Necklace: Or How to Keep a Necklace from Falling Apart," by Beverly L. Brechner, 
University of Florida. 


Short Presentations: 


* "A Dialogical Theory of Mathematics Education," by James W. Garrison, Florida State University. 

"Linear Algebra and Space Travel to Music on the Apple Microcomputer," by Gareth Williams, Stetson 
University. 

"Factoring Binomials and Trinomials in Community Colleges with Questions for the 'Gordon Rule'," 
by Carlton A. Lane, Hillsborough Community College. 

"Harmonizing with a Calculator," by Alan Wayne, Pasco-Hernando Community College. 

"Eigenvalues and Eigenvectors of 2x2 Matrices," by James R. Weaver, University of West Florida. 

"A Computer Simulation of the Unlimited Register Machine," by Marion G. Harmon, Florida State 
University. 

"A Survey of Pending Florida and Federal Programs for Mathematics Education," by E.P. Miles, 
Florida State University. 

"Open Mappings and Dimension," by Alice Mason, University of Florida. 

"Applying Bessel Functions Underground," by John D. Hall, Southwest Florida Water Management 
District. 

"Partial Fractions," by Shiv K. Aggarwal, Embry-Riddle Aeronautical University. 

"Report on Remedial/Developmental Mathematics Conference," by Donald M. Hill, Florida A&M Univer- 
sity. 


Panel Discussion: 


"CUPM Curriculum Recommendations," by Gareth Williams (Moderator), Stetson University; Bettye Anne 
Case, Abraham Kandel, and Frederick Leysieffer, Florida State University; Kermit N. Sigmon, 
University of Florida. 

"Emmy Noether, Her Life and Work," organized by Association for Women in Mathematics: "Noether’s 
Life," by Betsey Whitman, Florida A & M University; "Noether’s Work in Abstract Algebra," by 
Robert Gilmer, Florida State University. 

"MAA Placement Test Program," by Roy C. Jones, Jr. (Moderator), University of Central Florida; 
Gloria Child, Rollins College; Art Crummer, University of Florida; Donald M. Hill, Florida A&M 
University; Linda W. Smith, Tallahassee Community College. 
Student Papers: 


"Mathematical Concepts Represented Through Color Graphics," by Douglas R. Martin, Florida State 
University. 

"A Mathematical Model of the Spread of Epidemic Diseases," by Robert S. Knego, University of South 
Florida. 

"A Low Cost Design for a Color Sensing Device for the Blind," by Raymond Curci, Florida State 
University. 

"Remedial/Developmental Mathematics Conference," organized by Donald M. Hill, Florida A & M 
University. 


At the business meeting an award was presented to Herman Meyer of the University of Miami, 
Barry University, and Florida International University for outstanding contributions to mathematics 
and the mathematical community of the Florida Section. 


Southwestern Section 


The Southwestern Section held its spring meeting on March 25-26, 1983 at New Mexico Institute 
of Mining and Technology in Socorro, New Mexico. A total of 44 members were registered for the 
meeting. Bonita Ross, a consultant for MATHFILE, gave a presentation and demonstration of the 
database MATHFILE. 


Invited Address: 


"Elementary Versions of Some Advanced Theorems," by Mark Kac, University of Southern California. 


Short Presentations: 


"The Dirichlet Problem with Unbounded Data," by David Siegel, New Mexico Institute of Mining and 
Technology. 

"The Rapid Computation of Residues," by Arthur Knoebel, New Mexico State University. 

"Applications of Grassman Algebra to Graphical Statistics," by Alvin Swimmer, Arizona State 
University. 

"The Second Fundamental Problem in the Theory of Quadratic Residues," by Justin McCarthy, Deming, 
New Mexico. 

"The Uniqueness of Generating Functions for Shifted Tableau, or The Hashing of Multisets," by 
Timothy Kraus, University of New Mexico. 

"Computers and Pattern Identification," by Peter Bonner, New Mexico Highlands University. 

"Non-additive Measures and Random Sets," by Hung T. Nguyen, New Mexico State University. 

"On the Distributions of LRT for Testing the Homogeneity of Several Exponential Populations," by 
Anita Singh, New Mexico Institute of Mining and Technology. 

"Empirical Bayesian Estimation in Life Testing," by Ashok K. Singh, New Mexico Institute of Mining 
and Technology. 

"Decision Making in Baseball Using Simulation," by Stephen Thorpe, New Mexico Institute of Mining 
and Technology. 

"Isospectral Potentials for the Laplacian," by H.D. Fegan, University of New Mexico. 

"Some Remarks on the Free Surface Condition for the Two-dimensional Laplace Equation," by Ross 
Lomanitz, New Mexico Institute of Mining and Technology. 

"Periodic Solutions of Autonomous Matrix Riccati Differential Equations," by David Sanchez, 
University of New Mexico. 

"A Modified Harmonic Series," by Clyde M. Dubbs, New Mexico Institute of Mining and Technology. 

"Elementary Proofs for Some Advanced Theorems," by Piotr Antosik, UTEP. 

"Statistics About the Mathematical Preparation of Entering Freshmen at UTEP, 1964-1983," by J.R. 
Provencio, UTEP. 


Ohio Section 


The annual spring meeting of the Ohio Section was held April 22-23, 1983. 


Invited Addresses: 


"Mathematical Modeling: When, Why, and For Whom," by Maynard Thompson, Indiana University. 

"Topology, Proofs, and CUPM’s Mathematical Sciences Program Recommendations," by David Lutzer, 
Miami University. 

"A Mathematics Department Microcomputer Laboratory," by Maynard Thompson, Indiana University. 

"The Contributions of Anna Johnson Pell Wheeler," by Nazra Azarnia, Miami-Hamilton. 

"How to Keep Students Awake: Some Tricks of the Trade," by Darrell Horwath, John Carroll Univer- 
sity. 
Panel Discussions: 


"Initial Recommendations for Teacher Certification: What Do You Think?" by Dick Shumway (Modera- 
tor), Ohio State University. 

"Report on School-University Articulation," by William Beyer (Moderator), University of Akron. 

"Panel on Graduate Study Options," by David Lipsich (Moderator), University of Cincinnati. 

"Microcomputer Software Swap Session," by Robert Anderson (Coordinator), Marietta College. 


Contributed Papers: 


"Generating Fuzzy Topologies with Semi-closure Operators," by Albert J. Klein, Youngstown State 
University. 

"Pavel Sergeevich Alexandroff: In Memoriam," by Douglas E. Cameron, University of Akron. 

"Quotient Additive Properties in Topological Spaces," by Norman Levine, Ohio State University. 

"Near-best Approximation by Averaging Polynomial Interpolants," by Judith Palagallo, University of 
Akron. 

"Conic Sections-1P vs. Euclidean," by Richard Laatsch, Miami University. 

"Angular Separations of Finite Sets in E*," by E.P. Merkes, University of Cincinnati. 

"Study of Curves at Ly" by Kenneth Cummins, Kent State University. 

"The 1983 AHSME, Mathematical Content and Implications of the Results," by Leo J. Schneider, John 
Carroll University. 

"Modula-2: Usage in Modelling and Simulation," by Ralph Hollingsworth, Muskingum College. 

"Promoting COBOL," by William M. Wagner, Dublin, Ohio. 


Kansas Section 


The sixty-eighth annual spring meeting of the Kansas Section was held on April 8-9, 1983 at the 
University of Kansas, Lawrence, Kansas. Approximately 125 persons attended. 


Invited Addresses: 


"An Anecdotal History of Mathematics in Kansas," by G. Baley Price, University of Kansas. 
"Approximate Arithmetic Progressions," by Ronald L. Graham, Bell Telephone Laboratories. 


Panel Discussion: 


"The Mathematics Challenge in Two-Year Colleges," by Jean Moran (Moderator), Donnelly College. 


Contributed Papers: 


"Computer Graphics in the Teaching of Mathematics," by Earl J. Schweppe, University of Kansas. 

"3-D Image Generation in Vector Calculus," by Michael Frantz, Wichita State University. 

"A Class of Sequences Related to Powers of the Golden Ratio," by Hung Cao, Wichita State Univer- 
sity. 

"What are Dualities?" by Juergen Koslowski, Kansas State University. 

"Approximate Roots of Polynomials," by Chris Christensen, University of Kansas. 

"Prime Recognition Using a Probabilistic Algorithm on an Apple II," by Dave Foley, Wichita State 
University. 

"Fibonacci Structure Made Visual," by Larry Schulte, University of Kansas. 

"Another Look at the Power Rule," by Bonnie Ernst, Wichita State University. 

"Symbolic Integration Algorithms," by William David Miller, Kansas State University. 

"Incorporating Microcomputers Into the Curriculum," by Wayne F. Mackey, Johnson County Community 
College. 

"On the Problems of Presenting Graphically the Shape of Non-Symmetric Solids of Higher Dimension," 
by T.M. Creese, University of Kansas. 

"A Note on the Partial Differential Equation $u_{xx} + u_{yy} = f$," by William M. Self, Pittsburg State 
University. 

"An Overview of Latin Squares and Their Orthogonality," by D.V. Chopra and Dale Hughes, Wichita 
State University. 

"Generation of Families of Combinatorial Structures," by Thomas C. French, Wichita State Univer- 
sity. 

"Differentials of Numerical Functions," by Louis Herman, Kansas State University. 

"See How They Run: Phenotyping Race Horses with Polar Coordinates," by Paul Mostert, University of 
Kansas. 

"SIMPL: Remedial Mathematics at the University of Kansas," by Philip R. Montgomery, University of 
Kansas. 

"Who Was James Bernoulli?" by Glenn R. Shafer, University of Kansas. 

"A Characterization of Locally Compact Spaces," by Daniel Grubb, Kansas State University. 

"Forty-four Theorems for Four Fours," by Lester E. Laird, Emporia State University. 

"Super Heronian Triangles," by Ruth Janssen and William H. Richardson, Wichita State University. 

"Remarks on Some Submanifolds of Sasakian Manifolds," by Gregory Ronsse, Kansas State University. 
North Central Section 


The spring meeting of the North Central Section was held April 22-23, 1983 at Carleton College 
with 131 persons in attendance. 


Invited Addresses: 


* "Strategies for Algorithm Development and Implementation in Vehicle Routing," by Kendall E. 
Nygard, North Dakota State University. 
"Non-Commutative Limits," by Paul Halmos, Editor of the Monthly, Indiana University. 


Panel Discussion: 


"Placement, Etc.," by Gail Earles (Moderator), St. Cloud State University; Edwin Andersen, 
Southwest High School, Minneapolis; Mary B. Johnson, Inver Hills Community College; William 
Tomhave, University of Minnesota at Morris. 


Contributed Papers: 


* "The Solution of the Dumonceaux Problem," by H.B. Coonce, Mankato State University. 
"Lattices of Metrics," by Harold W. Martin, St. Cloud State University. 
"The Axis and Angle of Rotation Corresponding to an Orthogonal Matrix," by Warren S. Loud, Univer- 
sity of Minnesota. 
* "A Residential High School for Science and Mathematics in Minnesota," by Joseph A. Gallian, 
University of Minnesota at Duluth. 
"Computerized Black Box Exercises to Reinforce Abstract Concepts in Calculus," by George Mills, 
Carleton College. 
"An Example of Bayesian Estimation in Time Series," by Loretta Thielman, Gustavus Adolphus Col- 
lege. 
* "An Heuretical Approach to a Well-Studied Problem," by Sylvan Burgstahler, University of Minnesota 
at Duluth. 
"What is the Fast Fourier Transform?" by J. Arthur Seebach, St. Olaf College. 


Michigan Section 


The spring meeting of the Michigan Section was held on May 6-7, 1983 at the Oakland Community 
College, Auburn Hills Campus, Michigan. The meeting was attended by approximately 100 members. 


Invited Addresses: 


"Contemporary Applications of Linear and Integer Programming," by Harvey M. Salkin, Case Western 
Reserve University. 

"Using Quantitative Techniques to Take the Gambling Out of the Stock Market," by Harvey M. Salkin, 
Case Western Reserve University. 

"National Perspectives on Mathematical Education," by Marcia Sward, Associate Director, MAA. 

"Finite Groups and Finite Geometry," by Jonathan I. Hall, Michigan State University. 

"Applying Mathematics in Industry," by W. Weston Meyer, General Motors Research Labs. 


Short Presentations: 


"Geometric Methods in Babylonian Mathematics," by James K. Bidwell, Central Michigan University. 

"Who Wants to Talk About Mathematics in China?" by Chester Tsai, Michigan State University. 

"Using Microcomputers to Illustrate Concepts in Probability and Statistics," by Elliot A. Tanis, 
Hope College. 

"Exploratory Data Analysis Using Microcomputers," by Tom Ten Hueve III and Lynn Ploughman, Hope 
College. 

"A Proposed Logic Machine Control System," by Rick L. Stevens, Western Michigan University and 
Argonne National Lab. 

"Decompositions of Graphs," by Sergio Ruiz, Western Michigan University. 

"The Euclidean Algorithm Revisited," by John W. Petro, Western Michigan University. 

"Midwest Talent Search," by Sharon Higham (Coordinator), Michigan Talent Search. 

"Finding All Solutions to a System of Polynomial Equations," by Alden H. Wright, Western Michigan 
University. 

"Using the Gutenberg Word Processor," by P.K. Wong, Michigan State University. 

"Computer Graphics: Surfaces in R3," by Jack Kuipers, Calvin College. 

"Public Secret Codes," by F.J. Papp, University of Michigan at Dearborn. 


Northern California Section 


The 1983 spring meeting of the Northern California Section was held February 26, 1983 at Stan- 
ford University. There were 215 registered participants at the meeting. 
Invited Lectures: 


"Imagery in Mathematics Applications," by Ross L. Finney, Massachusetts Institute of Technology. 

"Some Paradoxical Coverings of the Real Line," by Ivan Niven, University of Oregon. 

"Mathematical Analysis of Decorative Patterns," by Dorothy K. Washburn, California Academy of 
Science. 

"Hilbert’s “Grundlagen der Geometrie” Revisited," by Garrett Birkhoff, Harvard University. 

"Anecdotes from the Early History of Computing," by Henry S. Tropp, Humboldt State University. 


Indiana Section 


The spring meeting of the Indiana Section was held at Indiana University on April 19, 1983 with 
approximately 60 members present. The Indiana Small College Math Competition was held in conjunc- 
tion with the meeting. 


Invited Addresses: 


"Puzzle It Out," by John Ewing, Indiana University. 
"Public Key Cryptography," by Dorothy Denning, Purdue University. 
* "Finding All Solutions to Small Systems of Polynomials Using a Computer," by Alexander Morgan, 
General Motors Research Institute. 
"Algorithmically Defined Functions," by Richard D. Anderson, Louisiana State University. 


Iowa Section 


The 70th annual spring meeting of the Iowa Section was held on the campus of Iowa State Univer- 
sity, Ames, Iowa on April 22-23, 1983. The meeting was held jointly with the Iowa Section of SIAM. 
Altogether 83 people attended the meetings. 


Invited Addresses: 


"Dynamics on the Riemann Sphere," by Paul Blanchard, Boston University. 
"Classroom Notes," by Leonard Gillman, University of Texas. 


Student Papers: 


"A Ubiquitous Partition on Perfect Subsets of RN," by Donald John Nicholson, Iowa State Univer- 
sity. 

"The Apple Looks at Bertrand’s Paradox," by Todd Wibben and Jimmie Arnold, Iowa Wesleyan College. 

"Factoring the Characteristic Polynomial for Graphs with Symmetry," by Grant Izmirlian, Drake 
University. 

"Cardano’s Method of Solving Cubics in Finite Fields," by Doug Jaynes and Bob Anderson, Iowa 
Wesleyan College. 


Short Presentations: 


"Applications of Clifford Algebras in Theoretical Physics," by Nikos Salingaros, University of 
Iowa. 

"Automata: A View from Four Perspectives," by Michael H. Millar, University of Northern Iowa. 

"A Geometric Limit Problem in Calculus," by Tom Whaley, Central College. 

"Hyperspace--Exploring the Concept of Dimension," by George Kelley, Maharishi International 
University. 

"A Fuller Value of Geometry," by George Kelley, Correy Vion and Vedder Wright, Maharishi 
International University. 

"The Musical Experience of Number," by George Kelley and Thomas Stone, Maharishi International 
University. 

"On Characterization of Isospectral Graphs," by Wayne L. Woodworth, Drake University. 

"A Matrix Calculator for Classroom Use," by Leslie Hogben, Iowa State University. 

"The Set of Solutions of a Semi-linear Parabolic Equation," by Paul Sacks, Iowa State University. 

"Practical and Mathematical Aspects of Computed Tomography," by W.R. Madych, Iowa State Univer- 
sity. 

"Frequency Entrainment in Periodically Forced Oscillators," by Jim Murdock, Iowa State University. 

"On Deriving the Characteristic Polynomial of a Graph," by Milan Randic, Drake University. 

"Using an Understanding of Consciousness to Explain the Effectiveness of Mathematics in the Sci- 
ences," by Catherine Gorini Wadsworth, Maharishi International University. 

"The Geometry of Coefficients of Powers of Polynomials," by Stephen J. Willson, Iowa State Univer- 
sity. 

"A Refined Platonist Philosophy of Mathematics," by Eric W. Hart, Maharishi International 
University. 
Using Heron’s Formula, the inequality to be proved will follow from 


(A) $\sqrt[4]{stuv} + \sqrt[4]{s't'u'v'} \le \sqrt[4]{(s+s')(t+t')(u+u')(v+v')}$ 


for positive $s, t, u, v, s', t', u', v'$. A simpler analogous inequality that might be helpful is 


(B) $\sqrt{xy} + \sqrt{x'y'} \le \sqrt{(x+x')(y+y')}$ for $x, y, x', y'$ positive. 


First we note that (B) follows from the Cauchy Inequality applied to the vectors $(\sqrt{x}, \sqrt{x'})$ and 
$(\sqrt{y}, \sqrt{y'})$ [and also follows from $(\sqrt{xy'} - \sqrt{x'y})^2 \ge 0$ or from the Inequality on the Means 
applied to $xy'$ and $x'y$]. Using (B) with $x = \sqrt{st}$, $x' = \sqrt{s't'}$, $y = \sqrt{uv}$, $y' = \sqrt{u'v'}$ and 
reapplying (B) to the new right side, one has 


$\sqrt[4]{stuv} + \sqrt[4]{s't'u'v'} \le \sqrt{(\sqrt{st} + \sqrt{s't'})(\sqrt{uv} + \sqrt{u'v'})}$ 
$\le \sqrt[4]{(s+s')(t+t')(u+u')(v+v')}$. 


Since here the rightmost part equals the right side of (A), we have proved (A). 

Equality holds in (B) if and only if $\sqrt{x} : \sqrt{x'} = \sqrt{y} : \sqrt{y'}$ and this holds if and only if 
$x : x' = y : y'$. Hence equality occurs in (A) if and only if $s : t : u : v = s' : t' : u' : v'$. It follows 
that equality occurs in the original inequality if and only if $a, b, c$ are proportional to $a', b', c'$. 
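
A quick numerical sanity check of (A) can be sketched in Python. This is only an illustration under stated assumptions: the function names, the sampling range, and the tolerance are hypothetical choices made here, and random sampling supports the inequality without proving it.

```python
import math
import random


def lhs_A(p, q):
    """Left side of (A): sum of the fourth roots of the two products."""
    return math.prod(p) ** 0.25 + math.prod(q) ** 0.25


def rhs_A(p, q):
    """Right side of (A): fourth root of the product of the pairwise sums."""
    return math.prod(a + b for a, b in zip(p, q)) ** 0.25


def check_A(trials=10000, seed=0, tol=1e-9):
    """Return True if (A) holds on every random positive sample (illustrative only)."""
    rng = random.Random(seed)
    for _ in range(trials):
        p = [rng.uniform(0.01, 10.0) for _ in range(4)]  # s, t, u, v
        q = [rng.uniform(0.01, 10.0) for _ in range(4)]  # s', t', u', v'
        if lhs_A(p, q) > rhs_A(p, q) + tol:
            return False
    return True


def equality_gap(p, scale):
    """Right side minus left side when the primed values are proportional to p."""
    q = [scale * a for a in p]
    return rhs_A(p, q) - lhs_A(p, q)
```

Calling `check_A()` tests the inequality on random positive inputs, and `equality_gap([1.0, 2.0, 3.0, 4.0], 2.5)` should vanish up to rounding, in line with the stated equality condition.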


PROGRESS REPORTS 
EDITED BY THOMAS BANCHOFF AND RICHARD MILLMAN 


It is easy to be too busy to pay attention to what anyone else is doing, but not good. All of us should know, and 
want to know, what has been discovered since our formal education ended, but new words, and relations between 
them, are growing too fast to keep up. It is possible for a person to learn of the title of a recent work and of the key 
words used in it and still not have the faintest idea of what the subject is. 

Progress Reports is to be an almost periodic column intended to increase everyone’s mathematical information 
about what others have been up to. Each column will report one step forward in the mathematics of our time. The 
purpose is to inform, more than to instruct: what is the name of the subject, what are some of the words it uses, what 
is a typical question, what is the answer, who found it. The emphasis will be on concrete questions and answers 
(theorems), and not on general contexts and techniques (theories). References will be kept minimal: usually they will 
include only one of the earliest papers in which the answer appears and a more recent exposition of the discovery, 
whenever one is easily available. 

Everyone is invited to nominate subjects to be reported on and authors to prepare the reports. The ground rules 
are that the principal theorem should be old enough to have been published in the usual sense of that word (and not 
just circulated by word of mouth or in preprints); it should be of interest to more than just a few specialists; and it 
should be new enough to have an effect on the mathematical life of the present and near future. In practice most 
reports will probably be on progress achieved somewhere between 5 and 15 years ago. 
Houghton, MI 49931 


Many extremely important results in differential geometry concern the effect of the geometry 
of an object upon its topology. In addition, over the past forty years there has been a great deal of 
interest in the relationship between analytic quantities and the geometry and topology of 
generalized surfaces. Many of these new results center on the notion of the spectrum of a manifold, 
defined in terms of eigenvalues of a generalized Laplacian. In 1964, a series of high-dimensional 
examples clarified the way in which the spectrum determines the object and then five years ago a 
breakthrough provided examples in dimension 2! 

A surface sits in 3-dimensional Euclidean space and, in a neighborhood, appears to be like the 
(2-dimensional) plane. A manifold, $M^n$, is a generalized surface; that is, it looks locally like $R^n$, 
where $n$ might be greater than 2. Any such manifold $M^n$ can be viewed as a subset of a Euclidean 
space $R^N$ of larger dimension so that it has an $n$-dimensional tangent space at each point in the 
same way that a surface in $R^3$ has a tangent plane at each point. The set $M$ thus inherits an inner 
product on its tangent space at each point in a natural way; there are, however, many possible 
ways to assign an inner product to a vector space like the tangent space. Generalization: a 
Riemannian manifold is a manifold together with a “smooth” assignment of an inner product to 
each tangent space. 

The fact that $M^n$ is embedded in $R^N$ makes it easy to define the concepts of vector calculus on 
$M^n$. To obtain the Laplacian $\Delta$ of a function $f$ on $M$ we first extend $f$ to a function $\tilde f$ on an open 
subset of $R^N$ and then define $\Delta f$ to be the restriction to $M$ of the ordinary Laplacian 
$-\sum_{i=1}^{N} \partial^2 \tilde f/\partial x_i^2$. Geometers choose the negative sign here so that the eigenvalues of $\Delta$ are 
nonnegative rather than nonpositive. (In fact one way to distinguish an analyst from a geometer is 
to ask what the sign of $\Delta$ is.) The value of $\Delta f$ will be independent of the extension $\tilde f$ used to define 
it; so it makes sense to define the spectrum of $M$ to be the set of eigenvalues of $\Delta$, 


$\{\lambda \in R \mid \Delta f = \lambda f \text{ for some nonzero function } f \text{ defined on } M\}$. 


If $M^n$ is the $n$-sphere $S^n$ in $R^{n+1}$, for example, we may extend any $f$ to a function $\tilde f$ which is 
constant in the radial direction and the eigenfunctions $f$ with $\Delta f = \lambda f$ are precisely the usual 
spherical harmonics. In general the spectrum of $M^n$ is a discrete set of nonnegative numbers 
tending to infinity, each with a finite multiplicity. For an abstractly defined Riemannian manifold 
$M^n$, the spectrum of $M$ turns out to be independent of the way the structure is obtained by the 
embedding into Euclidean space. 

The most primitive question which can be asked about the influence of the spectrum on the 
geometry of the manifold is whether or not it determines M up to isometry (rigid motion). M. 
Kac’s beautifully written article in this MONTHLY (1966) about the Euclidean problem quotes L. 
Bers’ interpretation of this question as “can you hear the shape of a drum?” The first result along 
these lines is the example John Milnor produced, in 1964, of two 16-dimensional manifolds (in 
fact, tori) that have the same spectrum (are “isospectral”) but are not isometric. The Milnor 
example [1, p. 154] is elegant in its simplicity. Let $\{e_1, \dots, e_n\}$ be a basis for $R^n$ such that $e_i \cdot e_j$ is 
an integer for each i and j, and let $\Gamma$ be the set of linear combinations of the basis using integer 
coefficients only. (The set $\Gamma$ is called the lattice generated by the basis.) The lattice $\Gamma$ is a subgroup 
of $R^n$ and the quotient, $T = R^n/\Gamma$, is a manifold called a torus, which inherits the Euclidean 
metric. (If $e_1 = (1,0)$ and $e_2 = (0,1)$, this is the usual torus obtained by identifying opposite edges 
of a square.) If $\gamma \in \Gamma$ is fixed, then the complex-valued function $f_\gamma$ on $R^n$ defined by $x \mapsto e^{2\pi i \gamma \cdot x}$ has 
the same value at $x$ and $x + \gamma'$ (for any $\gamma' \in \Gamma$) and so gives a function on the torus $R^n/\Gamma$. As 
$\Delta f = -\sum_i \partial^2 f/\partial x_i^2$, therefore $\Delta f_\gamma = 4\pi^2|\gamma|^2 f_\gamma$, so that $\lambda = 4\pi^2|\gamma|^2$ is an eigenvalue of $\Delta$ for each 
$\gamma$ in $\Gamma$. Note that the multiplicity of $\lambda$ depends only on how many elements of $\Gamma$ are located on a 
sphere of radius $\sqrt{\lambda}/2\pi$ about the origin in $R^n$. Because $\{f_\gamma \mid \gamma \in \Gamma\}$ is dense in the set of all 
complex-valued functions, every element of the spectrum of T must be of the form $4\pi^2|\gamma|^2$. 
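
The counting described above can be tried out on the simplest case. The following is a minimal sketch, assuming the standard integer lattice $Z^n$ (generated by the usual orthonormal basis, so all dot products are integers); the function names `lattice_shell_counts` and `torus_eigenvalues` are illustrative and do not come from the article. It tabulates how many lattice vectors lie on each sphere and reports the corresponding values $4\pi^2|\gamma|^2$ with their multiplicities.

```python
import math
from collections import Counter
from itertools import product


def lattice_shell_counts(dim, max_norm_sq):
    """Count vectors of the integer lattice Z^dim by squared length."""
    reach = math.isqrt(max_norm_sq)  # no coordinate can exceed this in size
    counts = Counter()
    for gamma in product(range(-reach, reach + 1), repeat=dim):
        norm_sq = sum(c * c for c in gamma)
        if norm_sq <= max_norm_sq:
            counts[norm_sq] += 1
    return counts


def torus_eigenvalues(dim, max_norm_sq):
    """Sorted (eigenvalue, multiplicity) pairs, eigenvalue = 4*pi^2*|gamma|^2."""
    counts = lattice_shell_counts(dim, max_norm_sq)
    return [(4 * math.pi ** 2 * m, counts[m]) for m in sorted(counts)]


if __name__ == "__main__":
    # Square torus R^2/Z^2: list eigenvalues with |gamma|^2 <= 5.
    for eigenvalue, multiplicity in torus_eigenvalues(2, 5):
        print(f"{eigenvalue:10.4f}  multiplicity {multiplicity}")
```

Two lattices with the same table of shell counts give isospectral tori, which is the property of the Witt lattices that Milnor's example exploits.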

We see thus that the eigenvalues (and multiplicities) on a torus depend only on the number of 
lattice points on spheres of a given radius about the origin. On the other hand, $R^n/\Gamma_1$ and $R^n/\Gamma_2$ 
are isometric precisely when there is a rigid motion of $R^n$ carrying the lattice $\Gamma_1$ onto $\Gamma_2$. However, 
Milnor pointed out two lattices discovered by E. Witt in 1941, which do not satisfy the rigid motion 
criterion but are such that every ball about the origin contains the same number of elements of $\Gamma_1$ 
as of $\Gamma_2$. Using considerably more machinery (i.e., the theory of modular forms) it can be shown [1] 
that there is also a torus counterexample in dimension 12. 
What then is the lowest dimension in which isospectral, nonisometric manifolds can exist? It 
came as quite a surprise when, in 1978, Marie-France Vignéras gave examples of 2-dimensional 
isospectral manifolds (actually, Riemann surfaces) which are not isometric. In fact, she gave 
examples in each dimension $n \ge 2$ of isospectral, nonisometric manifolds which don't reduce to a 
product of lower dimensional examples. Her 3-dimensional examples are more astounding 
because they have different fundamental groups. Thus spectral data do not suffice to determine 
even the feeble topological property of homotopy type, let alone the geometry. Her construction 
starts with the hyperbolic upper half plane $H = \{x + iy \mid y > 0\} \subset C$ rather than Euclidean space 
as Milnor's does. A nonsingular matrix $A = \begin{pmatrix} a & b \\ c & d \end{pmatrix}$ with real coefficients (i.e., an element of 
GL(2, R)) acts on H as a linear fractional transformation; that is, $Az = (az + b)/(cz + d)$. A 
“nice” subgroup of GL(2, R) will give a 2-dimensional manifold (actually, a Riemann surface) in 
the way that the lattice $\Gamma$ produces a torus from $R^n$. Vignéras, by using a deep construction in 
quaternion algebras, is able to produce two subgroups that give isospectral manifolds which are 
not isometric. Details of her work are sketched in Section 2 of [3]. Thus we are led to the question: 
Does there exist a continuous, isospectral but not isometric family of Riemannian metrics? This 
problem seems to be quite elusive with only partial results along these lines known today. Anyone 
who is interested in the kinds of questions, applications, or techniques of the subject can get a 
good overview by a trip through [2]. These ideas will provide much food for thought for geometers 
and global analysts for many years to come. 
Added in Proof: C. Gordon and E. Wilson have announced, in abstract 803-53-017 (April, 1983) of the Notices 
of the AMS, a continuous isospectral change by nonisometric Riemannian metrics. 
CHARACTERIZATION OF SMOOTH DOMAINS IN C BY THEIR BIHOLOMORPHIC 
SELF-MAPS 
The classification theory for Riemann surfaces contains much important and striking information 
about biholomorphic maps of domains $\Omega \subset C$. For obvious reasons, material of this nature is 
not often presented in a first course in complex analysis. The purpose of this note is to provide a 
short and elementary proof of a classification of smoothly bounded plane domains which have 
transitive automorphism groups. The author became aware of (the general idea of) this proof in 
the setting of several complex variables. 
We need a little terminology. A smooth domain $\Omega \subset C$ is a bounded, connected open set whose 
boundary consists of finitely many $C^1$ Jordan curves. If $\phi : \Omega \to \Omega$ is a one-to-one, onto 
holomorphic function (with a holomorphic inverse!) then $\phi$ will be called an automorphism of $\Omega$. 

For fixed $\Omega$, the set of automorphisms of $\Omega$ (denoted by Aut $\Omega$) forms a group under 
composition. Namely, the composition of two automorphisms is an automorphism, composition is 
associative, and the inverse of an automorphism $\phi$ is $\phi^{-1}$. The group Aut $\Omega$ is said to act 
transitively on $\Omega$ (or $\Omega$ is said to have “transitive automorphism group”) if, given $p, q \in \Omega$, there is 
a $\phi \in$ Aut $\Omega$ such that $\phi(p) = q$. Which domains $\Omega$ have transitive automorphism group? 

Every first course in complex analysis contains a discussion of the automorphism group of the 
unit disc D. One learns that the automorphisms of the disc consist precisely of the rotations 

$$\rho_\theta(z) = e^{i\theta} z, \quad 0 \le \theta < 2\pi,$$

the Möbius transformations 

$$\phi_a(z) = \frac{z - a}{1 - \bar a z}, \quad |a| < 1,$$

and compositions of these. It is easy to see that Aut D acts transitively on D. For if $p, q \in D$, then 
$\phi_{-q} \circ \phi_p$ maps p to q. The main result of this note is 
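
The transitivity claim is easy to check numerically. The sketch below is only an illustration using Python's built-in complex numbers; the helper names `mobius` and `carry` are hypothetical. It composes the Möbius map that sends p to 0 with the one that sends 0 to q.

```python
def mobius(a, z):
    """Disc automorphism phi_a(z) = (z - a) / (1 - conj(a) * z), for |a| < 1."""
    return (z - a) / (1 - a.conjugate() * z)


def carry(p, q, z):
    """Apply phi_{-q} after phi_p; this composition sends p to q."""
    return mobius(-q, mobius(p, z))


if __name__ == "__main__":
    p, q = 0.3 + 0.4j, -0.5 + 0.2j
    print(carry(p, q, p))         # agrees with q up to rounding
    print(abs(carry(p, q, 0.9j)))  # stays below 1: the disc is preserved
```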


THEOREM. If $\Omega \subset C$ is a smooth domain with transitive automorphism group, then $\Omega$ is conformally 
equivalent to the disc. 


The proof will take up the rest of this note and will be broken up into several lemmas. The 
proof reveals a nice interplay between function-algebraic ideas and geometric ones. 


DEFINITION. A function $f : \bar\Omega \to C$ will be called a peaking function for the point $P \in \partial\Omega$ if f is 
continuous on $\bar\Omega$, f is holomorphic on $\Omega$, $f(P) = 1$, and $|f(z)| < 1$ for $z \in \bar\Omega \setminus \{P\}$. The point P is 
then called a peak point. 


LEMMA 1. Given a smooth domain $\Omega$, there is a peak point $P \in \partial\Omega$ and a peaking function f for 
the point P. 


Proof. Let $P \in \partial\Omega$ be a point in $\bar\Omega$ which is furthest from 0. Then 

$$f(z) = \frac{1}{2}\left[1 + \frac{z \bar P}{|P|^2}\right]$$

does the job. □ 


LEMMA 2. Let $P \in \partial\Omega$ be a peak point. Let $z_0 \in \Omega$ be fixed. Suppose that $\phi_j \in$ Aut $\Omega$ satisfy 
$\phi_j(z_0) \to P$ as $j \to \infty$. If $K \subset \Omega$ is a compact set, then $\phi_j(z) \to P$ uniformly for $z \in K$. 


Proof. Let f be a peak function for $\Omega$ at P. Consider the family $\{f \circ \phi_j\}$. Since this family of 
functions is bounded by 1, it is normal. By Montel’s Theorem, every subsequence has a 
subsequence which converges uniformly on compact sets to a holomorphic function. But 
$(f \circ \phi_j)(z_0) \to 1$. By the Maximum Modulus Principle, the only possible limit function is constant, 
so $f \circ \phi_j$ converges uniformly on K to the constant function 1. Since f takes the value 1 only at P, 
the proof is complete. □ 


LEMMA 3. If $\Omega$ has transitive automorphism group, then $\Omega$ is simply connected. 


Proof. If $\Omega$ is not simply connected, then there is a loop $\lambda : [0,1] \to \Omega$ which cannot be 
continuously deformed to a point in $\Omega$. Let $P \in \partial\Omega$ be a peak point. Let $z_j \in \Omega$ satisfy $z_j \to P$. 
Fix $z_0 \in \Omega$. Since Aut $\Omega$ is transitive, there exist $\phi_j \in$ Aut $\Omega$ such that $\phi_j(z_0) = z_j$. Since 
$\Lambda = \{\lambda(t) : 0 \le t \le 1\}$ is compact, Lemma 2 implies that $\phi_j(z) \to P$ uniformly for $z \in \Lambda$. 

Now choose a neighborhood U of P such that $U \cap \Omega$ is simply connected (since $\partial\Omega$ is nice, we 
can do this; see Fig. 1). By the definition of uniform convergence, there is a j so large that 
$\phi_j(\Lambda) \subset U \cap \Omega$. Since $U \cap \Omega$ is simply connected, $\phi_j(\Lambda)$ can be continuously deformed to a 
point in $U \cap \Omega$. But since $\phi_j$ is a homeomorphism, $\Lambda$ can be continuously deformed to a point. 
That is a contradiction. □ 


Now the proof of the theorem is immediate. For Lemma 3 says, together with the Riemann 
Mapping Theorem, that a smooth domain with transitive automorphism group is conformally 
equivalent to the disc. 

[FIG. 1 (labeled with the loop $\phi_j(\Lambda)$) and FIG. 2: figures not reproduced.] 

It is not necessary to assume that $\partial\Omega$ is nice in order to prove this result. But the proof we have 
given will not work in general. For if $\Omega$ has infinitely many holes which accumulate at P (see Fig. 
2), then there may be no neighborhood U of P such that $U \cap \Omega$ is simply connected. So $\phi_j(\Lambda)$ 
may not be deformable to a point, no matter how large j. 

Our theorem is an old one which is known to experts in one complex variable. It has been 
extended to the setting of several complex variables by Bun Wong and Rosay. It is discussed in 
Section 10.2 of [1]. 
It is an interesting exercise in a course on distributions and generalized functions to show that 
the assignment of $\sum_n f^{(n)}(0)$ to f is not a distribution on the space of infinitely differentiable 
functions of compact support on the real line. A solution is to multiply something like the 
exponential function by a “bump” function of compact support which is constantly equal to 1 on 
a neighborhood of 0 to create an f for which the series does not converge. One can actually prove 
much more without using such a “bump” function. If f is any nonzero, infinitely differentiable, 
compactly supported function, there must be a point x where the sequence $f^{(n)}(x)$ does not 
converge to 0. In fact there must be sequences of points converging to each endpoint of the 
support of f at each point of which the derivatives of f fail to remain bounded. This assertion will 
be derived from the following proposition about the relationship between analyticity and infinite 
differentiability. The proof of the assertion itself is a simple illustration of the use of complex 
methods to obtain real results. We conclude with some remarks about the situation for functions 
in the Schwartz space of $C^\infty$ functions of rapid decrease. 

If f is infinitely differentiable at a point t, let $r(f; t)$ be the radius of convergence of the formal 
Taylor series $\sum_{n=0}^{\infty} (f^{(n)}(t)/n!)(x - t)^n$. Recall that f is analytic at t if this series converges to 
$f(x)$ on some neighborhood of t. Let $C^\infty(a, b)$ be the functions infinitely differentiable on the 
open interval (a, b) and let $C_0^\infty(a, b)$ be the infinitely differentiable functions defined on the real 
line with compact support contained in the closed interval [a, b]. 


PROPOSITION. Suppose $f \in C^\infty(a, b)$ and $r(f; t) > \delta > 0$ for all $t \in (a, b)$. Then f is analytic 
on (a, b). 


This proposition may be found in [1, p. 183] and is proved in [4, Hilfs. 6]. We give here a proof 
which may be accessible to upper division mathematics students and shows the use of the Baire 
category theorem. 


LEMMA 1. Suppose $f \in C^\infty(c, d)$ and that for some N, 
$$\sup\{|f^{(n)}(s)/n!|^{1/n} : c < s < d \text{ and } n \ge N\} < \infty.$$
Then f is analytic on the open interval (c, d). 


Proof. Let M be the indicated supremum. If x and t are in (c, d) and $|x - t| < 1/M$, Taylor’s 
formula with remainder assures us that 

$$f(x) = \sum_{k=0}^{n} \frac{f^{(k)}(t)}{k!}(x - t)^k + \frac{f^{(n+1)}(s)}{(n+1)!}(x - t)^{n+1}$$

for some point s between x and t. If $n \ge N$, the remainder term is bounded by $|(x - t)M|^{n+1}$. 
Since this goes to 0 as n grows, we have the desired convergence and f is analytic at t. 


LEMMA 2. Suppose $f \in C^\infty(a, b)$; that $a < c < d < b$; and that f is analytic on (c, d). Suppose 
$r(f; t) > \delta$ for all t in [c, d] and that $c < x < d$. If $|x - c| < \delta$, then 

$$f(x) = \sum_k (f^{(k)}(c)/k!)(x - c)^k.$$

If $|x - d| < \delta$, then 

$$f(x) = \sum_k (f^{(k)}(d)/k!)(x - d)^k.$$

Proof. We prove the first case. The second is similar. Pick t between c and x. The series 

$$g(y) = \sum_n (f^{(n)}(t)/n!)(y - t)^n$$

defines a function g analytic for $t - \delta < y < t + \delta$. Since f is analytic on (c, d), g and f and all 
their derivatives agree on (c, x). Since they are continuous, they also agree at c. Thus 

$$f(x) = g(x) = \sum_n (g^{(n)}(c)/n!)(x - c)^n = \sum_n (f^{(n)}(c)/n!)(x - c)^n. \quad □$$


Proof of the Proposition. Suppose f is not analytic at a point $x_0$ in (a, b). Choose $[a', b'] \subset (a, b)$ 
with $x_0$ in $(a', b')$, and let $F = \{x \in [a', b'] : f \text{ is not analytic at } x\}$. Then F is a nonempty 
complete metric space. By the Cauchy-Hadamard formula for $r(f; x)$, the sets 

$$E_N = \{x \in [a', b'] : |f^{(n)}(x)/n!|^{1/n} \le 1/\delta \text{ for all } n \ge N\}$$

exhaust $[a', b']$. By the Baire category theorem, one of the closed sets $F \cap E_N$ has a nonempty interior 
relative to F. So there is an open interval I such that $\emptyset \ne I \cap F \subset E_N$. Without loss of generality, 
we can assume that I is contained in (a, b) and has length less than $\delta/2$. Any point s in $I \setminus F$ is 
contained in an open interval (c, d) on which f is analytic and one of whose endpoints, say c, is in 
$I \cap F$. By Lemma 2, 
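$$f(s) = \sum_{n=0}^{\infty} (f^{(n)}(c)/n!)(s - c)^n.$$

So, for $k \ge N$: 

$$|f^{(k)}(s)| = \left|\sum_{j=k}^{\infty} (f^{(j)}(c)/j!)\, j(j-1)\cdots(j-k+1)(s - c)^{j-k}\right|$$

$$\le \sum_{j} (1/\delta)^j\, j(j-1)\cdots(j-k+1)(\delta/2)^{j-k}$$

$$= (2/\delta)^k \sum_{j} j(j-1)\cdots(j-k+1)(1/2)^j$$

$$\le (2/\delta)^k (d/dx)^k \left(1/(1 - x)\right)\big|_{x=1/2}$$

$$= (2/\delta)^k \left(1/(1 - (1/2))\right)^{k+1} k!.$$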

Thus $|f^{(k)}(s)/k!|^{1/k}$ is bounded by $1/\delta$ on $F \cap I$ for $k \ge N$, and by $8/\delta$ on the rest of I. Lemma 
1 shows that f is analytic on I, contrary to the assumption that there were points of F in I. □ 

We now derive our original assertion from this proposition. Suppose f is a nonzero function in 
$C_0^\infty(a, b)$ and that a is the infimum of the support of f. If the derivatives $f^{(n)}(t)$ remain bounded 
at every point in some interval (a, c), then $r(f; t) = +\infty$ for all t in $(-\infty, c)$. The proposition 
shows that f extends from $(-\infty, c)$ to an entire function which is identically 0 on $(-\infty, a)$. The 
identity theorem for analytic functions shows that f would be 0 everywhere, which is impossible 
because a = inf supp(f). This contradiction gives the desired result. 

The situation is quite different if we relax the condition that f have compact support and 
require only that it be in the Schwartz space, $\mathcal{S}$, of functions of rapid decrease. Thus f and all its 
derivatives should go to 0 as $|x| \to \infty$ even when multiplied by any polynomial. There are 
functions in $\mathcal{S}$ for which the sequence of derivatives converges uniformly to 0. The key fact needed 
is that the Fourier transform and its inverse take $\mathcal{S}$ onto $\mathcal{S}$ [3, Th. IX.1]. Choose F in $C_0^\infty(-1, 1)$ to 
be an even function with nonnegative real values. Put 

$$f(z) = \check F(z) = (2\pi)^{-1/2} \int_{-1}^{1} F(t) e^{itz}\,dt.$$

Then f is in $\mathcal{S}$, and for real x and $k \ge 0$, we have 

$$f^{(k)}(x) = (2\pi)^{-1/2} \int_{-1}^{1} F(t) e^{itx} (it)^k\,dt.$$

These are real, and $|f^{(k)}(x)| \le C/(k + 1)$ for a constant C which depends on f but not on x. So, 
for such f, the derivatives converge uniformly to 0. 

There are restrictions on such a function. If $f \in C^\infty(R)$ and the sequence of derivatives is 
bounded at every point, the proposition shows that f extends to an entire function. Examination 
of the Taylor series around 0 shows that if $|f^{(k)}(0)| \le M$ for all k, then $|f(z)| \le Me^{|z|}$. So f is the 
restriction of an entire function of exponential type 1. This means that the function cannot decay 
too rapidly along the real axis and it must grow fairly rapidly in the complex plane if it is to be in 
$\mathcal{S}$. In particular it cannot grow more slowly than $e^{|z|^{1/2}}$ since an entire function of order less than 
1/2 cannot be bounded along any ray [2, Th. 3.1.5]. Since the function is of order at most 1, it 
cannot go to 0 faster than exponentially along the real axis [2, Cor. 5.1.14]. It can go to 0 more 
slowly and still be in $\mathcal{S}$. If it is, then it is certainly in $L^2(R)$ and the Paley-Wiener theorem says 
that f is the inverse Fourier transform of a $C^\infty$ function supported in $[-1, 1]$. So the construction 
above gives all such functions in $\mathcal{S}$. 

In summary we have: If $f \in C_0^\infty(R)$, then there are sequences of points converging to each 
endpoint of the support of f at each point of which the sequence of derivatives of f is unbounded. 
This is not true for all f in $\mathcal{S}$. However, for $f \in \mathcal{S}$, the sequence of derivatives is bounded at every 
point if and only if f is the inverse Fourier transform of a function in $C_0^\infty(-1, 1)$ and, in that case, 
the sequence of derivatives converges uniformly to 0. In fact, $|f^{(k)}(x)| = O(1/(k + 1))$ uniformly 
in x. 
Recently Russell [1] has given a simple proof of the fact that under certain boundary 
conditions the total variation of the product of two functions of bounded variation is less than or 
equal to the product of their total variations. More precisely, if f and g belong to BV*[a, b], where 
BV*[a, b] = {h : h is of bounded variation on [a, b] and h(a) = 0}, then $V(fg) \le V(f)V(g)$, 
where V(f) denotes the total variation of f on [a, b]. 

What follows is an even simpler proof of this theorem; it depends on three 
elementary facts. 


(a) If $f \in$ BV*[a, b], then f can be decomposed as $f = f_1 - f_2$, where $f_1$ and $f_2$ are increasing 
and $f_1(a) = f_2(a) = 0$. 

(b) A particular example of such a decomposition is when $f_1$ and $f_2$ are the positive and 
negative variations of f, respectively; let us call this the canonical decomposition of f. 

(c) For all such decompositions of f, $f_1(b) + f_2(b) \ge V(f)$, with equality in the case of the 
canonical decomposition. 

Our proof then runs as follows. Let $f = f_1 - f_2$, $g = g_1 - g_2$ be the canonical decompositions 
of f and g respectively; then 

$$fg = (f_1 g_1 + f_2 g_2) - (f_1 g_2 + f_2 g_1),$$

and the right-hand side is a decomposition of fg: each $f_i$ and $g_j$ is increasing and vanishes at a, 
hence is nonnegative, so each product $f_i g_j$ is increasing and vanishes at a. 
Hence, by (c), 

$$V(fg) \le (f_1(b)g_1(b) + f_2(b)g_2(b)) + (f_1(b)g_2(b) + f_2(b)g_1(b))$$
$$= (f_1(b) + f_2(b))(g_1(b) + g_2(b))$$
$$= V(f)V(g).$$
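
The inequality can be illustrated on sampled data. The sketch below is a discrete analogue rather than the theorem itself: it treats a finite list of samples as a function on a finite set of points, for which the same decomposition argument goes through. The names `total_variation` and `product_variation_gap`, and the particular sample functions, are made up for the illustration.

```python
import math


def total_variation(values):
    """Sum of the absolute jumps of a finite sequence of samples."""
    return sum(abs(b - a) for a, b in zip(values, values[1:]))


def product_variation_gap(f, g):
    """Return V(f)V(g) - V(fg) for two sample sequences of equal length."""
    fg = [x * y for x, y in zip(f, g)]
    return total_variation(f) * total_variation(g) - total_variation(fg)


if __name__ == "__main__":
    n = 200
    f = [math.sin(7 * k / n) for k in range(n + 1)]              # f(a) = 0
    g = [(k / n) * math.cos(11 * k / n) for k in range(n + 1)]   # g(a) = 0
    print(product_variation_gap(f, g))            # nonnegative
    # Without the boundary condition the inequality can fail:
    print(product_variation_gap([1, 1], [0, 1]))  # negative
```

The second call shows why the condition h(a) = 0 is needed: a nonzero constant has zero variation, yet multiplying by it does not kill the variation of the other factor.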
SERENDIPITY IN MATHEMATICS 
OR 
HOW ONE IS LED TO DISCOVER THAT 

$$\sum_{n=1}^{\infty} \frac{1 \cdot 3 \cdot 5 \cdots (2n-1)}{n\, 2^n\, n!} = \frac{1}{2} + \frac{3}{16} + \frac{15}{144} + \cdots = \ln 4$$

BERTRAM ROSS 
Department of Mathematics, University of New Haven, West Haven, CT 06516 


This paper chronicles the course of a mathematical discovery. It is usually the case in (alleged) 
accidental discovery that the starting point is far afield from the particular discovery. Our starting 
point is the evaluation of the integral 


(1)  $\int_0^x (x - t)^{\nu-1} \ln t\,dt$,  Re($\nu$) > 0. 


An integral of the form $\int_c^x (x - t)^{\nu-1} f(t)\,dt/\Gamma(\nu)$ is known as the Riemann-Liouville integral 
which is the cornerstone of the fractional calculus. More precisely stated, it is the integral which 
defines integration and differentiation to an arbitrary order. The R-L integral can be denoted by 
${}_cD_x^{-\nu} f(x)$, a notation devised by Harold T. Davis [1]. The subscripts on D denote the terminals of 
integration. The order of differintegration [2] can be rational, irrational or complex. When $\nu = n$, 
an integer, then ${}_cD_x^{-n} f(x)$ and ${}_cD_x^{n} f(x)$ are respectively ordinary integration and differentiation. 

The idea of differentiation to an arbitrary order started in 1695 when L’Hôpital asked Leibniz 
what would happen with $d^n y/dx^n$ if n = 1/2. Hence, the topic started with the misnomer 
fractional calculus [3]. 

One method of evaluating (1) is a series expansion followed by term by term integration. Let 


(2)  $t = x + t - x = x\left(1 + \frac{t - x}{x}\right)$, 

where x and t are real and x > 0. Then 

$$\ln t = \ln x + \ln\left(1 + \frac{t - x}{x}\right).$$

When $|(t - x)/x| < 1$, we can expand $\ln(1 + (t - x)/x)$ in a Taylor’s series expansion: 

(3)  $\ln t = \ln x + \sum_{n=1}^{\infty} \frac{(-1)^{n-1}(t - x)^n}{n x^n} = \ln x - \sum_{n=1}^{\infty} \frac{(x - t)^n}{n x^n}$, 

where the interval of convergence is $0 < t \le 2x$. 
When (3) is substituted into (1) we have 

$${}_0D_x^{-\nu} \ln x = \frac{\ln x}{\Gamma(\nu)} \int_0^x (x - t)^{\nu-1}\,dt - \frac{1}{\Gamma(\nu)} \int_0^x (x - t)^{\nu-1} \sum_{n=1}^{\infty} \frac{(x - t)^n}{n x^n}\,dt.$$


Thus, for integration of ln x to an arbitrary order we get 

(4)  ${}_0D_x^{-\nu} \ln x = \frac{x^{\nu} \ln x}{\Gamma(\nu + 1)} - \frac{x^{\nu}}{\Gamma(\nu)} \sum_{n=1}^{\infty} \frac{1}{n(\nu + n)}$,  Re($\nu$) > 0. 

Because of the property of analyticity, we can interchange $-\nu$ and $\nu$. So, for differentiation of 
ln x to an arbitrary order we will have 

(5)  ${}_0D_x^{\nu} \ln x = \frac{x^{-\nu} \ln x}{\Gamma(1 - \nu)} - \frac{x^{-\nu}}{\Gamma(-\nu)} \sum_{n=1}^{\infty} \frac{1}{n(-\nu + n)}$. 

The results in (4) and (5) can be shown to be consistent with the results of ordinary integration 
and differentiation for the case $\nu = 1$. 

This is where the matter stood for some time until there was sufficient time and inclination to 
pursue this work further. It seemed like a good idea to write both terms on the right sides of (4) 
and (5) with the same denominator. This requires the multiplication of the second term on the 
right in (4) by unity in the form of $\nu/\nu$ and in (5) by $-\nu/-\nu$. It was after playing with symbols 
and naively placing $\nu$ and $-\nu$ under the summation signs that pattern recognition occurred and 
the adrenalin started to flow. 

From the study of the psi function [4] we learn that 

$$\sum_{n=1}^{\infty} \frac{\nu}{n(n + \nu)} = C + \psi(1 + \nu),$$

where C is Euler’s constant and $\psi$ is the psi function, tables of which can be found in [5]. In terms 
of the psi function (4) and (5) become respectively 

(6)  ${}_0D_x^{-\nu} \ln x = \frac{x^{\nu}}{\Gamma(\nu + 1)} [\ln x - C - \psi(1 + \nu)]$,  Re($\nu$) > 0, 

and 

(7)  ${}_0D_x^{\nu} \ln x = \frac{x^{-\nu}}{\Gamma(1 - \nu)} [\ln x - C - \psi(1 - \nu)]$,  Re($\nu$) > 0. 

So far, nothing very spectacular has developed. The second step on the path toward mathe- 
matical discovery is the solution to an integral equation of the Volterra type. Armed with a 
formula for the derivative of arbitrary order of In x, I proceeded to solve 


(8)  $\frac{1}{\Gamma(1/2)} \int_0^x (x - t)^{-1/2} f(t)\,dt = \ln x$, 


where $\nu = 1/2$ was chosen arbitrarily. 

The integral is the Riemann-Liouville integral and the integral equation can be solved by 
Laplace transforms. However, the use of the operator notation of the fractional calculus shows its 
elegance and power by virtue of its simplicity and succinctness. An interested reader can compare 
the following with the method of Laplace transforms. We write (8) as 


(9)  ${}_0D_x^{-1/2} f(x) = \ln x$. 

Operating on both sides of the above with ${}_0D_x^{1/2}$ gives 

(10)  $f(x) = {}_0D_x^{1/2} \ln x$. 

The use of formula (7) with $\nu = 1/2$ gives us at once 

(11)  $f(x) = \frac{x^{-1/2}}{\Gamma(1/2)} [\ln x - C - \psi(1/2)]$. 

Also, from the list of special values of the psi function [6] we see that 

(12)  $\psi(1/2) = -C - \ln 4$. 

The solution to (8) is then 

(13)  $f(x) = \frac{\ln 4x}{\sqrt{\pi x}}$. 


The third step on our path is where the “serendipitous” event occurs and which is given in the 
title. Many hours were wasted because of slight mistakes in sign and algebra, not to speak of a 
ream of paper in the waste basket. I could not let (13) rest without verifying it. 

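The solution (13) is substituted into (8) and we have to show 

$$\frac{1}{\pi} \int_0^x (x - t)^{-1/2} t^{-1/2} \ln 4t\,dt = \ln x,$$

that is, 

(14)  $\frac{\ln 4}{\pi} \int_0^x (x - t)^{-1/2} t^{-1/2}\,dt + \frac{1}{\pi} \int_0^x (x - t)^{-1/2} t^{-1/2} \ln t\,dt = \ln x$. 

The expansion for ln t given in (3) is substituted into (14) and we have 

(15)  $\frac{\ln 4}{\pi} \int_0^x (x - t)^{-1/2} t^{-1/2}\,dt + \frac{\ln x}{\pi} \int_0^x (x - t)^{-1/2} t^{-1/2}\,dt - \frac{1}{\pi} \int_0^x (x - t)^{-1/2} t^{-1/2} \sum_{n=1}^{\infty} \frac{(x - t)^n}{n x^n}\,dt = \ln x$. 

We recall the beta integral 

(16)  $\int_0^x (x - t)^a t^d\,dt = \frac{\Gamma(d + 1)\Gamma(a + 1)}{\Gamma(d + a + 2)} x^{d+a+1}$,  $d > -1$, $a > -1$. 

The first and second terms in (15) have the values ln 4 and ln x respectively. The third integral 
is integrated term by term. The use of formula (16) and the recurrence formula for the gamma 
function yields 

$$\frac{1}{2 \cdot 1!} + \frac{1 \cdot 3}{2 \cdot 2^2 \cdot 2!} + \frac{1 \cdot 3 \cdot 5}{3 \cdot 2^3 \cdot 3!} + \cdots.$$

In summation form the value of the third term in (15) is 

(17)  $\sum_{n=1}^{\infty} \frac{1 \cdot 3 \cdot 5 \cdots (2n - 1)}{n\, 2^n\, n!}$. 

In order to satisfy (15) we “discover” that 

(18)  $\sum_{n=1}^{\infty} \frac{1 \cdot 3 \cdot 5 \cdots (2n - 1)}{n\, 2^n\, n!} = \ln 4$. 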


The matter does not end at this point. The series will be put into a computer. Before doing so, 
and to guard against error in the development of (18), the series is tested for convergence. When 
the series is put into the form whose general term is $(1/\sqrt{\pi})\,\Gamma(n + 1/2)/(n\,\Gamma(n + 1))$, d’Alembert’s 
ratio test fails but Raabe’s test is decisive. The series is then put into the computer without trying 
to improve on the rapidity of convergence and without regard to the rate of error propagation. 
After 682,783 terms, at which point the difference between consecutive terms is less than 10°”, 
the series summed to 1.385... , whereas the known value for In 4 is 1.386... . [8] 
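
The slow convergence reported above is easy to reproduce. The following is a minimal sketch of such a direct summation, not the author's program; the function name `partial_sum` is an assumption. It avoids large factorials by updating the factor $(2n-1)!!/(2^n n!)$ from one term to the next.

```python
import math


def partial_sum(terms):
    """Sum of (1*3*5*...*(2n-1)) / (n * 2^n * n!) for n = 1..terms."""
    ratio = 1.0   # holds (2n-1)!! / (2^n n!), updated term by term
    total = 0.0
    for n in range(1, terms + 1):
        ratio *= (2 * n - 1) / (2 * n)
        total += ratio / n
    return total


if __name__ == "__main__":
    for terms in (10, 1000, 100000):
        s = partial_sum(terms)
        print(terms, s, math.log(4) - s)
```

The terms are positive and decay only like $n^{-3/2}$, so the partial sums approach ln 4 from below and the shortfall shrinks roughly like $1/\sqrt{N}$, which is consistent with the three-digit agreement quoted in the text.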

The relation (18) was obtained by deduction. So, as one thing leads to another, the thought 
arose that once a relation is discovered indirectly it might be a good idea to prove it directly. A 
starting point is the expression 


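$$\frac{1 \cdot 3 \cdot 5 \cdots (2n - 1)}{2^n}$$

which is almost like the identity 

$$\Gamma(n + 1/2) = \frac{1 \cdot 3 \cdot 5 \cdots (2n - 1)}{2^n} \sqrt{\pi}. \quad [7]$$

The term $\Gamma(n + 1/2)$ also appears in the evaluation of a definite integral. From the study of the 
beta integral we have 

$$\int_0^{\pi/2} \sin^{2n} x\,dx = \frac{1}{2} \frac{\Gamma(n + 1/2)\Gamma(1/2)}{\Gamma(n + 1)} = \frac{\sqrt{\pi}}{2} \frac{\Gamma(n + 1/2)}{n!}.$$

Denoting the series (17) by S we can write 

$$S = \frac{2}{\pi} \sum_{n=1}^{\infty} \int_0^{\pi/2} \frac{\sin^{2n} x}{n}\,dx.$$

We can, in this instance, interchange the summation sign and the integral sign getting 

$$S = \frac{2}{\pi} \int_0^{\pi/2} \left[\sin^2 x + \frac{\sin^4 x}{2} + \frac{\sin^6 x}{3} + \cdots\right] dx.$$

Observing that the integrand above is the Maclaurin series expansion for $-\ln(1 - \sin^2 x)$, it 
follows that 

$$S = -\frac{2}{\pi} \int_0^{\pi/2} \ln(1 - \sin^2 x)\,dx = -\frac{4}{\pi} \int_0^{\pi/2} \ln \cos x\,dx = -\frac{4}{\pi}\left(-\frac{\pi}{2} \ln 2\right) = \ln 4.$$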


There are several other series expansions for In 4 that can be found in the literature. The result 
(18) is not very important in the scheme of things although the result might be of use in 
accelerating the rapidity of convergence of a similar series because its sum is known. The purpose 
of this paper is not to introduce a new result. The primary purposes are to present an exposition 
of techniques that could be more widely known and to exemplify the process of mathematical 
discovery. 

The result (18) was unexpected but could not have been achieved without ability and a 
prepared mind. In mathematics the word serendipity does not have the ordinary meaning “lucky 
accident.” 
An asterisk (*) indicates that neither the proposer nor the editors supplied a solution. 

Solutions should be sent to the addresses given at the head of each problem set. 

A publishable solution must, above all, be correct. Given correctness, elegance and conciseness are preferred. 
Solutions of these Elementary Problems should be mailed in duplicate to Professor G. L. Alexanderson, 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by February 29, 1984. Please 
place the solver’s name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label 
(for acknowledgement). 


E 3013. Proposed by Stanley Rabinowitz, Digital Equipment Corp. 


Let ABC be a fixed triangle in the plane. Let T be the transformation of the plane that maps a 
point P into its isotomic conjugate (relative to ABC). Let G be the transformation that maps P 
into its isogonal conjugate. Prove that the mappings TG and GT are affine collineations (linear 
transformations). 


E 3014. Proposed by Lorraine L. Foster, California State University, Northridge. 


Prove that the congruence $3^x \equiv 19 \pmod{2^n}$ is solvable for n > 1. 
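
Before attempting a proof it can help to look at small cases. The brute-force search below is only exploratory evidence, not a solution; the function name `smallest_exponent` and the range of n tried are assumptions made for the illustration.

```python
def smallest_exponent(n, target=19, base=3):
    """Smallest x >= 0 with base**x congruent to target mod 2**n, or None."""
    modulus = 2 ** n
    goal = target % modulus
    power = 1 % modulus
    for x in range(modulus):
        if power == goal:
            return x
        power = (power * base) % modulus
    return None


if __name__ == "__main__":
    for n in range(1, 17):
        print(n, smallest_exponent(n))
```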


E 3015. Proposed by Ioan Tomescu, University of Bucharest. 
Let h > 2 be an integer and let a(h) > h be the unique solution of the equation 


x + x fo... x _y Gd) 
x- 1 x-2 x-h+1 h! 


where (x), = x(x — 1)---(x — A + 1). Show that 


1+ 


1 
lim, n0(h)(2h + >In h| = 1. 

E 3016. Proposed by Eugene Levine, Adelphi University. 


Let $a_1, a_2, \dots, a_n$ be positive numbers (n > 2), let $s = a_1 + a_2 + \cdots + a_n$, and let $0 < \beta < 1$. 
Prove that 

$$\sum_{k=1}^{n} \left(\frac{s - a_k}{a_k}\right)^{\beta} \ge (n - 1)^{2\beta} \sum_{k=1}^{n} \left(\frac{a_k}{s - a_k}\right)^{\beta}.$$

Further, show that equality holds iff $a_1 = a_2 = \cdots = a_n$. 


E 3017. Proposed by Charles W. Schelin, University of Wisconsin, La Crosse. 


Let f(z) be a real polynomial of degree n > 1 such that $f(-1)f(1) \ne 0$. Put 
$L = -f'(-1)/f(-1)$ and $R = f'(1)/f(1)$. Show that: 
(a) If L < n/2 or R < n/2, then f has a zero in |z| > 1. 
(b) If L > n/2 or R > n/2, then f has a zero in |z| < 1. 


E 3018. Proposed by A. M. Nadel, Pomona College. 


A nonempty set S of positive integers is said to be eventually linear if there exist integers N and 
k such that for all n > N, $n \in S$ if and only if $k \mid n$. Show that any nonempty set of positive 
integers that is closed under addition is eventually linear. 


SOLUTIONS OF ELEMENTARY PROBLEMS 
A Definite Integral Involving a Parameter 


E 2910 [1981, 705]. Proposed by Tian Jing Huang, Szechwan University, People’s Republic of 
China. 


Prove that the function 

$$A(h) = \int_0^k [1 - (h - \cos x)^2]^{1/2}\,[4(h - \cos x)^2 - 1]\,dx, \quad k = \arccos(h - 1),$$

has at least two zeros in the interval (0, 2). 


Solution by William A. Newcomb, Lawrence Livermore National Laboratory. We have first 

$$A(0) = \int_0^{\pi} \sin x\,(4\cos^2 x - 1)\,dx = \frac{2}{3} > 0.$$

We have also A(h) > 0 for $3/2 \le h < 2$, since the integrand in this case is positive throughout the 
range of integration. Now let $w = 1 - 2\cos x$. Then, by straightforward calculation, 

$$A(1) = \int_{-1}^{1} F(w)\,dw,$$

where $F(w) = \frac{1}{2} w(2 + w) \left[\frac{(3 + w)(1 - w)}{(3 - w)(1 + w)}\right]^{1/2}$. Finally, since 

$$F(w) + F(-w) = -\frac{w^2(1 + w^2)}{[(9 - w^2)(1 - w^2)]^{1/2}},$$

we see that A(1) < 0. But A(h) is continuous and must therefore have at least one zero in each of 
the intervals (0, 1) and (1, 3/2). 


Gaitley found h = 0.2766, 1.2547. 
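
The sign pattern used in the solution, and the approximate location of the zeros, can be explored numerically. The sketch below assumes the integrand $[1 - (h - \cos x)^2]^{1/2}[4(h - \cos x)^2 - 1]$ as reconstructed from the value A(0) = 2/3, and uses a plain midpoint rule with bisection; the names `A` and `bisect_zero`, the step count, and the tolerance are illustrative choices, not Gaitley's method.

```python
import math


def A(h, steps=20000):
    """Midpoint-rule approximation of A(h) for 0 <= h < 2."""
    k = math.acos(h - 1.0)
    dx = k / steps
    total = 0.0
    for i in range(steps):
        u = h - math.cos((i + 0.5) * dx)
        # max(...) guards against tiny negative values caused by rounding
        total += math.sqrt(max(0.0, 1.0 - u * u)) * (4.0 * u * u - 1.0)
    return total * dx


def bisect_zero(lo, hi, tol=1e-6):
    """Bisection on A, assuming A(lo) and A(hi) have opposite signs."""
    a_lo = A(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        a_mid = A(mid)
        if (a_mid > 0) == (a_lo > 0):
            lo, a_lo = mid, a_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    print(A(0.0), A(1.0), A(1.5))
    print(bisect_zero(0.0, 1.0), bisect_zero(1.0, 1.5))
```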


Also solved by B. J. Gaitley, D. Hensley, G. A. Heuer, O. P. Lossers (Netherlands), M. Woltermann, and the 
proposer. 


Numerators and Denominators of Bernoulli Numbers 


E 2912 [1981, 705]. Proposed by Barry Powell, Kirkland, WA. 

Let $N_{2m}$, $D_{2m}$ be the numerator and denominator of the 2mth Bernoulli number $B_{2m}$, defined 
by $\tan z = \sum (-1)^{n-1} 2^{2n} (2^{2n-1} - 1) B_{2n} z^{2n-1}/(2n)!$. 

Prove: (a) If p is an odd prime, $N_{2p}/p \equiv 1 \pmod p$. (b)* For any even positive integer k, there 
exist infinitely many even positive integers r, s, t, ... such that $N_k \mid N_r$, $N_r \mid N_s$, ... . 

Solution to part (a) by Ira Gessel, Massachusetts Institute of Technology. Note that the assertion 
is false for p = 3, since $B_6 = \frac{1}{42}$. We prove it for odd primes greater than 3. 

By von Staudt’s theorem (see, e.g., G. H. Hardy and E. M. Wright, An Introduction to the 
Theory of Numbers, Fourth Edition, p. 91) $D_{2m}$ is the product of all primes q such that q − 1 
divides 2m. Thus $D_{2p}$ is either 6 or 6(2p + 1). In either case $D_{2p} \equiv 6 \pmod p$. By Kummer’s 
congruence (J. Reine Angew. Math., 41 (1851), 368-372), if p − 1 does not divide 2m and if 
2m > p, then 

$$B_{2m}/(2m) \equiv B_{2m-p+1}/(2m - p + 1) \pmod p.$$

Thus 

$$B_{2p}/(2p) \equiv B_2/2 = 1/12 \pmod p,$$

so 

$$N_{2p}/p = (B_{2p}/(2p)) \cdot 2D_{2p} \equiv (1/12) \cdot 2 \cdot 6 = 1 \pmod p.$$


It may be noted that the definition given for the Bernoulli numbers is incorrect: the formula 
should be 

\[ \tan z = \sum (-1)^{n-1} 2^{2n} (2^{2n} - 1) B_{2n} \frac{z^{2n-1}}{(2n)!}. \]

The Bernoulli numbers are usually defined by the much simpler formula 

\[ z/(e^z - 1) = \sum_{n=0}^{\infty} B_n z^n / n!. \]
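
Part (a) is easy to test by machine for small primes. The following sketch computes Bernoulli numbers exactly from the generating function $z/(e^z - 1)$ by the standard recurrence $\sum_{j=0}^{m} \binom{m+1}{j} B_j = 0$ and then checks the congruence; the helper names `bernoulli_numbers` and `check_part_a` are illustrative only.

```python
from fractions import Fraction
from math import comb

def bernoulli_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] with z/(e^z - 1) = sum B_n z^n / n!."""
    B = [Fraction(1)]
    for m in range(1, n_max + 1):
        # Recurrence: sum_{j=0}^{m} C(m+1, j) * B_j = 0 for m >= 1.
        acc = sum(comb(m + 1, j) * B[j] for j in range(m))
        B.append(-acc / (m + 1))
    return B

def check_part_a(p, B):
    """True when p divides N_{2p} and N_{2p}/p is congruent to 1 mod p."""
    N = B[2 * p].numerator
    return N % p == 0 and (N // p) % p == 1

if __name__ == "__main__":
    B = bernoulli_numbers(60)
    for p in (3, 5, 7, 11, 13, 17, 19, 23, 29):
        print(p, B[2 * p], check_part_a(p, B))
```

For $p = 3$ the check fails, in agreement with the remark that $B_6 = 1/42$; for instance $B_{10} = 5/66$ gives $N_{10}/5 = 1$.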


Solution to part (b) by L. E. Mattics, University of Southern Alabama. To answer (b) it suffices 
to show that for $N_k > 5$, $N_k \mid N_r$ where $2 \mid k$ and $r = \phi(N_k^2) + k$. Indeed, with von Staudt’s theorem 
and the formula $B_n(x) = \sum_{j=0}^{n} \binom{n}{j} x^{n-j} B_j$ for Bernoulli polynomials, it is a number-theoretic 
exercise to deduce that 

(1) $B_{r+1}(N_k)/(r + 1) \equiv B_r N_k \pmod{N_k^2}$, 

(2) $B_{k+1}(N_k)/(k + 1) \equiv B_k N_k \pmod{N_k^2} \equiv 0 \pmod{N_k^2}$, 

and if $p^s \| N_k$ with $p$ prime and $s \ge 1$, 

(3) $B_{k+1}(N_k/p) \equiv 0 \pmod{p^{2s-2}}$. 

Now if $p$ is prime, $p^s \| N_k$, $s \ge 1$, then 

\[ \begin{aligned}
\sum_{i=1}^{N_k - 1} i^r &\equiv \sum_{\substack{i=1 \\ (i,p)=1}}^{N_k - 1} i^r && \text{since } r \ge 2s \\
&\equiv \sum_{\substack{i=1 \\ (i,p)=1}}^{N_k - 1} i^k && \text{since } r = k + \phi(N_k^2) \\
&\equiv \sum_{\substack{i=1 \\ (i,p)=1}}^{N_k - 1} i^k + \sum_{i=1}^{(N_k/p) - 1} p^k i^k && \text{by (3)} \\
&= \sum_{i=1}^{N_k - 1} i^k \pmod{p^{2s}}.
\end{aligned} \]

Hence by (1) and (2), 
$B_r N_k \equiv B_k N_k \equiv 0 \bmod N_k^2$ and we are done. 
Part (a) was also solved by L. E. Mattics, the proposer and by S. Wagstaff, who forwarded important comments. 


$3^n = 1 + 2^{n_1} + 2^{n_2} + 2^{n_3}$ 

E 2915 [1981, 763]. Proposed by Leo J. Alex, SUNY at Oneonta, NY. 


Let $k$ be fixed. It is conjectured that $3^n = \sum_{i=1}^{k} 2^{n_i}$ has only finitely many integral solutions 
$(n, n_1, n_2, \ldots, n_k)$. This is clear if $k = 1$; Størmer proved it if $k = 2$, and S. S. Pillai if $k = 3$. Prove 
the assertion for $k = 4$. 


Remark by E. G. Straus, University of California at Los Angeles. The “conjecture” is a corollary 
of a Theorem of H. G. Senge and mine [Theorem 3, PV-Numbers and Sets of Multiplicity, 
Periodica Math. Hungarica, 3 (1973) 93-100] which shows that, if there are infinitely many 
integers that can be expressed in two different integral bases $b_1$, $b_2$ with sums of digits below a 
fixed bound $k$, then $b_1$ and $b_2$ are powers of the same integer $b$. 
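
In the language of the remark, $3^n$ has digit sum 1 in base 3, so the conjecture concerns how often its base-2 digit sum stays bounded. The sketch below tabulates that digit sum; it assumes the exponents $n_i$ are distinct, so that a solution with $k$ terms corresponds to a binary digit sum of exactly $k$. The function names are illustrative, and a finite search of this kind proves nothing about large $n$.

```python
def binary_digit_sum(m):
    """Number of ones in the binary expansion of m."""
    return bin(m).count("1")

def powers_of_three_with_digit_sum(k, n_max):
    """All n in [0, n_max] such that 3**n has exactly k binary ones."""
    return [n for n in range(n_max + 1) if binary_digit_sum(3 ** n) == k]

if __name__ == "__main__":
    for k in range(1, 5):
        print(k, powers_of_three_with_digit_sum(k, 500))
```

For example, $27 = 16 + 8 + 2 + 1$ has four binary ones, so $n = 3$ appears in the list for $k = 4$.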


Also solved by M. Bencze (Romania), L. L. Foster, L. E. Mattics, St. Olaf Problem Group, and the proposer. 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor G. L. Alexanderson, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by February 29, 1984. The solver’s full 
post-office address should be on each sheet. 

6439. Proposed by Morton Brown, University of Michigan. 

Let $\{a_n\}$ be a sequence of real numbers satisfying the relation $a_{n+1} = |a_n| - a_{n-1}$. Prove that 
$\{a_n\}$ is periodic with period 9. 

6440. Proposed by M. S. Klamkin, J. McGregor and A. Meir, University of Alberta. 

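Let $F(x)$, $G(x)$ be two functions in $L_1(-\infty, \infty)$ which satisfy 

\[ \int_{-\infty}^{\infty} F(x)\, dx = \int_{-\infty}^{\infty} G(x)\, dx = 1. \]

Show that for any $\lambda$ in (0, 1) there is a set $E \subset (-\infty, \infty)$ such that 

\[ \int_E F(x)\, dx = \int_E G(x)\, dx = \lambda. \]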


6441. Proposed by F. W. Schmidt and R. Simion, Southern Illinois University. 


Let $a_1 < a_2 < \cdots < a_n$ be $n$ distinct positive integers. Show that the rational function 

\[ \prod_{1 \le i < j \le n} \frac{x^{a_i} - x^{a_j}}{x^{i} - x^{j}} \]

is actually a polynomial. 
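

The claim of Problem 6439 can be explored experimentally before attempting a proof. The sketch below iterates the recurrence $a_{n+1} = |a_n| - a_{n-1}$ in exact rational arithmetic and tests whether the orbit repeats after nine steps; the names `orbit` and `has_period_nine` are illustrative, and a finite check on sample starting values is of course not a proof.

```python
from fractions import Fraction

def orbit(a0, a1, length):
    """First `length` terms of a_{n+1} = |a_n| - a_{n-1}, computed exactly."""
    seq = [Fraction(a0), Fraction(a1)]
    while len(seq) < length:
        seq.append(abs(seq[-1]) - seq[-2])
    return seq

def has_period_nine(a0, a1, length=40):
    """Check a_{n+9} == a_n over the first `length` terms."""
    seq = orbit(a0, a1, length)
    return all(seq[i + 9] == seq[i] for i in range(length - 9))

if __name__ == "__main__":
    print(orbit(1, 2, 11))
    print(has_period_nine(Fraction(7, 3), Fraction(-11, 5)))
```

Starting from $(a_0, a_1) = (1, 2)$ the orbit is 1, 2, 1, -1, 0, 1, 1, 0, -1, after which the pair (1, 2) returns.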


SOLUTIONS OF ADVANCED PROBLEMS 
Flexible Linkages of Polygons 


6367 [1981, 711]. Proposed by A. Ehrenfeucht and J. Mycielski, University of Colorado. 


Let A be a finite collection of distinct but possibly overlapping regular n-gons of the same size 

on the plane such that every vertex of every n-gon of A is a vertex of exactly two n-gons of A. 
(a) Construct a collection A of $2n$ n-gons such that, even if the n-gons are rigid, A is flexible. 
(b)* For which n is a rigid A possible? 


Solution to part (a) by Kit Hanes, Eastern Washington University. We establish a more general 
result. Let $B$ be any n-gon with vertices $B_1, \ldots, B_n$ and let $C$ be any m-gon with vertices 
$C_1, \ldots, C_m$. Let $\theta$ be the angle between $B_1B_2$ and $C_1C_2$. Then, for $i \in \{1, \ldots, n\}$ let $\mathcal{C}_i = B_i + C$ 
and for $j \in \{1, \ldots, m\}$ let $\mathcal{B}_j = B + C_j$. Let $A = \{\mathcal{C}_1, \ldots, \mathcal{C}_n, \mathcal{B}_1, \ldots, \mathcal{B}_m\}$. $A$ is a collection of 
$n + m$ polygons each of which is a translate of either $B$ or $C$. Any vertex of any polygon in $A$ is a 
point $B_i + C_j$, which is a vertex just of $\mathcal{B}_j$ and of $\mathcal{C}_i$. This construction is valid independently of $\theta$; 
as $\theta$ varies $A$ flexes. 
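

The incidence property of this construction (every vertex belongs to exactly two polygons) can be checked on a small example. The sketch below is an illustration under stated assumptions: it uses a triangle and a square with integer coordinates rather than regular polygons, chosen so that the sums $B_i + C_j$ are pairwise distinct and the arithmetic is exact; the function names are hypothetical.

```python
from collections import Counter

def translate(polygon, v):
    """Translate a polygon (list of (x, y) vertices) by the vector v."""
    return [(x + v[0], y + v[1]) for (x, y) in polygon]

def linkage(B, C):
    """One translate of C per vertex of B, and one translate of B per vertex of C."""
    copies_of_C = [translate(C, b) for b in B]
    copies_of_B = [translate(B, c) for c in C]
    return copies_of_C + copies_of_B

def vertex_multiplicities(polygons):
    """Count how many polygons of the collection each vertex belongs to."""
    return Counter(v for poly in polygons for v in poly)

if __name__ == "__main__":
    B = [(0, 0), (10, 0), (0, 10)]           # assumed sample triangle
    C = [(0, 0), (1, 0), (1, 1), (0, 1)]     # assumed sample square
    A = linkage(B, C)
    print(len(A), set(vertex_multiplicities(A).values()))
```

With these sample polygons the collection has 3 + 4 = 7 members and 12 distinct vertices. Rotating C before forming the translates changes the angle between the two families without changing any incidence, which is the flexing described in the solution.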


Also solved by Michael Goldberg and the proposers. 


Part (b). The proposers state that they know of only two cases when A as defined in the 
problem is rigid, namely n = 2 (a triangle) and n = 4 (a configuration of 12 squares inside a 
regular 12-gon whose outer edges form the 12-gon). 


ANSWER TO PHOTOS ON PAGE 517 


They are all outstanding mathematicians, and they were all presidents of the A.M.S. Here are 
their names, followed by the years of their presidency. Top row: J. von Neumann (1951-1952), 
R. L. Wilder (1955-1956); bottom row: J. L. Doob (1963-1964), Oscar Zariski (1969-1970). 
A Sum of Reciprocals of Euler Totients 


6370* [1981, 769]. Proposed by P. Erdős, Hungarian Academy of Sciences. 
For each set of positive integers $\{a_i\}$ such that $\sum a_i^{-1} \le 1$, define $f_i(n) := \sum 1/\phi(a_i)$, the sum 
being extended over those $a_i \le n$. Set $f(n) = \max f_i(n)$, the maximum being taken over the 
admissible sets $\{a_i\}$; $\phi$ is Euler’s totient. Estimate $f(n)$ as well as you can. 


Solution by M. G. Hickman, New Mexico State University. Let 

\[ a = p_1 p_2 \cdots p_k \le n < p_1 p_2 \cdots p_{k+1}, \]

where $p_1, p_2, \ldots$ are the primes in increasing order. Then 

\[ \frac{\phi(a)}{a} = \prod_{i=1}^{k} \left( 1 - \frac{1}{p_i} \right). \]

Further, if $m \le n$ and the distinct prime factors of $m$ are $q_1, q_2, \ldots, q_j$, then $j \le k$ and hence 

\[ \frac{\phi(m)}{m} = \prod_{i=1}^{j} \left( 1 - \frac{1}{q_i} \right) \ge \prod_{i=1}^{k} \left( 1 - \frac{1}{p_i} \right) = \frac{\phi(a)}{a}. \]

It follows that 

\[ f(n) \le \frac{a}{\phi(a)}. \]

If repetitions of the integers $a_i$ are allowed in the definition of $f_i(n)$, we obtain 

\[ f(n) = \frac{a}{\phi(a)} \]

by taking $a_i = a$ for $i = 1, 2, \ldots, a$ and observing that, in this case, $f_i(n) = a/\phi(a)$. If repetitions 
are not permitted, the above yields only an upper bound on $f(n)$. 
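

The bound in this solution is easy to evaluate. The sketch below finds the largest product of consecutive initial primes not exceeding $n$ and returns the ratio of that product to its totient; the function names `totient`, `largest_primorial_at_most`, and `upper_bound` are illustrative, and the trial-division routines are only meant for small $n$.

```python
from fractions import Fraction

def totient(m):
    """Euler's totient, by trial division."""
    result, n, p = m, m, 2
    while p * p <= n:
        if n % p == 0:
            while n % p == 0:
                n //= p
            result -= result // p
        p += 1
    if n > 1:
        result -= result // n
    return result

def largest_primorial_at_most(n):
    """Largest product p_1 p_2 ... p_k of initial primes that is <= n."""
    a, p = 1, 2
    while True:
        if all(p % q for q in range(2, int(p ** 0.5) + 1)):  # p is prime
            if a * p > n:
                return a
            a *= p
        p += 1

def upper_bound(n):
    """The ratio a / phi(a) for the largest primorial a <= n."""
    a = largest_primorial_at_most(n)
    return Fraction(a, totient(a))

if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, largest_primorial_at_most(n), upper_bound(n))
```

For $n = 100$ the relevant product is $2 \cdot 3 \cdot 5 = 30$, and the ratio is $30/8 = 15/4$; no $m \le 100$ has a larger value of $m/\phi(m)$.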


Closed and Connected Graphs 


6373 [1981, 769]. Proposed by F. S. Cater, Portland State University. 


By an “order space” we mean a totally ordered nonvoid set endowed with the open interval 
topology. Let X be an order space, and let Y be either an order space or a locally compact 
Hausdorff space. Let f be a mapping of X into Y that is not everywhere continuous, and let G 
denote the graph of f, $G = \{(x, f(x)) : x \in X\}$. 

(i) Prove that G is not both a closed and connected subset of the product space $X \times Y$. 
(Compare problem 6255 [1980, 679] in this MONTHLY.) 

(ii) Show, by example, that (i) need not hold for complete connected metric spaces X and 
Y = f(X), if X is not an order space. 


Solution to part (i) by the proposer. Suppose that G is closed and connected and f is 
discontinuous at x. There is a neighbourhood V of f(x) such that for any neighbourhood U of x, 
there is an $x' \in U$ with $f(x') \notin V$. By reducing the neighbourhood V of f(x), if necessary, we can 
(and do) assume that bdry V is compact. (Note here that, if Y is an order space, intervals have 
boundaries consisting of at most two points.) 

We claim that there is a neighbourhood $U_0$ of x such that $f(U_0) \subset V \cup \operatorname{ext} V$. Assume not. For 
each neighbourhood U of x, pick $x_U \in U$ with $f(x_U) \in \operatorname{bdry} V$. Let $\mathcal{U}$ denote the set of 
neighbourhoods of x directed by inclusion. Then $\{f(x_U)\}$ is a net on this directed set, and a subnet 
of $\{f(x_U)\}$ converges to y, say, in the compact set bdry V. It follows that the point (x, y) is in the 
closure of G. But $(x, f(x)) \in G$ and $y \ne f(x)$. This contradicts the hypothesis that f is a function 
and $G = \overline{G}$. 

We reduce $U_0$ to an interval $[a, b]$, where $x \in \operatorname{int}[a, b]$, $f[a, b] \subset V \cup \operatorname{ext} V$. But some 
$c \in \operatorname{int}[a, b]$ satisfies $f(c) \notin V$. Say $c < x$. (The case $c > x$ is analogous.) Then $f(x) \in V$, 
$f(c) \in \operatorname{ext} V$, $f[c, x] \subset V \cup \operatorname{ext} V$. The following two sets are disjoint and open in $X \times Y$: 

\[ ((c, x] \times V) \cup ((x, \infty) \times Y), \qquad ([c, x) \times \operatorname{ext} V) \cup ((-\infty, c) \times Y). \]

The union of these sets contains the graph G. The first set contains the point (x, f(x)) and the 
second contains (c, f(c)). Thus G is not connected, contrary to hypothesis. 


Solution to part (ii) by D. E. Sanderson, Iowa State University. Let X be the unit circle and in 
polar coordinates set $f(1, \theta) = 2\pi/(2\pi - \theta)$. Then $Y = f(X) = [1, \infty)$ and f is discontinuous at 
(1, 0), yet its graph is closed and connected. 


A Sequence Converging to $\pi$ 


6375 [1982, 65]. Proposed by Chang Gengzhe, University of Utah. 


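Set 

\[ S_n = 1 + \frac{n-1}{n+2} + \left( \frac{n-1}{n+2} \right) \left( \frac{n-2}{n+3} \right) + \cdots + \left( \frac{n-1}{n+2} \right) \left( \frac{n-2}{n+3} \right) \cdots \left( \frac{1}{2n} \right). \]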


Show that 


Composite solution. We have 

\[ S_n = \frac{1}{2} \binom{2n}{n-1}^{-1} \left( 2^{2n} - \binom{2n}{n} \right) = 2^{2n-1} \binom{2n}{n-1}^{-1} - \frac{n+1}{2n}. \]


Application of Stirling’s approximation formula to the binomial coefficient yields the desired 
result. 
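

The closed form can be compared with the defining sum in exact arithmetic. The sketch below assumes the reading $S_n = 1 + \frac{n-1}{n+2} + \frac{(n-1)(n-2)}{(n+2)(n+3)} + \cdots$ with $n$ terms and the closed form $2^{2n-1}/\binom{2n}{n-1} - (n+1)/(2n)$; the function names `S` and `closed_form` are illustrative.

```python
from fractions import Fraction
from math import comb, pi, sqrt

def S(n):
    """The n-term sum 1 + (n-1)/(n+2) + (n-1)(n-2)/((n+2)(n+3)) + ..."""
    total, term = Fraction(0), Fraction(1)
    for j in range(1, n + 1):
        total += term                        # term holds a product of j - 1 factors
        term *= Fraction(n - j, n + 1 + j)   # append the factor (n - j)/(n + 1 + j)
    return total

def closed_form(n):
    """2^(2n-1) / C(2n, n-1) - (n+1)/(2n)."""
    return Fraction(2 ** (2 * n - 1), comb(2 * n, n - 1)) - Fraction(n + 1, 2 * n)

if __name__ == "__main__":
    for n in (1, 2, 10, 100, 1000):
        print(n, S(n) == closed_form(n), float(closed_form(n)) / sqrt(n), sqrt(pi) / 2)
```

By Stirling's formula $\binom{2n}{n-1} \sim 4^n/\sqrt{\pi n}$, so the printed ratio $S_n/\sqrt{n}$ should approach $\sqrt{\pi}/2$, and the table shows how slowly it does so.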


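Solved by 52 solvers including the proposer. Some solvers used the Wallis formula 

\[ \pi = \lim_{n \to \infty} \frac{1}{n} \prod_{j=1}^{n} \left( 1 + \frac{1}{2j - 1} \right)^{2} \]

instead of Stirling’s approximation formula. 
L. Van Hamme (Belgium) obtained the more general result 

\[ \frac{S_n}{\sqrt{n}} = \frac{\sqrt{\pi}}{2} - \frac{1}{2\sqrt{n}} + \frac{9\sqrt{\pi}}{16 n} + O\!\left( \frac{1}{n^{3/2}} \right). \]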


G. H. Gonnet showed that if r and s are arbitrary constants and 

\[ S_n(r, s) = 1 + \frac{n+r}{n+s} + \frac{(n+r)(n+r-1)}{(n+s)(n+s+1)} + \cdots + \frac{\Gamma(n+s)\,\Gamma(n+r+1)}{\Gamma(n+s+k)\,\Gamma(n+r+1-k)} + \cdots, \]


then 


Limit of a Sequence 


6376 [1982, 65]. Proposed by I. J. Schoenberg, University of Wisconsin. 
Let $a_n$ ($n = 1, 2, \ldots$) be reals. Show that 

\[ \lim_{n \to \infty} a_n \sum_{i=1}^{n} a_i^2 = 1 \quad \text{implies that} \quad \lim_{n \to \infty} (3n)^{1/3} a_n = 1. \]

Solution by A. Meir, University of Alberta. Let $s_n = \sum_{i=1}^{n} a_i^2$. Then the condition $a_n s_n \to 1$ 
implies that $s_n \to \infty$ and $a_n \to 0$. Hence we also have that $a_n s_{n-1} \to 1$. Therefore 

\[ s_n^3 - s_{n-1}^3 = a_n^2 \left( s_n^2 + s_n s_{n-1} + s_{n-1}^2 \right) \to 3 \]

and thus $s_n^3 \sim 3n$. It follows that $a_n \sim (3n)^{-1/3}$. 
Similarly, it can be proved that if for some $p > 0$, 

\[ \lim_{n \to \infty} a_n \sum_{i=1}^{n} |a_i|^p = 1, \]

then $a_n \sim ((p + 1)n)^{-1/(p+1)}$. 
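
The case $p = 2$ can be illustrated numerically with one concrete sequence. The sketch below builds $a_n$ so that $a_n s_n = 1$ holds exactly at every step, where $s_n = a_1^2 + \cdots + a_n^2$, by solving $a(s_{n-1} + a^2) = 1$ for $a$ with bisection, and then reports $(3n)^{1/3} a_n$. The choice of this particular sequence and the names `solve_step` and `scaled_terms` are assumptions made for the example.

```python
def solve_step(s_prev):
    """Root a in (0, 1] of a * (s_prev + a*a) = 1, found by bisection."""
    lo, hi = 0.0, 1.0
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if mid * (s_prev + mid * mid) < 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def scaled_terms(n_max):
    """Values (3n)^(1/3) * a_n for n = 1..n_max, where a_n * s_n = 1."""
    s, out = 0.0, []
    for n in range(1, n_max + 1):
        a = solve_step(s)
        s += a * a
        out.append((3 * n) ** (1.0 / 3.0) * a)
    return out

if __name__ == "__main__":
    vals = scaled_terms(5000)
    for n in (1, 10, 100, 1000, 5000):
        print(n, vals[n - 1])
```

The first value is $3^{1/3}$, since $a_1 = 1$, and the later values drift toward 1 as the solution predicts.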
Also solved by E. J. Barbeau (Canada), Artin Boghossian (Saudi Arabia), Robert Breusch, Paul S. Bruckman, 
Edmond Butler, Jeesen Chen, David C. Cox, Nathaniel Grossman, Ellen Hertz, Robert B. Israel (Canada), Mauri 
Koskela (Finland), T. K. Louton & C. C. Rousseau, L. E. Mattics, Robert K. Meany, William A. Newcomb, 
Johannes C. C. Nitsche, Otto G. Ruehr, Chow Seong Seow (Canada), W. T. Sledd, and the proposer. 


The proposer wishes to add that he received the problem without solution from Franklin Richards about ten 
years ago. 


Decomposition of a Function into Measurable Functions 


6378 [1982, 134]. Proposed by Yi Hong and Jingcheng Tong, Wayne State University. 


Let $f(x)$ be a real function defined on $I = [0, 1]$. Prove there are two Lebesgue measurable 
functions $g(x)$ and $h(x)$ such that $f(x) = g(h(x))$. 


Solution. Berthold Schweizer and John C. Morgan II pointed out that the result is known, 
apparently having been obtained first by S. Ruziewicz in 1932 and subsequently published in his 
paper, “Sur une propriété des fonctions arbitraires d’une variable réelle,’ Mathematica, 9 (1935) 
83-85. 

Michael Renardy and the proposers submitted the following decomposition of f. Define $h$: 
$I \to I$ by setting 


Then $h$ is strictly increasing on $I$, and $h(I)$, being a subset of the Cantor set, has Lebesgue 
measure zero. Define $g$: $I \to I$ by setting 

\[ g(x) = \begin{cases} f(h^{-1}(x)) & \text{for } x \in h(I), \\ 0 & \text{otherwise.} \end{cases} \]

Then $f(x) = g(h(x))$ for $x \in I$, and both $g$ and $h$ are Lebesgue measurable. 


The Mathematical Theory of Chromatic Plane Ornaments. By Thomas W. Wieting. Marcel Dekker, 
New York, 1982. 


MARJORIE SENECHAL 
Department of Mathematics, Smith College, Northampton, MA 01063 


Browsing through collections of patterns or ornaments, almost everyone becomes fascinated by 
their intricate repetitions: we correctly sense that, despite their diversity, all the patterns are 
variations on a few basic themes. Symmetrically colored patterns are especially attractive and 
intriguing because of the lively but orderly interaction of permutations and symmetries that they 
display. A careful scrutiny of colored patterns leads us naturally to consider color symmetries, and 
to the problem of classifying them. 
$\{(g, \pi_g),\ g \in G\}$ a color group. It is easy to verify that $\pi_g$ is the image of $g$ in $S_k$ under a 
permutation representation of G. The color groups, under a suitable definition of equivalence, 
To handle these and related applications, we need a complete enumeration of the k-color groups 
for each k, up to some reasonably large number, in 2- and 3-dimensional space. This is a subtle 
mathematical problem involving geometry, group theory, linear algebra, and a good deal of 
computation. There is a richness of material here that can form the basis for a valuable 
interdisciplinary course for advanced undergraduates. 

The cornerstone of such a course must be the theory of crystallographic groups. Bieberbach’s 
1910 solution of the first part of Hilbert’s 18th problem, together with the 1912 discovery of the 
diffraction of x-rays by crystals, brought these groups into the mainstream of modern mathematics 
and physics. Although they have rarely found their way into the undergraduate curriculum, 
they deserve a place there. Not only are they intrinsically interesting, attractive and important; 
they can also serve as valuable examples in many ways. (In algebra they give us easily visualized 
examples of group extensions with nontrivial factor sets.) The color groups are a particularly 
interesting generalization of the crystallographic groups whose heuristic value should also be 
exploited. For example, by inspecting a colored plane ornament one can quickly write down the 
permutation representation, find a subgroup which defines it, visualize the action of its cosets, 
determine whether the subgroup is normal and if it is not, compute the number of its conjugates. 
The insights obtained in this way can be very valuable, and not only to a beginner. 

The book under review thus fills a significant gap. Designed as a text, it is also a definitive 
monograph, whose goal is to classify and enumerate (for k < 60) the k-color groups of the plane. 
The material is divided into three parts: (1) the study of the Euclidean plane and its transformations, 
(2) the properties and enumeration of the seventeen isomorphism classes of plane crystallographic 
groups, and (3) the classification and enumeration of their transitive permutation 
representations. The exposition is careful and clear, with a blend of rigor and informality which is 
appropriate for the undergraduate level. Several features are especially noteworthy. The relation 
between the geometric and algebraic properties of the Euclidean plane is studied with unusual 
care, making the first part an excellent introduction to transformation geometry. In the second 
part, the two-dimensional crystallographic groups are given the prominent position that the 
subject requires and which they deserve. A unique and especially valuable feature of the third 
section is the adaptation of the algebraic conditions for the classification of permutation 
representations to a set of algorithms suitable for machine computation, and the presentation of 
extensive, definitive tables of data which correct errors in the recent literature. References to the 
historical and current mathematical literature can be found in the extensive bibliography; one 
could wish, however, that it was annotated and that references to applications had been included. 
Challenging problems at the end of each chapter extend the theory and introduce related topics. 
The material is a natural introduction to many problems of current interest, including n-dimensional 
crystallography, tilings and patterns, a deeper study of the geometry of colored patterns, 
and the algorithmic analysis of classification procedures. 

It was disappointing to find that all the illustrations, including the folio of the 96 4-color 
ornaments, are line drawings: representing colors by circles, dots and dashes does not provide a 
comparable intuitive grasp of structure. (Imagine, for example, what the impact of Escher’s 
multicolored tessellations on the scientific imagination would have been if he had shaded rather 
than colored them!) This defect, however, can easily be remedied with a good set of colored 
pencils. The book is strongly recommended as a thorough introduction to the theory of color 
groups, as a supplement to various undergraduate courses and above all as the text for a 
stimulating and significant course. 


A Concrete Introduction to Higher Algebra. By Lindsay Childs. Springer-Verlag, New York, 1979. 
xiv + 388 pp. 


MICHAEL ROSEN 
Department of Mathematics, Brown University, Providence, RI 02912 


Abstract algebra has proved to be so powerful a tool that for a long time even beginning 
textbooks would emphasize axiomatics and abstraction at the expense of calculation. For 
example, students would learn the fundamental theorem of Galois theory without learning a thing 
about computing the Galois group of an explicitly given polynomial. On a more elementary level 
they would learn that Q[X] is a unique factorization domain but have no idea how to factor a 
given polynomial, $x^6 + x^5 - 4x^4 - 4x^3 + 2x^2 + 4x + 1$ say, into a product of irreducibles. One 
of the great virtues of the book under review is that after reading it, one will no longer feel so 
helpless when confronted with so elementary a task. 

This book is very aptly described by its title. By developing the basic ideas of number theory 
and the theory of polynomials (one variable) the author instructs by means of examples, 
computations, and fascinating applications. The style is leisurely and informal, a guided tour 
through the foothills, the guide unable to resist numerous side-paths and return visits to favorite 
spots. The abstract theories of the higher elevations are only hinted at and occasionally glimpsed. 

There is little overlap between the content of this book and that of the more usual text in 
algebra intended for use at the sophomore-junior level. Groups are mentioned only briefly, and 
subgroups are omitted; rings are mentioned, but ideals appear only on the last page. Fields receive 
more attention, but Galois theory none at all. Instead the emphasis is on examples: the integers, the 
ring of polynomials, the integers modulo m, finite fields, the rational, real, and complex numbers, 
etc. This approach has the advantage that the material is interesting throughout. There are few dry 
spots. Gratification, if not immediate, is not long postponed. 

To communicate something of the flavor of this text, it is worthwhile to give a sample of some 
of the many applications which are treated. There are two chapters on secret codes, two chapters 
on error-correcting codes, a chapter on duplicate bridge tournaments, one on telephone cable 
splicing, and one on latin squares. It should be mentioned that the chapters, thirty-three in 
number, are generally quite short. One, which I especially liked, discusses the partial fraction 
decomposition of rational functions and applies the results to the problem of finding indefinite 
integrals of rational functions, and also to a problem in the theory of partitions. 

As has already been indicated, a lot of space is devoted to factoring polynomials, especially 
over Q and Z/pZ. Some of the explicit calculations are quite delightful. A pretty application is a 
proof of the law of quadratic reciprocity due to R. Swan based on a theorem of L. Stickelberger 
about factoring polynomials over finite fields. 

What we have here is a very readable introduction to higher algebra. There is lacking, however, 
any substantial amount of higher algebra itself. The question arises of how this book can be 
utilized in the standard undergraduate mathematics curriculum. It is hard to see how it could be 
used in place of more traditional texts as, for example, those of Birkhoff-Mac Lane, Fraleigh, or 
Herstein. The author simply does not present enough group, ring, and field theory. There is 
another problem. The course in abstract algebra is often used to introduce the student to 
standards of mathematical rigor and proof. The style of Childs’ book is too informal to serve this 
purpose well. An ideal set-up would be a two-semester course, the first out of Childs’ book, and 
the second out of a more conventional text. Alternatively, it could be used as the basis of a course 
for nonmath majors, students who would find the material interesting in itself, and who are not in 
need of the foundations of abstract algebra as the basis for the study of more advanced 
mathematics. 

Finally, the book is ideal for independent reading. It might be especially suitable for someone 
who is acquainted with theory but has neglected computations. Calculations and examples not 
only illuminate theory, they suggest theory. Moreover, mastering a new algorithm is often just 
plain fun. Perhaps that is why Childs’ book is not only instructive, it is very entertaining.


Calculus and Analytic Geometry. By C. H. Edwards, Jr., and David E. Penney. Prentice-Hall, 
Englewood Cliffs, New Jersey, 1982. xvi + 895 pp. $29.95. 


PETER ROSENTHAL 
Department of Mathematics, University of Toronto, Toronto, Ontario M5S 1A1, Canada


It’s spring and our thoughts turn to... choosing a calculus text for next year. We sit quietly, 
staring at the 4000 pounds of textbooks that aggressive book publishers have sent us. What was 
wrong with the old Courant? Even the old (pre-fourth) edition of Thomas wasn’t so bad. 

The sales representative of Prindle, Weber & Schmidt walked into my office and said, “I’m 
glad to see you’re using Swokowski’s Calculus this year; how do you like it?” 

“Nothing personal,” I answered, “but it stinks.” 

“Well,” the salesman replied, “It’s the best-selling calculus book in North America.” 

Apparently the salesman was correct; Calculus with Analytic Geometry by E. Swokowski is 
among the best-selling calculus books. I think I was correct too: it stinks. Many of us have 
apparently been allowing our choice of calculus text to be determined more by the hard-sell 
techniques of the publishers than by pedagogical considerations. 

In selecting calculus texts we get involved in discussions of various points: the level of the text, 
the quality of the examples, the coverage of various topics, etc. Unfortunately, as the popularity of
Swokowski proves, we often pay little attention to the exposition itself. At a glance, Swokowski
looks reasonable as a text for a middle-level calculus course. Actual reading of the book, however,
reveals it to be riddled with statements and explanations that are cumbersome, misleading, or
both. I mention some examples below; when reading them I suggest that you imagine you are a
student who doesn’t already understand the concept that Swokowski is attempting to explain.

On page 369 (second edition), after the natural logarithm has been defined, we encounter:


(8.9) DEFINITION.


The natural exponential function, denoted by exp, is defined by


exp x = y if and only if ln y = x
for all x, where y > 0.


The next theorem specifies the relationship between ln and exp.


(8.10) THEOREM.


The natural logarithmic and natural exponential functions are inverse functions of
one another.


Proof. According to Definition (1.23) we must prove
(i) ln(exp x) = x for every x, and
(ii) exp(ln y) = y for every y > 0.


These statements follow immediately from Definition (8.9), for if exp x = y, then
ln y = x, and substitution of exp x for y gives us (i). Similarly, if ln y = x, then
exp x = y, and substitution of ln y for x gives us (ii).


First note (did you miss it because you assumed he said what he should have said?) the phrase 
“for all x, where y > 0.” How many y’s are involved? If he insists on an awkward phrasing, he 
could at least say something like “for all y > 0 and all x.” Also, think about the sentence, “The 
next theorem specifies the relationship between ln and exp.” Did he define exp as the inverse of ln
or not? What kind of an idea is the “proof” of Theorem 8.10 supposed to convey?

Physicists undoubtedly appreciate Swokowski’s description of g as “the force of gravitational
acceleration” whose magnitude is “32 ft/sec²” (curious units for force), which leads to using
Newton’s Second Law to get the fact that r''(t) = g if r is the position vector of an object acted
upon by gravity (Swokowski, page 733; wouldn’t it be so much simpler if g represented the
acceleration itself!).

What do you think of the following (from page 793): 


For certain values of θ the directional derivative may be positive (that is,
f(x, y) may increase), or negative (f(x, y) may decrease), or it may be 0.


I wonder, for those values of θ where it is none of the above, what is it?

Or (from page A9):


If f(x) has a limit as x approaches a, then that limit is unique. 


Are the other limits unique too? 

There are many other instances where Swokowski’s use of language makes a simple concept 
appear difficult. Sometimes it’s hard to be sure what he’s saying even when you know the material 
thoroughly. What, for example, could he possibly mean by the assertion that “It is also possible to 
give a strictly algebraic proof” of the fact that limits are unique (page A9)? 
This book would not be a good one even if the ambiguities and misleading statements were
removed. The proofs are written in a peculiarly formal and singularly uninstructive style. Previous 
results are often cited by number without attaching a name, even if the name is standard. Thus 
one has to search through the 1000 pages to find out what is meant by “Applying 16.19,” which is 
code for “By the chain rule.” 

Some of us justify using an admittedly bad book by saying “The students don’t read the book 
anyway; its only purpose, really, is for assigning problems.” Such a claim is not an adequate 
justification for forcing students to buy huge books that are so financially and physically taxing. If 
a text is to be merely a source of homework problems, then it should be replaced by a list of 
problems or, perhaps, Schaum’s outline Calculus. 

I think that it is useful to have a text that can be read by students who want additional 
explanations of some topics. It is unfortunate, however, that we have allowed the publishers to 
limit us to choosing big expensive books. If we do choose such a book, it should at least be clear 
and correct. 

There is a new entry in the calculus sweepstakes, Calculus and Analytic Geometry, by C. H. 
Edwards, Jr. and David E. Penney (Prentice-Hall, Englewood Cliffs, N. J., 1982). It is big and
expensive, but it is also very readable and clear, and rarely misleading. It covers the standard 
topics in the standard order but livens it up with a real variety of applications, most of which are 
described in language that is simple, careful and also interesting. The explanations of the basic 
ideas of calculus are also very good, directed towards the student whose interest is in applications 
of calculus to physical or social science. It really seems possible that a student who is only taking 
calculus because some bureaucrat decreed that he or she had to will, in spite of everything, find 
some of the problems intriguing. Students who have a genuine curiosity about engineering, 
physical science or the more mathematical aspects of social science will certainly find something 
of interest in this textbook. There is not enough emphasis on the beauty of mathematics as such 
for this to be a suitable textbook for the most mathematically-inclined students, but it is quite 
good (infinitely better than Swokowski) for all others. Anyway, for prospective mathematicians 
Spivak’s Calculus is close to perfect. 

Edwards and Penney is not without faults. In a few places rough statements meant to be 
intuitive are also misleading. One such that might be mentioned, since it occurs in other calculus 
books too, is the rough definition of limit that “F(x) gets closer and closer to L as x gets closer
and closer to a” (they follow this with a precise definition). Such phrasing suggests that
$|F(x) - L|$ is a monotone function of $|x - a|$. It’s hard to convince students who think carefully
that

$$\lim_{x \to 0} x \sin \frac{1}{x} = 0$$

after they have read such a “rough definition.” Why not say “F(x) is very close to L when x is
very close to a” (as Edwards and Penney also do), and then go on to the precise definition
without introducing a misleading sentence? Also, Edwards and Penney sometimes (e.g., see pages 
108 and 504) give a proof of one theorem using a theorem whose proof is given later in the text. 
While this is logically acceptable, since the theorem proved later does not use the theorem proved 
with its help, there is serious danger that students will extrapolate incorrect circular methods of 
reasoning from such instances in a text. Some might not like terminology like “the closed interval 
maximum-minimum method,” especially when it is implied that the terminology is absolutely 
standard. It would be desirable to have an appendix that includes proofs of theorems such as the 
existence of integrals of continuous functions; in any class there might be a few students who 
would be curious about such proofs. 
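
The objection to the “closer and closer” phrasing can be checked numerically. The following is a minimal sketch, not taken from either textbook: the function name `gap` and the sample points are illustrative assumptions. It compares F(x) = x sin(1/x) at points where sin(1/x) vanishes with nearby points, closer to 0, where |sin(1/x)| = 1.

```python
import math

def gap(x: float) -> float:
    """Distance |F(x) - L| for F(x) = x*sin(1/x) and limit L = 0."""
    return abs(x * math.sin(1.0 / x))

# Hypothetical sample points, chosen for illustration only.
# zeros[k]: sin(1/x) = 0, so F(x) equals the limit (up to rounding).
zeros = [1.0 / (k * math.pi) for k in range(1, 6)]
# peaks[k]: |sin(1/x)| = 1, so |F(x)| = |x|; each peak is closer to 0 than its zero.
peaks = [1.0 / ((k + 0.5) * math.pi) for k in range(1, 6)]

for z, p in zip(zeros, peaks):
    # Moving from z to the smaller p takes x closer to 0 but F(x) farther from L.
    print(f"x = {z:.4f}: |F - L| = {gap(z):.1e}    x = {p:.4f}: |F - L| = {gap(p):.4f}")
```

Each printed row shows x moving closer to 0 while |F(x) − L| grows, so the distance is not monotone. The limit is still 0 because |F(x) − L| ≤ |x| for every x, which is the content of “F(x) is very close to L when x is very close to a.”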

The above criticisms of Edwards and Penney are quite minor. I would wholeheartedly 
recommend Edwards and Penney if it were smaller and less expensive. Most (perhaps 90%?) 
calculus students do not continue beyond one year of mathematics, and thus would probably 
cover at most 594 of the book’s 895 pages. Moreover, they do not require a hardcover
expensively-produced textbook that would last for years, especially when second editions are 
planned in order to destroy the resale value. Also, the cost and bulk of the text are increased by 
enormous margins. 

A modest proposal to Edwards, Penney and Prentice-Hall: produce a two-volume, soft-cover 
cheap edition. The first volume could contain the first twelve chapters, plus perhaps a brief 
introduction to partial derivatives. This volume could easily become the best-selling first year 
calculus book. Then volume two would be available for those students who continue beyond 
first-year calculus. Neither volume should sell for more than ten dollars. 


Introduction to Functional Analysis: Banach Spaces and Differential Calculus. By Leopoldo 
Nachbin. Marcel Dekker, New York, 1981. 


JOE DIESTEL 
Department of Mathematics, Kent State University, Kent, OH 33142 


The term “functional analysis,” as it was understood by the fathers of the area, referred to 
analyzing function spaces through naturally defined functionals acting on these spaces. By 
skillfully unearthing invariants whose code could be deciphered by means of these functionals, one 
could often solve some functional equation the natural domain of which was a judiciously chosen 
function space. Throughout the first decades of this century, this “soft” approach met with 
frequent success in such diverse areas as integral equations, minimal surfaces, partial differential 
equations, harmonic analysis and moment problems. 

During the twenties, the spectral theory of operators had stunning applications to problems 
uniquely susceptible to analysis in Hilbert spaces. The appearance in 1932 of John von Neumann’s 
“Mathematische Grundlagen der Quantenmechanik” and Marshall Stone’s “Linear Transformations
in Hilbert Spaces and Applications in Analysis” heralded the emergence of operator theory
(on Hilbert spaces) as a mature area of mathematical inquiry separate from, but intimately related 
to, what is now known as linear functional analysis. 

At that time, general linear functional analysis experienced its first formative stages of 
development. Many of the tricks of the “soft analysis” trade crystallized. General principles were 
formulated. A host of techniques evolved for approaching linear problems more general than 
those amenable to Hilbert space analysis. Three basic existential principles were soon recognized: 


THE HAHN-BANACH EXTENSION THEOREM. A linear continuous functional on a vector subspace 
of a normed space admits of a linear continuous extension to the whole space. 


THE BANACH-STEINHAUS THEOREM (alias, THE PRINCIPLE OF UNIFORM BOUNDEDNESS). Any 
family of continuous linear operators between Banach spaces that is pointwise bounded on the unit ball 
is uniformly bounded thereupon. 


THE OPEN-MAPPING THEOREM. A linear continuous operator from one Banach space onto
another maps open sets onto open sets. 


In 1932 the French translation of Stefan Banach’s “Opérations Linéaires” appeared. In it, 
these three theorems were showcased as cornerstones for the structure of functional analysis. After 
formulating each principle in its most general, palatable version, Banach provided a full cadre of 
applications of the principle; the effect was to assure their role in the study of linear problems for 
all time. 

Throughout the thirties and forties the boundaries of functional analysis continuously expanded
(with a resulting loss of clear definition). Each success resulted in still more adventurous
forays into formerly alien territory. Gelfand’s investigations into the structure of commutative 
Banach algebras reunited general linear functional analysis with operator theory to produce, 
among other things, a startling proof of the spectral theorem for bounded normal operators. 
Gelfand theory was used to study locally compact abelian groups; a new proof of Pontryagin
duality resulted. Fourier analysis on locally compact abelian groups became a viable subject, and 
abstract harmonic analysis was born. 

After the war, the French took up where the Polish school of functional analysts left off, and 
began a series of intensive investigations into the structure of linear topological spaces, especially 
spaces of continuously differentiable functions and their duals. Laurent Schwartz developed the 
theory of distributions (a theory anticipated by others but unquestionably brought to fruition by 
Schwartz). The stage was set for a high-water mark in the history of functional analysis: the
discovery by Bernard Malgrange and Leon Ehrenpreis that all homogeneous linear partial 
differential equations with constant coefficients have fundamental distributional solutions. Their 
proof is a tour-de-force in the use of the Hahn-Banach Theorem. 

As the sixties dawned, the tools that a young functional analyst needed were diverse—as were 
the potential applications. Choquet theory wed linear functional analysis with operator theory; it 
made serious measure theory a valued ally of the functional analyst. Probabilistic techniques and 
motivations invaded linear analysis and operator theory; complex analysis provided a number of 
interesting problems that could be recast and solved in a functional analytic framework. 
Practically every area of analysis provided tractable problems, its own peculiar techniques, and its 
own intuitions for the functional analyst to test his wares. 

These developments paid dividends. Long thought to be unapproachable, the classical prob- 
lems in Banach spaces were attacked with renewed vigor. (Only a few of the problems posed by 
Banach in his monograph remain unsolved.) Moreover, serious applications of the structure theory 
of Banach spaces have been found in harmonic analysis, probability theory, interpolation theory, 
approximation theory and the distribution of eigenvalues of operators on Hilbert spaces. 

The study of operators on a Hilbert space has undergone several dramatic developments. The 
latest has mated operator theory with K-theory and resulted in serious affairs with differential 
geometry and algebraic topology. The subject “functional analysis” has come to include a 
veritable potpourri of mathematical pursuits. 

It is the very breadth of the area that allows for a rich variety of introductory books to be 
written. Many of the most famous functional analysts have contributed important basic texts. 
Now, Professor Leopoldo Nachbin has added his name to the list. 

Professor Nachbin’s Introduction to Functional Analysis: Banach Spaces and Differential Calcu- 
lus is aimed, remarkably enough, at advanced undergraduate and beginning graduate students 
from economics, engineering, physics and, perhaps, even mathematics. It aims at acquainting the 
students with elementary facts about normed linear spaces and the theory of differentiation of 
functions between such spaces. 

Though the three basic principles are stated within the text, no serious undertaking is made to 
demonstrate their power. Perhaps functional analysis has reached the point where the initial 
contact with the subject can be effective by just acquainting the student with the main terms and 
most elementary facts. Perhaps this is so. But, quite frankly, it seems that Professor Nachbin sold 
the subject (and himself) a bit short. Had he only stated the Three Principles and used them at 
several critical junctures, he might well have written THE classic introduction for the type of 
audience he was addressing. His writing style is so relaxed and clear he just might have pulled off 
the missionary accomplishment of twentieth century mathematics: the conversion of a whole 
horde of economists, engineers, and physicists to functional analysis! By not exhibiting the 
fundamental principles playing their natural roles in functional analysis, Professor Nachbin has 
made it more difficult for the serious reader to begin on more earnest study of the subject. With 
all that one now needs to know, this is unfortunate indeed. 


LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, Indiana University, Bloomington, 
IN 47405. 


Editor: 


In a recent note, W. M. Snyder [this MONTHLY, 89 (1982) 462-466] considered prime divisors 
of numbers of the form $(b^n - 1)/(b - 1)$ with integers n, b > 1. I wish to point out that the main
result of this note can be deduced easily from a classical theorem due to K. Zsigmondy [Monatsh. 
Math. Phys., 3 (1892) 265-284] and proved again by G. D. Birkhoff and H. S. Vandiver [Ann. of 
Math., 5 (1904) 173-180]. Furthermore, a crucial result on cyclotomic polynomials in Snyder’s 
note can be proved in a quicker and more elementary fashion. 

Snyder’s main result says that $(b^n - 1)/(b - 1)$ has a prime divisor $p \equiv 1 \bmod n$ if and only if
$n > 2$, or $n = 2$ and $b + 1$ is not a power of 2. According to the theorem of Zsigmondy, for any
$n > 2$ and $b > 1$ (except in the case $n = 6$, $b = 2$) there exists a prime divisor p of $b^n - 1$ such
that p does not divide any of the integers $b^m - 1$ with m being a proper divisor of n. In particular,
p does not divide $b - 1$, hence p divides $(b^n - 1)/(b - 1)$. The properties of p imply also that n is
the multiplicative order of b mod p, and so n divides $p - 1$, i.e., $p \equiv 1 \bmod n$. The remaining cases
$n = 2$ and $n = 6$, $b = 2$ in Snyder’s result are trivial.
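
The statement of Snyder’s main result can be spot-checked for small values. The sketch below is only an illustration under stated assumptions: the helper names (`prime_factors`, `repunit`, `primes_one_mod_n`, `snyder_predicts`) are hypothetical, and trial division is used purely because the numbers involved are small. It is a numerical check, not a proof.

```python
def prime_factors(m: int) -> set[int]:
    """Distinct prime factors of m >= 1 by trial division (adequate for small m)."""
    factors = set()
    d = 2
    while d * d <= m:
        while m % d == 0:
            factors.add(d)
            m //= d
        d += 1
    if m > 1:
        factors.add(m)
    return factors

def repunit(b: int, n: int) -> int:
    """The integer (b^n - 1)/(b - 1) for integers b > 1, n > 1."""
    return (b**n - 1) // (b - 1)

def primes_one_mod_n(b: int, n: int) -> list[int]:
    """Prime divisors p of (b^n - 1)/(b - 1) with p = 1 mod n."""
    return sorted(p for p in prime_factors(repunit(b, n)) if p % n == 1)

def snyder_predicts(b: int, n: int) -> bool:
    """True when the result asserts that such a prime exists."""
    b_plus_1_is_power_of_2 = (b + 1) & b == 0
    return n > 2 or (n == 2 and not b_plus_1_is_power_of_2)

print(primes_one_mod_n(2, 6))  # the exceptional Zsigmondy case n = 6, b = 2
print(primes_one_mod_n(3, 2))  # b + 1 = 4 is a power of 2
```

In the exceptional case n = 6, b = 2 the quotient is 63 = 3² · 7, and 7 ≡ 1 mod 6, which is why that case is trivial. For n = 2, b = 3 the quotient is 4, which has no odd prime divisor, so the list is empty.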

The original proof of Snyder’s result uses elementary number theory and known properties of 
cyclotomic polynomials, except for the proof of the following fact in which extensive use is made 
of algebraic number theory: if $\Phi_n(x)$ is the nth cyclotomic polynomial over the rationals and q is a
prime dividing $\Phi_n(b)$ but not n, then $q \equiv 1 \bmod n$. To prove this in an easier way, we note that
since q does not divide n, the polynomial $\Phi_n(x)$ considered mod q yields the nth cyclotomic
polynomial over the finite field $F_q$ of integers mod q, whose roots are the primitive nth roots of
unity over $F_q$. Now $\Phi_n(b) \equiv 0 \bmod q$ implies that $F_q$ contains a primitive nth root of unity, that is,
an element of multiplicative order n, and so n divides $q - 1$, i.e., $q \equiv 1 \bmod n$.
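
The cyclotomic fact above can also be tested on small cases. The following sketch is an illustration only: it evaluates the nth cyclotomic polynomial at b through the standard Möbius product over the divisors d of n of $(b^{n/d} - 1)^{\mu(d)}$, and the function names (`mobius`, `cyclotomic_value`, `exceptional_primes`) are hypothetical choices rather than anything from the letter.

```python
def mobius(m: int) -> int:
    """Moebius function of m >= 1, by trial division."""
    result = 1
    d = 2
    while d * d <= m:
        if m % d == 0:
            m //= d
            if m % d == 0:
                return 0  # m has a square factor
            result = -result
        d += 1
    if m > 1:
        result = -result
    return result

def cyclotomic_value(n: int, b: int) -> int:
    """Value of the nth cyclotomic polynomial at an integer b > 1."""
    num, den = 1, 1
    for d in range(1, n + 1):
        if n % d == 0:
            mu = mobius(d)
            if mu == 1:
                num *= b ** (n // d) - 1
            elif mu == -1:
                den *= b ** (n // d) - 1
    return num // den  # the division is exact

def prime_factors(m: int) -> set[int]:
    """Distinct prime factors of m >= 1 by trial division."""
    factors = set()
    d = 2
    while d * d <= m:
        while m % d == 0:
            factors.add(d)
            m //= d
        d += 1
    if m > 1:
        factors.add(m)
    return factors

def exceptional_primes(n: int, b: int) -> list[int]:
    """Primes q dividing the cyclotomic value, not dividing n, with q != 1 mod n."""
    value = cyclotomic_value(n, b)
    return sorted(q for q in prime_factors(value) if n % q != 0 and (q - 1) % n != 0)

print(cyclotomic_value(6, 2))     # x^2 - x + 1 at x = 2
print(exceptional_primes(12, 10)) # expected to be empty by the fact above
```

By the fact proved in the letter, `exceptional_primes` should return an empty list for every n ≥ 1 and b > 1. Note that for n = 6, b = 2 the value is 3, a prime that divides n, so it is correctly excluded from the claim.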


Harald Niederreiter 
Mathematical Institute, Austrian Academy of Sciences, 
Dr. Ignaz-Seipel-Platz 2, A-1010 Vienna, Austria 


Editor: 


Since the publication of my note “Factoring Repunits” [this MONTHLY, 89 (1982) 462-466], I 
have received several letters indicating to me that my results are not new. This culminated recently 
with a “Letter to the Editor” by H. Niederreiter who also showed that the results could be 
obtained quite easily by elementary methods. 

It is very unfortunate that my article was ever published. 

To make matters worse, there is an error in the proof of the last lemma. I would like to thank 
Frank Schmidt from SIU at Carbondale for pointing this out to me. On page 466, the 7th line 
down “d = 1, mod p implies $q \equiv 0 \bmod p - 1$”, the conclusion should have been
$p - 1 \equiv 0 \bmod q$. Because of this error the lemma, although still correct, needs some changes in the proof.
In light of the present circumstances, however, it seems inappropriate for me to bother to correct 
the proof here. I suggest that anyone interested in the details correct the proof as a fairly easy 
exercise, or, perhaps, consult the literature for a correct proof. 


W. M. Snyder 

Department of Mathematics 
University of Maine 

Orono, Maine 04469 


Editor: 


In 1976, the Nepali Mathematical Sciences Report came into existence. It is the first 
mathematics journal to be developed and published in the Kingdom of Nepal. Since 1976, the 
journal has been published twice a year. Each issue has been about fifty pages long with seven or 
eight articles ranging over a variety of topics. For the most part these articles have been written by 
a small group of mathematicians who are located on a campus of the national university just 
outside Kathmandu. Occasionally, articles written by non-Nepalis from universities in nearby 
northern India or by mathematicians on assignment from other countries have been published. 

Mathematical terminology in the Nepali language is in a developmental stage. Each issue of the 
journal contains a glossary section, which is devoted to the construction of a consistent set of 
terms to be used in curriculum development at all levels. To enhance the general problem solving 
capability of its readers, most issues contain a problem section similar to that found in the 
MONTHLY. Through these sections the journal serves as a strong educational device to raise the 
level of mathematical knowledge and understanding throughout the Kingdom. 

My purpose in writing to you is two-fold—to inform the readers of the MONTHLY of the 
existence of this journal and to request their help in providing articles for it. Assistance is needed 
if the journal is to survive. Expository articles will be very useful. Those that contain a historical 
perspective are most welcome. I have been one of the editors of the report since its inception and 
will be glad to receive and forward articles from interested readers. The format should be similar 
to that used for articles for the MONTHLY. 


George F. Feeman 

Department of Mathematical Sciences 
Oakland University 

Rochester, Michigan 48063 


MISCELLANEA 
113. 


Where have all the residents gone? Gone to infinity every one. 


According to Gamow (1947/1971, p. 17) and Rucker (1982, p. 73), Hilbert’s Hotel has an 
infinite number of rooms numbered with the integers 1,2,3,..., each occupied by one resident. 
Moreover, they can make room for a finite number n of newcomers by each moving “up” n 
places, or for a countable number of newcomers by each doubling the number of his room. 
Consider now a slightly different paradox. 

The manager, wishing to have the rooms cleaned by an infinite number of chambermaids, at 10 
a.m. asks each of the residents to move up one room in one hour, another room in the next 
half-hour, another room in the next quarter-hour, and so on. By noon all the rooms are empty and 
the chambermaids begin scrubbing the floors. Where have all the residents gone and how can they 
get back? 


References 
1. G. Gamow, One, Two, Three... Infinity, Viking Press and Bantam Books, New York, 1947/1971.
2. R. Rucker, Infinity and the Mind, Birkhäuser, Boston, 1982.
—I. J. Good 
Statistics Department 
VPI & SU 


Blacksburg, VA 24061 
Geometry: A High
School Course 
Serge Lang and Gene Murrow 


This is a completely new approach 
to teaching high school geometry. 
The subject is treated as a coherent 
whole—with interesting exercises, 
numerous helpful illustrations, some 
algebra, and proofs not usually 
given. Long pedantic discussions 
and insignificant items (such as 
common tangents, power of a point, 
and more) have been left out. Stu- 
dents will enthusiastically gain a 
thorough understanding of geome- 
try, and distinguish between impor- 
tant results and technical routines. 
1983/470 pp./545 illus./Paper
$24.00 

ISBN 0-387-90727-0 


Forthcoming... 

Topology 

Klaus Jänich

1984 /approx. 196 pp./181 illus./ 


Cloth $24.00 
ISBN 0-387-90892-7 


A Mathematics Education from Springer-Verlag


With the latest appearances in 
our Undergraduate Texts in 
Mathematics series .. . 


Basic Topology 
M.A. Armstrong 


This broad introduction to topology 
gives wide coverage of a variety of 
different techniques and applications 
involving point set, geometric and 
algebraic topology. Over 350 prob- 
lems of varying difficulty and more
than 130 illustrations included. 
1983/approx. 264 pp./132 illus./ 
Cloth $19.80 

ISBN 0-387-90839-0 


Linear Algebra Through 


Geometry 
Thomas Banchoff and John Wermer 


The ideas of elementary linear alge- 
bra are taught to undergraduates 
using 2 and 3 dimensional geome- 
try. 

1983/257 pp./81 illus./Cloth $24.00 
ISBN 0-387-90787-4 


Undergraduate Analysis 
Serge Lang 


This logically organized, compre- 
hensive text introduces students to 
basic techniques for analysis. 
1983/approx. 545 pp./58 illus./ 
Cloth $28.00 

ISBN 0-387-90800-5 


Variational Calculus with 


Elementary Convexity 
John L. Troutman 
With the assistance of W. Hrusa 


Variational methods used to formu- 
late and solve mathematical and 
physical problems are introduced. 
Students will gain first-hand insight 
into the systematic use of elemen- 
tary (partial) convexity of differen- 
tial functions in Euclidean space. 
1983/364 pp./73 illus./Cloth $29.80 
ISBN 0-387-90771-8 
To order, write: Springer-Verlag New York Inc., 175 Fifth Ave., New York, N.Y. 10010.

The Contest Problem
Book IV, 


Annual High School Mathematics 
Examinations 1973-1982, 


New Mathematical Library #29 


Edited by Anneli Lax. 

Compiled and with solutions by Ralph A. Artino,
Anthony M. Gaglione, and Niel Shell. 

198 pp. Paperbound. 

List: $10.00 MAA Member: $8.00 


The American High School Mathematics Examination, given each 
year to over 400,000 high school students, helps to identify young 
people with an interest in and a talent for mathematics problem solv- 
ing. This volume contains the Examinations for 1973-1982, and is a 
continuation of Contest Problem Book I, II, and III, published as vol-
umes 5, 17, and 25 of the New Mathematical Library. The three 
earlier books contain the first twenty-five annual examinations along 
with solutions. 

The problems are designed so that they can be solved with a 
knowledge of precalculus mathematics, with emphasis on intermedi- 
ate algebra and plane geometry. The book includes subject classifica- 
tions that indicate which questions relate to a particular topic in 
mathematics. 

The solutions have been carefully constructed, but students are 
encouraged to attempt their own solutions before looking at the ones 
offered. 

If you teach or counsel high school students, or know of a young
person interested in mathematics problem solving, take this
opportunity to order Contest Problem Book IV now.


ORDER FROM: The Mathematical Association of America
1529 Eighteenth Street, N.W.
Washington, D.C. 20036
from the MAA Studies
in Mathematics .. . 


Studies in 
Computer Science 


Edited by Seymour Pollack 
Volume 22, MAA Studies in Mathematics 
408 pp. Cloth. List: $29.00 MAA Member: $22.00


Studies in Computer Science continues the tradition of excellence of 
the respected series MAA Studies in Mathematics. Written by com- 
puter scientists for mathematicians, the book presents a readable and 
balanced discussion of the role of mathematics in computer science, 
and the contributions of computing to mathematics. 

Opening with an historical overview of the development of com- 
puter science, the narrative continues with three articles on the 
nature of computer programs and the programming process, a dis- 
cussion of computational complexity, and a concluding group of 
articles on the impact of computer science on artificial intelligence, 
numerical analysis, statistics, and simulation. 

In the introduction the editor expresses his hope that the book 
will provide “an interesting look at an explosive field in the process of
becoming.” What an understatement! The process is well advanced;
the impact of computer science on mathematics and its teaching is 
already immense. And this book is not just interesting, it is exciting 
and important as background reading for anyone interested in the 
interactions between mathematics and computer science. Teachers 
will find that it enriches their teaching and students will find it 
rewarding supplementary reading. Order your copy today! 


ORDER FROM: The Mathematical Association of America
1529 Eighteenth Street, N.W.
Washington, D.C. 20036


SOME INFORMATIVE AND USEFUL BOOKS FROM THE MAA...


Browse through this list and see if your library is missing some of these important books
published by the Association.


APPLICATIONS OF UNDERGRADUATE 
MATHEMATICS IN ENGINEERING — written and 
edited by Ben Noble. A collection of articles prepared 
by engineers for the Committee on the Undergraduate 
Program in Mathematics. 364 pp. Hardbound. 

List: $19.00. MAA Member: $13.00. 


ANNOTATED BIBLIOGRAPHY OF EXPOSITORY 
WRITING IN THE MATHEMATICAL SCIENCES, 
prepared by M.P. Gaffney and L.A. Steen. An invaluable 
reference source of expository articles in mathematics. 
282 pp. Paperbound. 

List: $12.00. MAA Member: $8.00. 


AN ANNOTATED BIBLIOGRAPHY OF FILMS AND 
VIDEOTAPES FOR COLLEGE MATHEMATICS, by 
David I. Schneider. An up-to-date listing of films and 
videotapes available for classroom use. 107 pp. 
Paperbound. 

List: $9.00. MAA Members: $6.00. 


A BASIC LIBRARY LIST FOR TWO-YEAR COLLEGES, 
prepared by the Committee on Basic Library Lists. A 
recommended library nucleus for two-year colleges. 

66 pp. Paperbound.

List: $8.00. MAA Member: $6.00. 


A BASIC LIBRARY LIST FOR FOUR-YEAR 
COLLEGES, prepared by CUPM. Presents listings of 
books and journals that should be in every college 
library. 106 pp. Paperbound. 

List: $9.00. MAA Member: $6.50 


THE CHAUVENET PAPERS. A Collection of Prize 
Winning Expository Papers in Mathematics, edited by 
James C. Abbott. Two volumes of the collected prize
winning Chauvenet Papers. Vol. 1: 312 pp.
Hardbound. Vol. 2: 282 pp. Hardbound.

List: $21.00 each. MAA Member: $16.00 each.

Two volume sets 

List: $36.00. MAA Member: $27.00. 


CRITICAL VARIABLES IN MATHEMATICS 
EDUCATION: Findings from a Survey of the 
Empirical Literature, by E.G. Begle. A joint publication 
of the MAA and the National Council of Teachers of 
Mathematics. 

List: $8.00. MAA Member: $6.40. 


A COMPENDIUM OF CUPM RECOMMENDATIONS. 
Volumes I and II. A collection of the major 
recommendations of the Committee on the 
Undergraduate Program in Mathematics. Two volumes. 
756 pp. Hardbound. 

List: $16.50. MAA Member: $12.00. 


THE WILLIAM LOWELL PUTNAM MATHEMATICAL
COMPETITION: PROBLEMS AND SOLUTIONS — 
1938-1964. Compiled by R. E. Greenwood, A. M.
Gleason, and L. M. Kelly. Contains problems and 
solutions to the first 25 Putnam Exams. 652 pp. 
Hardbound. 

List: $35.00. MAA Member: $26.00. 


THE MATHEMATICAL ASSOCIATION OF AMERICA: 
Its First Fifty Years. An historical perspective of the 
Association. 170 pp. Hardbound. 

List: $10.00. MAA Member: $5.00. 


FIFTY YEAR INDEX OF THE MATHEMATICS 
MAGAZINE, edited by Lynn A. Steen, and J. Arthur 
Seebach. Cumulative index of volumes 1-50. 163 pp. 
paperbound. 

List: $10.00. MAA Member: $6.50. 


INDEX OF THE AMERICAN MATHEMATICAL 
MONTHLY. Contains the tables of contents for each 
issue of volumes 1-80 as well as subject and author 
indices. 269 pp. Hardbound. 

List: $19.00. MAA Member: $13.00. 


PRIME-80. Proceedings of a Conference on 
Prospects in Mathematics Education in the 1980's. 
Included are the background of the conference and the 
recommendations that resulted. 

84 pp. Paperbound. $3.50. 


PROFESSIONAL OPPORTUNITIES IN THE 
MATHEMATICAL SCIENCES, Eleventh Edition. 1983. 
Designed for the student interested in a career in 
mathematics. 35 pp. Paperbound. $1.50 each. 95¢ for 
orders of five or more. 

RECOMMENDATIONS ON A GENERAL 
MATHEMATICAL SCIENCES PROGRAM. Prepared by 
the Committee on the Undergraduate Program in 
Mathematics. (CUPM) 102 pp. Paperbound. $3.50 each. 


Order From: 


THE MATHEMATICAL ASSOCIATION 
OF AMERICA 

1529 Eighteenth Street, N.W. 
Washington, D.C. 20036 


FROM THE MAA... 


A revised edition of a classic 


A PRIMER OF REAL FUNCTIONS 

Ralph P. Boas, Jr.

Carus Mathematical Monograph, #18. 
Third Edition 


xi + 232 pages. Hardbound. 
List, $16.50. MAA Member: $12.00 


“A gold mine of interesting, uncommon insight and examples. . .an orderly composition of
24 partially independent elegant snapshots from the theory of sets and real functions.” 
Lynn A. Steen, commenting on the second edition of “A Primer of Real Functions” in THE 
AMERICAN MATHEMATICAL MONTHLY, 1974. 

The Third Edition includes the most significant revisions to date of this classic 
volume. Terminology has been modernized, proofs improved, and sections have been 
completely rewritten. Much new material has been added. The Primer contains the basic 
material on functions, limits, and continuity that students ought to know before starting a 
course in real or complex analysis. It is a good place for a student (or anyone else) to see 
techniques of real analysis at work in uncomplicated but interesting situations. 

The author says, “My principal objective was to describe some fascinating 
phenomena, most of which are not deep or difficult but rarely or never appear in text books 
or in standard courses.” Boas has achieved that objective. The result is a book that should 
be on every teacher’s book shelf, in every college library’s, and a part of every serious 
mathematics student’s experience. 


Order your copy now from: 


THE MATHEMATICAL ASSOCIATION 
OF AMERICA

1529 Eighteenth Street, N.W.
Washington, D.C. 20036


THE DOLCIANI 
MATHEMATICAL EXPOSITION SERIES


All seven of the books in this widely acclaimed series contain fascinating problems that have 
ingenious solutions and/or involve intriguing results. These books will challenge the advanced 
mathematician as well as stimulate the mathematically talented high school student. 

The seven volumes available in this series are: 


MATHEMATICAL GEMS I, by Ross Honsberger. xi + 176 pages, Hardbound. 
List: $16.50. MAA Member: $12.00. 


MATHEMATICAL GEMS II, by Ross Honsberger. ix + 182 pages. Hardbound. 
List: $16.50. MAA Member: $12.00. 


MATHEMATICAL MORSELS, by Ross Honsberger. xii + 249 pages. Hardbound.
List: $21.00. MAA Member: $16.00.


MATHEMATICAL PLUMS, edited by Ross Honsberger. Articles by R.P. Boas, G. D. Chakerian, 
H.L. Dorwart, D.T. Finkbeiner, Ross Honsberger, K. R. Rebman, and S.K. Stein. ix + 182 pages. 
Hardbound.
List: $18.00. MAA Member: $13.50.


GREAT MOMENTS IN MATHEMATICS BEFORE 1650, by Howard Eves. xiv + 269 pages. Hardbound. 
List: $22.00. MAA Member: $17.00. 


MAXIMA AND MINIMA WITHOUT CALCULUS, by Ivan Niven. xv + 323 pages. Hardbound.
List: $24.50. MAA Member: $18.50.


GREAT MOMENTS IN MATHEMATICS AFTER 1650, by Howard Eves. xii + 263 pages. Hardbound.
List: $23.50. MAA Member: $17.50.


SPECIAL TWO-VOLUME GREAT MOMENTS PACKAGE - “Great Moments... Before and After 1650”
List: $39.00. MAA Member: $29.50. 


THE MATHEMATICAL ASSOCIATION OF 
AMERICA 

1529 Eighteenth Street, N. W. 

Washington, D.C. 20036 


Only Harper & Row texts make math 
this accessible. Or practical. 


NEW! 
Britton & Bello 


TOPICS IN CONTEMPORARY MATHEMATICS 


Third Edition 


In its new edition this popular text continues 
to put math within the grasp of every liberal 
arts student by relating basic concepts to so- 
ciology, psychology, business, and consumer 
concerns. Now includes a new chapter on 
consumer mathematics, a new chapter on 
counting techniques, a revised geometry
chapter with many new practical applica-
tions, and revised chapters on probability
and on calculators and computers. There is a


NEW!
Leithold
DRAWING SEIFERT SURFACES THAT FIBER THE FIGURE-8 KNOT
COMPLEMENT IN $S^3$ OVER $S^1$


GEORGE K. FRANCIS 
Mathematics Department, University of Illinois, Urbana, IL 61801 


Few areas of contemporary mathematics can boast of such an abundance of elementary and 
compelling examples as low dimensional topology. This is the study of curves, surfaces and 
3-manifolds. The intense preoccupation of its practitioners with the concrete, tangible arrange- 
ment of plastic forms has led them to adopt a vividly pictorial genre of exposition. The process of 
inventing these apt examples, call it descriptive topology, appears to have its own visual logic and 
graphical rules. These also deserve serious study. As a pedagogical utility, descriptive topology 
should be straightforward and technically uncomplicated. It should never distract from the 
mathematics it means to illustrate. Above all, the topological design of its pictures should be 
memorable, ready to hand at the tip of a pencil or a piece of chalk. 

Apt, easy, unambiguous and memorable pictures are what this article is about. In order not to 
try the reader’s patience or offend editorial hospitality, however, I shall avoid vague generalities 
and focus on a particular example. Technical details on drawing pictures were outlined in [5] and 
a primer for a general method is in preparation [6]. Here I shall tell a picture story about 
visualizing the fibration over the circle of the figure-8 knot complement. This exercise of the 
imagination consists of filling up the void of space, closed by a point at infinity to form the 
3-sphere, $S^3$, with a continuous succession of surfaces spanning the knot. That is, through each
point not on the knot, K, there will pass a unique copy of a compact, connected, bordered, 2-sided
surface, F, called a Seifert surface, whose boundary is the knot.

The reasons why one should want to do this, and why it should be possible at all, belong to the 
fascinating biography of this remarkable knot. It is too long to tell about here. May this brief 
anecdote serve as an invitation to a rewarding treasure hunt in the literature. For a start, you 
should look at Rolfsen’s masterpiece of descriptive topology [12] and then proceed to [9]. My own 
interest in it stems from a project to illustrate some ideas in [3] and [20]. 

This example is of particular interest to me because it belongs to an area of topology that 
already has a highly developed and effective graphical shorthand. This shorthand consists of 
“schematic” diagrams which are highly abstract, and often terse. Recognizable pictures of familiar 
shapes, arranged in space as dictated by the diagrams, help us see and remember the encoded 
information. Sometimes, a particularly well-designed picture can lead directly to an insight that is 
hard to decode from all the algebra generated in the service of precision. What follows is meant to 
be readable on an undergraduate level not much beyond advanced calculus. John Stillwell’s clear 
and superbly illustrated text [17] covers the topological gaps. The more sophisticated reader, 
whose indulgence I beg for frequent abus de langage, is invited to read into my pictures many 
more ideas from the theory of fibered knots than can be explained at this level. At the end are 
“solutions” to some of these implied exercises. 


G. K. Francis: Here is my story. I still have my first Bilderbuch. Printed, bound, and given to me by my 
grandfather on my second birthday in 1941, it survived the bombs and turmoil of my early years in Europe. My love 
of pictures, to look at and, later, to draw more for didactic than aesthetic purposes, filled my notebooks with doodles 
through my school years in South Bend, Indiana. Ky Fan and Yuri Rainich introduced me to topology and geometry 
at Notre Dame. But this was the era of Bourbaki, and for a decade I erred blindly in higher dimensions, studying 
differential topology with Raoul Bott. With his blessings I returned to the Midwest and apprenticed myself to the 
concrete, the particular, the visible topological analysis of Chuck Titus at Michigan. One summer, in Ann Arbor, I 
assisted Bernard Morin. He lectured on Thom and Mather and I drew his vivid pictures on the blackboard. I labored 
next in a multiply-connected labyrinth of geometrical disciplines. The corridors of Altgeld Hall, Urbana, are lined 
with string and plaster models: mute and dusty memories from the days of Klein and Poincaré. Finally, Bill Abikoff
arrived and with him Thurston’s topological renaissance. Armed with publications and protected by tenure, I headed 
out on my own quests. Whether I meet giants or windmills only time will tell. 
Fig. 1. Views of the figure-8 knot.

Fig. 2. Seifert surfaces spanning the knot.


Figure 1. Views of the Figure-8 Knot. The figure-eight knot, by sailors “used only to prevent
ropes from unreeving; it forms a large knob” [18], has been part of “topology” ever since Listing 
coined this name for the mathematical theory of position [11]. Dubbed “four-knot” by Tait, it is 
the only simple closed knotted curve in space whose picture can be drawn with four, alternating 
over and under, crossings. Here is a recipe for moving from one of its planar positions 1(12) to the 
other 1(22). [The abbreviation n(ij) refers to the detail in row i column j of Figure n.] Twist a 
small loop into a segment of 1(13); this allows you to shrink the large loop 1(21) while untwisting 
it 1(22). Now swing the large arcs forward, placing the knot in its most symmetric position in 
space 1(23). A rotation by 90° in the picture plane, followed by a reflection in that plane, moves 
the knot 1(23) into itself: it is its own “mirror image.” Invariance of a geometrical structure under 
such a turn-reflection (Drehspiegelung) is often useful. Here it will halve our effort in visualizing 
the fibration of its complement by Seifert surfaces. 


Figure 2. Seifert Surfaces Spanning the Knot. It can be quite difficult to see how a surface 
might span a knot which is drawn in a particular way. For this reason it is wise to experiment with 
several positions of the knot. Position 1(12) is not the most convenient but serves to illustrate a 
simple approach to spanning a knot. There is a systematic procedure for doing this, due to Seifert 
[14], based on the combinatorics of the knot projection, but this won’t be needed here. With a bit 
of practice you can learn to “see” twisting ribbons, as well as more or less convex patches, 
spanning parts of the knot. The difficulty is to see how these patches might fit together. In 2(11) 
two ribbons, with a full twist each, have come into conflict. They attach to “opposite” sides of the 
knot. Two ways of resolving this conflict locally are shown in 2(22). 2(21) is a line pattern for this 
picture. The conflict is resolved globally in 2(12) by continuing the lower ribbon so it fills out a 
bowl-like patch. This forces the upper ribbon to become a handle or strap across the bowl; the 
result is Seifert’s classic position [14]. For a different version, 2(13), continue the upper ribbon of 
2(11) to form a plane patch extending to infinity beyond the square border. This forces the lower 
strap to hang limply from the patch. 

The twisted straps of 2(11) could have been merged differently so that the bowl of 2(12) is in 
front of the upper strap, and the square patch of 2(13) is in front of the lower strap. This specifies 
four surfaces spanning the knot. These surfaces can be described in terms of 2(23), which is a 
“cubist” stylization of 2(12). For 2(13) draw the box in front. For the alternate version of 2(12) 
draw the horizontal strap inside the box; draw the box in front for the alternate of 2(13). A 
motion, technically called an isotopy in $S^3$, that takes 2(13) to 2(12) is easy to imagine. Move the
infinite, planar patch of 2(13) past infinity. Think of a huge bowl behind the plane of the paper 
with a square opening into which 2(13) fits. Shrink it to the shallow bowl of 2(12). Now, if you can 
already imagine an isotopy of 2(12) into a position where the bridge is in back of the bowl, and a 
similar isotopy through the fourth position back to 2(13), then you have already grasped the gist 
of this article. Remember, the motion must not involve any tears or self-penetration of the surface, 
and each surface must differ from every other in all of its interior points. The boundary is, of 
course, always the knot, K. 

Moreover, Seifert’s surface turns about K without sliding along it: a page in a ring binder, not a 
spiral notebook. 


Fig. 3. Mnemonic pictures of fibrations.


Figure 3. Mnemonic Pictures of Fibrations. A fibration “over the circle, by Seifert surfaces” of
the knot complement $M = S^3 \setminus K$ may be described by a smooth map $f: M \to S^1$ of M onto $S^1$
which is everywhere of maximal rank (equal to 1) and which is “well behaved” where M converges
to the knot. 3(11) is a picture mnemonic for this notion. This essentially static structure on $S^3$
translates to a moving surface, tied to the knot like a sail, by the following bit of differential
topology. Orient the circle on which f takes its values and parametrize it by degrees from 0 to 360
for typographical convenience. By the implicit function theorem, the inverse image
$F(t) = \{p \in M : f(p) = t\}$ of each $t \in S^1$ is a properly embedded, two-sided, two-dimensional submanifold of
M. The surfaces F(t) are oriented by the gradient vector field $\nabla f$, which is everywhere
perpendicular to them. Choose one of these, F = F(0), as reference surface and integrate along
$\nabla f$. This produces a flow on M that moves F through a succession of positions F(t), with the last
coinciding with the first. That is, for each $p \in F$ solve the initial value problem


$$\frac{dp}{dt} = \nabla f(p(t)), \qquad p(0) = p.$$

Even though F(360) = F(0), there is no reason to expect the flowlines to close up after one
time around; in general $p(360) \neq p(0)$.
The knottedness of closed flowlines of $\nabla f$ is very ably investigated in [3], [4]. The first return
map,


$$g: F \to F, \qquad g(p(0)) = p(360),$$


called the monodromy of the fibration, is a diffeomorphic self-map, or automorphism, of the
surface F. This is an example of a Poincaré map for the dynamical system. The manifold may be
reassembled from the monodromy by identifying top and bottom of the cylinder $F \times [0,360]$ by
the recipe $(p,0) \sim (g(p), 360)$. 3(12) is a picture mnemonic for such a mapping torus, M(g), of
the automorphism $g: F \to F$.

I have used the conventional symbol, 3(21), for Seifert’s surface as cross section of M(g). It 
represents two annuli joined together. Since such a ribbon neighborhood of two circles crossing 
once, 3(22), can be joined with a disc, 3(23), to form a whole torus, F is topologically a torus with 
a disc removed. The difference between 3(22) and all the other versions of Seifert’s surface I have 
drawn is in the twisting of the ribbons. This is a property of the position of the surface in space, 
not of the surface as an abstract manifold. 

Even if the monodromy is not the identity map on F, it is still possible that it is isotopic to it.
That is, there may be a smooth map


$$h: F \times [0,1] \to F, \qquad h(p,0) = p, \qquad h(p,1) = g(p),$$

for which each $h|F \times \{t\}$ is a diffeomorphism. In that case the fibration is said to be trivial because M(g)
would be diffeomorphic to $F \times S^1$. As we shall see, this is not the case here. The isotopy class of g
is an example of a so-called pseudo-Anosov mapping in Thurston’s classification of the automor-
phisms of surfaces [20].


Fig. 4. Alternative views of Seifert’s surface.


Figure 4. Alternative Views of Seifert’s Surface. Here are some more pictures of F based on its 
definition as a disc with two oppositely twisted ribbons attached to its circumference. Unexpected 
equivalences of shapes result when a rubber sheet topologist twists ribbons. The owl 4(12) 
proceeds from the pussycat 4(11) by a Seifert move [13] that changes the twisting of ribbons into 
curling or writhing. [How their line patterns can be remembered from 3(21) is shown in 4(13).] 
More useful, if less amusing, is the cubist position 4(21), a simpler form of which will concern us
for the remainder of the paper. 

The importance of such pictorial transmutations in surface topology warrants this brief 
digression. Curl a strip of paper like the ribbons in 4(23) and pull it taut; it twists like 4(21). A
ribbon normal to the curl will writhe, like 4(22), but parallel to a vertical plane. Jim White 
explains the differential geometry of this phenomenon in [1], where its natural manifestation in 
molecular biology is uncovered. Note how right and left handedness of twist, writhe and curl 
expresses itself. What remains invariant under an isotopy of a closed ribbon in space is the 
number of times one edge links the other. The number of obvious twists in the ribbon need not 
equal this integer; the difference is called its writhing number. Both twist and writhe are real 
numbers associated with the geometry of a particular position in space [21]. However, for a
picture of a ribbon in general position, Lou Kauffman [10] found a direct, visually convenient way 
of counting twists and curls. The first integer is the net number of times the ribbon twists, 
ignoring the global overpasses. The second integer is the number of times the ribbon passes over 
itself from left to right minus the times it does so from right to left, as seen from (either of) its 
directions. Their sum is the linking number of the ribbon edges. 
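
The counting rule just described can be sketched in a few lines of Python. This is only an illustration: the function name, the argument names, and the choice of which handedness counts as positive are assumptions made here, not conventions taken from [10].

```python
def edge_linking_number(net_twists, crossing_signs):
    """Linking number of the two edges of a closed ribbon, read off a picture.

    net_twists     -- net number of full twists visible in the ribbon
                      (a signed integer; the sign convention is an assumption)
    crossing_signs -- one entry per place where the ribbon passes over itself:
                      +1 for left to right, -1 for right to left
    """
    curls = sum(crossing_signs)   # the second integer of the counting rule
    return net_twists + curls     # their sum is the linking number


# A ribbon with one full twist and no self-crossing ...
twisted = edge_linking_number(1, [])
# ... and an untwisted ribbon with one curl have the same edge linking number,
# which is why a twist can be traded for a curl without changing the ribbon
# up to isotopy.
curled = edge_linking_number(0, [+1])
print(twisted, curled)
```

The point of the sketch is that only the sum is an isotopy invariant; the two summands depend on the particular picture.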
Fig. 5. Two surfaces isotopic in $S^3$.


Figure 5. Two Surfaces Isotopic in $S^3$. Of all the positions of K and an initial spanning surface
F, 4(21) seems to be the most versatile, once the extraneous bends in K are removed by widening 
the bridges to form 5(11). To aid the eye, the knot is half striped, half cabled, and 8 windows in F 
reveal otherwise invisible details. A window is a transparent patch on the surface whose projection 
to the picture plane is a topological disc. The surface is marked by 2 circles, which cross on the 
horizontal square floor and are directed towards the viewer by white and black arrowheads as they 
cross the bridges. This is position F(0); F(180) is shown below, 5(21). Here the horizontal floor 
extends away from the knot on the outside of the box-like frame. The generating circles of F(180) 
cross at the “center” of this window through infinity. Two simple, closed, directed, once crossing 
curves on the Seifert surface (torus minus disc) are said to generate F because they determine the 
homeomorphisms of F up to isotopy. In other words, I only need to tell you where the generators 
go under a mapping in order to identify the isotopy class of the homeomorphism. To visualize the 
motion of F(0) to F(180), first imagine the floor of F(0) moving upward, like an elevator, as t
advances from 0. As the floor rises past a point on the knot, the surface turns 90° in the vicinity of 
this point. To see that the surface turns the same way all along the knot, orient the knot so that 
the striped part heads upward in back and downward in front. You should imagine the surface 
moving at other places as well, but so slowly that the picture does not change enough to require 
redrawing it. Soon, however, the floor of F(80) seems to get stuck below the white bridge. 


Fig. 6. Relevant positions of the trivial fibration.


Figure 6. Relevant Positions of the Trivial Fibration. Since the motion “through” a bridge will 
be the hardest part of the visual exercise, let’s see what it would look like if the bridges were not 
there. Fig. 6 shows the relevant stages of a disc D spanning an unknotted curve C bent into a 
shape similar to the one at hand. At D(90) the spanning disc nearly minimizes area. It bubbles out 
on top at D(100). Note that during 80 < t < 100 the surface swings almost 360° about the knot
near A, while near B it remains almost stationary. The “almost” is important, because each D(t) 
must be disjoint from every other. The bubble grows and encloses nearly all of the upper half 
space at D(170). The balloon bursts at D(180) as the disc passes through infinity. To imagine the 
rest of the tour, reflect the figures in the horizontal plane, turn them 90° and retrace your steps till 
D(360) occupies the same position as D(0). This motion describes the fibration over the circle by 
discs of the complement in $S^3$ of an unknotted, closed curve C.


Figure 7. The Shear Isotopy. This figure depicts five positions of an annulus A neighboring the 
white generator of Seifert’s surface F as it squeezes through the upper bridge between t = 80 and
t = 100. If the A(t) were superimposed into one drawing, this would correspond to 6(22) in the
example just considered. This detail of F(t) was chosen to fit into 2(12) as well as into 5(11). At
the bottom is A(80) and A(100) is at the top. The position midway between these, 7(22), is part of
F(90). Imagine A(80) shrinking almost into its minimal shape, sliding freely about the two bent
rods. To aid the imagination, I have also drawn a position for t = 85 on the left, 7(21), and t = 95
on the right, 7(23). The shrinking of A(80) to A(90) via A(85) is not quite uniform. The bridge is 
somewhat stiffer, but catches up quickly between A(85) and A(90). 

Fig. 7. The shear isotopy.


The annulus A(80) is marked with an equatorial circle, white arrows, and a crosscut, black
arrows. On A(85) and A(95) only the crosscut, and on A(90) only the equator has been drawn.
You should trace out each of the missing curves to convince yourself that their position on A(100) 
is as shown. The equator still crosses the bridge towards the front, just as on 5(21). The crosscut, 
on the other hand, now turns right, following along the equator once around, before crossing it 
and continuing to the far edge of A(100). 

To express the global situation algebraically, let b and w denote the (isotopy classes of the) 
black and white generators of F(0); B and W those of F(180). Two simple closed curves on F are 
isotopic if you can slide one on top of the other without leaving the surface. Now draw a 
hemispherical dome over 5(21) that attaches to the circular rim of the floor. This approximates the 
position F(100). As you can see from 7(11), w = W but b = B + W. The symbol B + W denotes
the class of the closed curve obtained from a pair of generators of F by cutting them at their 
common point and rejoining them in the obvious way that does not produce conflicting directions 
on the new curve. This switching operation (Umschaltung) at crossings of directed curves and its 
higher dimensional analogues are one of the fundamental “cut and paste” procedures in topology. 
It was first used by Gauss in his study of knot diagrams and it is central to Seifert’s method for 
detecting surfaces spanning knots. For the present, however, I only want to switch generators 
of F. 

Thus, the generators (b, w) of F(0) move to (B + W, W) on F(180). This suggests the name 
shear isotopy for the motion that squeezes a surface through a twisted bridge. A better reason for 
the name, whose full import must regrettably be omitted here, goes as follows. The surface F 
minus its border curve K may be regarded as a torus with a point removed. The punctured torus 
may, in turn, be parametrized by the plane $\mathbb{R} \times \mathbb{R}$ minus the lattice $\mathbb{Z} \times \mathbb{Z}$ of points with integral
coordinates. Points whose coordinates differ by an element of the translation group $\mathbb{Z} \times \mathbb{Z}$ name
the same point on $F \setminus K$. Up to isotopy, this self map restricted to $F \setminus K$ may, this way, be
represented by the linear shear transformation whose matrix is


$$\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}.$$
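
As a rough check of this description, the following Python sketch applies the shear matrix with rows (1, 1) and (0, 1) to integer coordinate vectors. It assumes a row-vector convention in which (1, 0) stands for b and (0, 1) for w, with images read in the basis (B, W); the helper names `apply` and `det` are illustrative only.

```python
SHEAR = ((1, 1),
         (0, 1))


def apply(vec, mat):
    """Multiply a row vector (x, y) by a 2x2 matrix given as a tuple of rows."""
    x, y = vec
    return (x * mat[0][0] + y * mat[1][0],
            x * mat[0][1] + y * mat[1][1])


def det(mat):
    """Determinant of a 2x2 matrix given as a tuple of rows."""
    return mat[0][0] * mat[1][1] - mat[0][1] * mat[1][0]


# The generators move as in the text: b goes to B + W, w goes to W.
image_b = apply((1, 0), SHEAR)
image_w = apply((0, 1), SHEAR)

# Determinant 1 with integer entries: the lattice Z x Z is carried onto itself,
# so the linear map descends to the punctured torus.
d = det(SHEAR)

# Two plane points differing by a lattice vector have images that again
# differ by a lattice vector, so they still name the same point of the surface.
p = (0.25, 0.5)
q = (p[0] + 3, p[1] - 2)
ip, iq = apply(p, SHEAR), apply(q, SHEAR)
shift = (iq[0] - ip[0], iq[1] - ip[1])

print(image_b, image_w, d, shift)
```

The last line of output shows the lattice shift (3, -2) carried to another integral vector, which is all that is needed for the map to be well defined on the quotient.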


Fig. 8. Self maps of the surface.


Figure 8. Self Maps of the Surface. To compute the fate of (B + W, W) as F(180) moves on
to F(360) it is convenient to reverse time and view the motion F(360, 270, 180) through the
spectacles of the turn-reflection. To see how this takes (b, w) to (−w, b), turn 8(11) to 8(12) and
reflect to obtain 8(13). [The curve — w is w with direction reversed.] A ribbon twisting to the right 
reflects to one that twists to the left. Therefore, a picture of the shear isotopy A(280, 270, 260) at 
the lower bridge would look just like A(80, 90,100) of Fig. 7, but with different markings. 
Therefore, this conjugation has the effect of taking 


$$(b, w) \to (-w, b) \to (-W, B + W) \to (B, W - B).$$


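So, since the forward motion from F(180) to F(360) takes (B, W − B) to (b, w), it also takes
W = (W − B) + B to w + b, and therefore B + W moves to b + (w + b). Since the monodromy
map induces an isomorphism of $H_1(F)$, the argument may be summarized algebraically by
decomposing its matrix representation thus


$$\begin{pmatrix} 2 & 1 \\ 1 & 1 \end{pmatrix} = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.$$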
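
The bookkeeping can be verified mechanically. The sketch below is a check under stated assumptions, not part of the original argument: it uses row vectors, with (1, 0) for b and (0, 1) for w, takes the shear with rows (1, 1), (0, 1) and the coordinate swap with rows (0, 1), (1, 0) as the factors, and multiplies them in the order shear, swap, shear, swap. The names `matmul`, `apply`, `SHEAR`, and `SWAP` are hypothetical.

```python
def matmul(a, b):
    """Product of two 2x2 integer matrices given as tuples of rows."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def apply(vec, mat):
    """Row vector times 2x2 matrix."""
    return tuple(sum(vec[k] * mat[k][j] for k in range(2)) for j in range(2))


SHEAR = ((1, 1), (0, 1))   # (b, w) -> (B + W, W)
SWAP = ((0, 1), (1, 0))    # exchanges the two coordinates

# Shear, then the same shear conjugated by the swap.
G = matmul(matmul(matmul(SHEAR, SWAP), SHEAR), SWAP)

g_b = apply((1, 0), G)        # image of b
g_w = apply((0, 1), G)        # image of w
g_c = apply((1, -1), G)       # image of the crosscut class b - w

trace = G[0][0] + G[1][1]
determinant = G[0][0] * G[1][1] - G[0][1] * G[1][0]

print(G, g_b, g_w, g_c, trace, determinant)
```

The images agree with g(b) = b + (b + w) and g(w) = b + w, and determinant 1 matches an orientation preserving map. It is a standard fact, not derived here, that a matrix of determinant 1 with trace of absolute value greater than 2 represents a pseudo-Anosov class on the punctured torus; the trace found here is 3.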

These Gauss switches are legitimate because b + w crosses each generator once, up to isotopy
on F. This need not be the case on a one-sided surface, such as the Klein bottle, 8(14), with a disc
removed. To see what happens on F, look at the “standard,” untwisted version of Seifert’s surface,
8(21), with generator b on the left, solid curve, and w on the right, dashed. The solid curve on
8(22) shows why b + w = w + b and 8(23) shows how (b + w) − b is isotopic to w. 8(24) shows
the image of b under the monodromy, g(b) = b + (b + w). You should trace into 8(24) a copy of
b + w = g(w) that crosses the given curve once, from left to right, showing that the monodromy
is orientation preserving.

My picture story is over, but there are a few loose ends to mention. Readers initiated into the 
beautiful mysteries of Stallings’ theory of fibered links, [15], [16], may have noticed that grafting
Picture 7, and its mirror image, into Picture 6 is a case of “plumbing” two oppositely oriented 
“Hopf fibrations,” [7], to form Picture 5. I did not recognize this until some time after designing 
the shear isotopy. The first graft splits the unknot into a pair of circles which link once. If you 
close up the two bent rods in each detail of Picture 7 into a pair of linked circles, L, the successive
positions of the annulus spanning the link fiber $S^3 \setminus L$ over $S^1$. If the second graft had also been
left handed, the resulting Seifert surfaces, now embedded in $S^3$ with both straps 2(23) twisted
alike, would fiber the complement of the trefoil knot. For a 3-crossing view of this knot, reverse
over by under at the lower two crossings in 1(12) and twist the pendant loop, as in 1(21), directly.
Hopf’s name is more commonly associated with the fibration of $S^3$ by circles over $S^2$, induced by
the projection of the complex plane to the complex projective line. Though the picture story 
relating these two notions of fiberings must be told another time, 8(31) suggests how the 
monodromy is a Dehn twist, as defined on p. 158 of [2]. One full clockwise turn of the striped rim 
of this annulus moves the marked crosscut, c = b — w, off the twisted bridge. But the monodromy 
does the same thing, g(b - w) =(b+w)-w=b. 

For more on this, you should study John Harer’s complete account of how every fibered link in 
$S^3$, and even in an arbitrary 3-manifold, can be assembled out of such simple pieces. The
plumbing of two Hopf links, as drawn in the square saddle shape of 4(21), illustrates a special case 
of a Murasugi sum. This generalized plumbing and its inverse operation preserve several other 
geometric properties of link complements. Originally treated algebraically, Dave Gabai explores 
these geometrically in two [8] of a series of beautifully illustrated papers on foliations and the 
topology of 3-manifolds. To him, perceiving how a spanning surface squeezes through unlikely 
places, as in the shear isotopy, is an exercise every low dimensional topologist should do. 

Manipulating knot diagrams is usually a 2½-dimensional procedure; one stays close to the
picture plane for reasons of accuracy and convenience in coding the topological information. 
When truly spatial, rotatable forms are desired, for instance to fix in the imagination what the 
diagram represents, some simple, graphical tricks come in handy. How to attach a Seifert surface
to the knot is the example at hand. A solid, tubular neighborhood, N, of K carries a canonical 
structure of simple closed curves on its toroidal surface, T, called a framing. The meridians, which 
go around T the short way, span discs inside N. Of all possible curves that go once around T the 
long way, only the so-called longitudes span a 2-sided surface in the knot exterior, $X = S^3 \setminus N$.
The Mayer-Vietoris isomorphism,


$H_1(T) = H_1(X) \oplus H_1(N)$,


characterizes these curves homologically: (1,0) is the class of meridians, (0,1) is that of the 
longitudes. Thus Seifert’s surface attaches to T along a longitude, and a longitude does not link 
the knot. 

Now from any view, the two contours of T visibly follow once around K the long way, but they 
need not be longitudes. Spanning surfaces are easier to draw when they are. I used elementary 
cusp forms, 8(32), to slide 2(12) around the knot until F attaches to T entirely along one contour, 
8(33). This led to position 7(22), midway between 7(31) and 7(11) in the shear isotopy. 


Acknowledgements. My pictures owe much to the critical eyes of the University of Illinois Geometric Potpourri. 
Technical assistance came from Wolfgang Haken, Lou Kauffman, Jim White, John Stillwell and Dave Gabai. I am 
greatly indebted to that master of descriptive topology, Jim Callahan, for invaluable advice in preparing this article. 
I would like to thank Joan Birman for explaining fibered knots to me originally, and Bob Williams for this 
irresistible challenge prefacing his own set of pictures in the appendix of [4]: 


One of the genuinely nontrivial aspects of 3-dimensional geometry-topology is the visualization of the 
fibration associated to the complement of a fibered knot in $R^3 \subset S^3$.
VERY BASIC LIE THEORY


ROGER HOWE 
Department of Mathematics, Yale University, New Haven, CT 06520 


Lie theory, the theory of Lie groups, Lie algebras and their applications, is a fundamental part 
of mathematics. Since World War II it has been the focus of a burgeoning research effort, and is
now seen to touch a tremendous spectrum of mathematical areas, including classical, differential, 
and algebraic geometry, topology, ordinary and partial differential equations, complex analysis 
(one and several variables), group and ring theory, number theory, and physics, from classical to 
quantum and relativistic. 

It is impossible in a short space to convey the full compass of the subject, but we will cite some 
examples. An early major success of Lie theory, occurring when the subject was still in its infancy, 
was to provide a systematic understanding of the relationship between Euclidean geometry and 
the newer geometries (hyperbolic non-Euclidean or Lobachevskian, Riemann’s elliptic geometry, 
and projective geometry) that had arisen in the 19th century. This led Felix Klein to enunciate his 
Erlanger Programm [KI] for the systematic understanding of geometry. The principle of Klein’s 
program was that geometry should be understood as the study of quantities left invariant by the 
action of a group on a space. Another development in which Klein was involved was the 
Uniformization Theorem [Be] for Riemann surfaces. This theorem may be understood as saying 
that every connected two-manifold is a double coset space of the isometry group of one of the 3 
(Euclidean, hyperbolic, elliptic) standard 2-dimensional geometries. (See also the recent article [F] 
in this MONTHLY.) Three-manifolds are much more complex than two-manifolds, but the
intriguing work of Thurston [Th] has gone a long way toward showing that much of their structure 
can be understood in a way analogous to the 2-dimensional situation in terms of coset spaces of 
certain Lie groups. 

More or less contemporary with the final proof of the Uniformization Theorem was Einstein’s 
[E] invention of the special theory of relativity and its instatement of the Lorentz transformation 
as a basic feature of the kinematics of space-time. Einstein’s intuitive treatment of relativity was 
followed shortly by a more sophisticated treatment by Minkowski [Mk] in which Lorentz 
transformations were shown to constitute a certain Lie group, the isometry group of an indefinite 
Riemannian metric on $R^4$. Similarly, shortly after Heisenberg [Hg] introduced his famous
Commutation Relations in quantum mechanics, which underlie his Uncertainty Principle, Her- 
mann Weyl [W] showed they could be interpreted as the structure relations for the Lie algebra of a 
certain two-step nilpotent Lie group. As the group-theoretical underpinnings of physics became 
better appreciated, some physicists, perhaps most markedly Wigner [Wg], in essence advocated 
extending Klein’s Erlanger Programm to physics. Today, indeed, symmetry principles based on 
Lie theory are a standard tool and a major source of progress in theoretical physics. Quark theory 
[Dy], in particular, is primarily a (Lie) group-theoretical construct. 

These examples could be multiplied many times. The applications of Lie theory are astonishing 
in their pervasiveness and sometimes in their unexpectedness. The articles of Borel [Bo2] and 
Dyson [Dy] mention some. The recent article of Proctor [Pr] in this MONTHLY discusses an 
application to combinatorics. Some points of contact of Lie theory with the undergraduate 
curriculum are listed in §7. 

The article of Proctor also illustrates the need to broaden understanding of Lie theory. Proctor 
did not feel he could assume knowledge of basic Lie theoretic facts. Though hardly an unknown 
subject, Lie theory is poorly known in comparison to its importance. Especially since it provides 
unity of methods and viewpoints in the many subjects to which it relates, its wide dissemination
seems worthwhile. Yet it has barely penetrated the undergraduate curriculum, and it is far from 
universally taught in graduate programs. 

Part of the reason for the pedagogy gap is that standard treatments [A], [Ch], [He] of the 
foundations of Lie theory involve substantial prerequisites, including the basic theory of differen- 
tiable manifolds, some additional differential geometry, and the theory of covering spaces. This 
approach tends to put a course in Lie theory, when available, in the second year of graduate study, 
after specialization has already begun. While a complete discussion of Lie theory does require 
fairly elaborate preparation, a large portion of its essence is accessible on a much simpler level, 
appropriate to advanced undergraduate instruction. This paper attempts to present the theory at 
that level. It presupposes only a knowledge of point set topology and calculus in normed vector 
spaces. In fact, for the Lie theory proper, only normed vector spaces are necessary. This 
simplification is achieved by not considering general or abstract Lie groups, but only groups 
concretely realized as groups of matrices. Since such groups provide the great bulk of significant 
examples of Lie groups, for many purposes this restriction is unimportant. 

The essential phenomenon of Lie theory, to be explicated in the rest of this paper, is that one 
may associate in a natural way to a Lie group G its Lie algebra g. The Lie algebra g is first of all a 
vector space and secondly is endowed with a bilinear nonassociative product called the Lie bracket 
or commutator and usually denoted [ , ]. Amazingly, the group G is almost completely determined 
by g and its Lie bracket. Thus for many purposes one can replace G with g. Since G is a 
complicated nonlinear object and g is just a vector space, it is usually vastly simpler to work with 
g. Otherwise intractable computations may become straightforward linear algebra. This is one 
source of the power of Lie theory. 

The basic object mediating between Lie groups and Lie algebras is the one-parameter group. 
Just as an abstract group is a coherent system of cyclic groups, a Lie group is a (very) coherent 
system of one-parameter groups. The purpose of the first two sections, therefore, is to provide 
some general philosophy about one-parameter groups. Section 1 provides background on homeo- 
morphism groups, and one-parameter groups are defined in a general context in §2. Discussion of 
Lie groups proper begins in §3. Technically it is independent of §§1 and 2; but these sections will, 
I hope, give some motivation for reading on. Those who need no motivation or dislike philosophy 
may go directly to §3. There one-parameter groups of linear transformations are defined and are 
described by means of the exponential map on matrices. In §4 the exponential map is studied, and 
the commutator bracket makes its appearance. Section 5 is the heart of the paper. It defines and 
gives examples of matrix groups, the class of Lie groups considered in this paper. Then it defines 
Lie algebras, and shows that every matrix group can be associated to a Lie algebra which is 
related to its group in a close and precise way. The main statement is Theorem 17, and Theorem 
19 and Corollary 20 are important complements. Finally §6 ties up some loose ends and §7, as 
noted, describes some connections of Lie theory with the standard curriculum. 

Bibliographical note: The arguments of sections 3, 4 and 5 are very close to those given by von 
Neumann [Nn] in his 1929 paper on Hilbert’s 5th problem. A modern development of basic Lie 
theory which incorporates these results is [Go]. 


1. Homeomorphism Groups 


In this section, we use the standard terminology of general topology, as for example in [Ke]. 

Let X be a set. Then the collection Bi(X) of bijections from X to itself is a group with
composition of mappings as the group law. Now suppose X is in fact a topological space. Then the
set Hm(X) of homeomorphisms from X to itself is a subgroup of Bi(X). It seems natural to try to
topologize Hm(X). The topology should of course reflect how Hm(X) acts on X, so that maps 
close to the identity move points very little. But a topology on Hm(X) should also be consistent 
with the group structure of Hm(X). More precisely and generally, given a group G, if it is to be 
made into a topological space in a manner consistent with its group structure, the topology it is 
given should satisfy two conditions.


(1.1) (i) The multiplication map $(g_1, g_2) \to g_1 g_2$ from $G \times G$ to $G$ should be continuous.
(ii) The inverse map $g \to g^{-1}$ from $G$ to $G$ should be continuous.


A topology on G satisfying these two compatibility criteria is called a group topology. A group 
endowed with a group topology is called a topological group. Some standard treatments of 
topological groups are [Hn] and [P]. 

In short, then, we would like to make Hm(X) into a topological group. This is not so
satisfactorily done for completely general X, but if X is locally compact Hausdorff, there is a nice 
topology on Hm(X), known as the compact-open topology. Before defining it, we make some 
general observations about group topologies. These will simplify the definition. 

Given a group G and an element $g \in G$, define $\lambda_g$, left-translation by g, and $\rho_g$, right-translation
by g, to be the maps


(1.2) $\lambda_g: G \to G$, $\rho_g: G \to G$
given by

$\lambda_g(g') = g g'$, $\rho_g(g') = g' g^{-1}$.
For $U \subseteq G$, set
(1.3) $gU = \lambda_g(U)$, $Ug = \rho_{g^{-1}}(U)$.


LEMMA 1. Let G be a topological group, and $g \in G$.

(a) The map $\lambda_g: G \to G$ is a homeomorphism. Similarly $\rho_g: G \to G$ is a homeomorphism.

(b) If $U \subseteq G$ is a neighborhood of the identity $1_G$ of G, then gU and Ug are neighborhoods of g.
Similarly if $V \subseteq G$ is a neighborhood of g, then $g^{-1}V$ and $Vg^{-1}$ are neighborhoods of $1_G$.


Proof. One checks from the definition of $\lambda_g$ that $\lambda$ is a homomorphism, i.e.,
(1.4) $\lambda_g \circ \lambda_h = \lambda_{gh}$


for $g, h \in G$. It follows directly from the condition (1.1) (i) that $\lambda_g$ is continuous. Likewise, the
map $\lambda_{g^{-1}}$ is also continuous. From (1.4) one concludes that


(1.5) $\lambda_{g^{-1}} = (\lambda_g)^{-1}$.
Hence $\lambda_g$ is continuous with continuous inverse, that is, a homeomorphism. The proof for $\rho_g$ is
essentially identical.

Since $\lambda_g(1_G) = g$ and $\lambda_g(U) = gU$ by definition, part (b) follows since the homeomorphic
image of an open set is open. ∎


COROLLARY 2. A group topology is determined by its system of neighborhoods of the identity. 


Proof. Indeed, a topology on G is determined by the collection of neighborhood systems of
each point of G. But according to part (b) of the lemma, for a group topology, the system of
neighborhoods around a point $g \in G$ is determined by the system of neighborhoods around $1_G$. ∎

Let us call a topology on G such that all $\lambda_g$ and $\rho_g$ are homeomorphisms a homogeneous
topology. Lemma 1 says group topologies are homogeneous. Evidently Corollary 2 applies to all
homogeneous topologies, not only group topologies. Thus an obvious question is what conditions 
must a neighborhood system at the identity satisfy in order that the associated homogeneous 
topology be a group topology? This question has a simple answer. 


LEMMA 3. A homogeneous topology on a group G is a group topology if and only if the system of
neighborhoods of $1_G$ satisfies conditions (a) and (b) below.
(a) If U is a neighborhood of $1_G$, there is another neighborhood V of $1_G$ such that $V \subseteq U^{-1}$, where


(1.6) $U^{-1} = \{g^{-1} : g \in U\}$.
(b) If U is a neighborhood of $1_G$, there are other neighborhoods V, W of $1_G$ such that $VW \subseteq U$,
where


(1.7) $VW = \{gh : g \in V, h \in W\}$.


Proof. The conditions (a) and (b) are clearly necessary for a topology to be a group topology,
since they are one way of stating that the inverse and multiplication maps are continuous at $1_G$.
We will check this for condition (b). In order for multiplication to be continuous at $1_G \times 1_G$,
given a neighborhood $U \subseteq G$ of $1_G$, we must find a neighborhood $U' \subseteq G \times G$ of $1_G \times 1_G$ such
that for any point $(g, g')$ of $U'$, the product $gg'$ is in U. But by definition of the product topology
on $G \times G$, any neighborhood $U'$ of $1_G \times 1_G$ contains a product $V \times W$, where $V, W \subseteq G$ are
both neighborhoods of $1_G$. But the image of $V \times W$ under the multiplication map is just the set
VW defined in (1.7). So condition (b) amounts to continuity of multiplication at the point
$1_G \times 1_G \in G \times G$.

Thus to complete the lemma we need to show that if the multiplication and inverse maps are
continuous at the identity, and if the topology on G is homogeneous, then they are continuous
everywhere. Let U be a neighborhood of $1_G$. Then since $\lambda_g$ is a homeomorphism, gU is a
neighborhood of g, and we may write


(1.8) $(gU)^{-1} = U^{-1} g^{-1} = \rho_g(U^{-1}) = \rho_g((\lambda_{g^{-1}}(gU))^{-1})$.


Thus on gU, the inverse map is a composition of $\lambda_{g^{-1}}$, the inverse map on U, and $\rho_g$. Since $\lambda_{g^{-1}}$
and $\rho_g$ are continuous, and $\lambda_{g^{-1}}$ takes g to $1_G$, and the inverse map is continuous at $1_G$, we see that
the inverse map is continuous at g also. The proof that multiplication is continuous everywhere is
analogous and is left as an exercise.


We return to the question of topologizing Hm(X). Corollary 2 allows us to save work in our 
definition of the topology on Hm(X) by only defining neighborhoods of the identity map $1_X$ on
X, and declaring by fiat all left or right translates of these neighborhoods also to be open sets. 
Lemma 3 tells us what we must check to know our definition yields a group topology. 

From now on, we take X to be a locally compact Hausdorff space. Let $C \subseteq X$ be compact, and
let $O \supseteq C$ be open. Define


(1.9) $U(C, O) = \{h \in \mathrm{Hm}(X) : h(C) \subseteq O, \; h^{-1}(C) \subseteq O\}$.


If $\{C_i\}$, $1 \le i \le n$, are compact subsets of X, and $\{O_i\}$ are open subsets of X such that $C_i \subseteq O_i$,
set


(1.10) $U(\{C_i\}, \{O_i\}) = \bigcap_{i=1}^{n} U(C_i, O_i)$.


DEFINITION. Let X be a locally compact Hausdorff space. The compact-open topology on
Hm(X) is the homogeneous topology such that a base for the neighborhoods of $1_X$ consists of the
sets $U(\{C_i\}, \{O_i\})$ of equation (1.10).

PROPOSITION 4. The compact-open topology on Hm(X) is a Hausdorff group topology. 

Proof. Since we have decreed the compact-open topology to be homogeneous, we need only
check the conditions of Lemma 3 to show it is a group topology. Condition (a) is automatic since
the sets U(C, O) are defined to be invariant under the inverse map on Hm(X). Let us check
condition (b). If $U_i$, $V_i$ and $W_i$ are neighborhoods of $1_X$ such that $V_i W_i \subseteq U_i$, then evidently


$\left( \bigcap_i V_i \right) \left( \bigcap_i W_i \right) \subseteq \bigcap_i U_i$.


Hence since the sets (1.10) are intersections of the sets U(C, O) of (1.9), it will be enough to check
condition (b) with the neighborhood U of the form U = U(C, O). Since X is locally compact
Hausdorff, we can by a standard separation theorem (cf. [Ke, Chap. 5, Theorem 18]) find an open
$O' \subseteq X$ such that the closure $C'$ of $O'$ is compact, and
$C \subseteq O' \subseteq C' \subseteq O$.
Then set $V = W = U(C', O) \cap U(C, O')$. If $h_1, h_2 \in V$, we find
$h_1 \circ h_2(C) = h_1(h_2(C)) \subseteq h_1(O') \subseteq h_1(C') \subseteq O$
and similarly for $(h_1 \circ h_2)^{-1} = h_2^{-1} \circ h_1^{-1}$. Thus $h_1 \circ h_2 \in U(C, O) = U$, or $VW \subseteq U$ as was to be
shown, and the compact-open topology is a group topology on Hm(X).


To show that a group topology is Hausdorff is a fairly simple matter. We record the relevant 
observation as a separate result. 


LEMMA 5. Let G be a topological group. Let $H \subseteq G$ be the intersection of all neighborhoods of $1_G$.
Then H is a normal subgroup of G. Further, G is Hausdorff if and only if $H = \{1_G\}$.


Proof. Suppose $h_1, h_2 \in H$. Given a neighborhood U of $1_G$, we can find neighborhoods V, W
of $1_G$ such that $VW \subseteq U$. Since $h_1 \in V$ and $h_2 \in W$, we see that $h_1 h_2 \in U$. Hence $h_1 h_2 \in H$ also.
In similar fashion, one sees that $h_1^{-1} \in H$. Hence H is a group. Since the conjugate $gUg^{-1}$ of a
neighborhood of $1_G$ is again a neighborhood of $1_G$, we see that H is also normal in G.

Suppose $H = \{1_G\}$. Then given $g \in G$ with $g \ne 1_G$, we can find a neighborhood U of $1_G$ such that $g \notin U$. Let
V, W be neighborhoods of $1_G$ such that $VW \subseteq U$. Then $gV^{-1}$ and W are neighborhoods of g and
of $1_G$, respectively, and are disjoint. Now consider any two distinct points $g_1, g_2 \in G$. Set $g = g_1^{-1} g_2$, and
apply the argument above. Translating on the left by $g_1$, we find $g_1 W$ and $g_2 V^{-1}$ are disjoint
neighborhoods of $g_1$ and $g_2$, respectively. Hence G is Hausdorff. ∎

From Lemma 5 we see Proposition 4 will be proved if we produce for each $h \ne 1_X$ in Hm(X) a
compact C and open O such that $h \notin U(C, O)$. Choose $x \in X$ such that $h(x) \ne x$. Then
evidently $h \notin U(\{x\}, X - \{h(x)\})$. ∎


REMARK. In fact the compact-open topology on Hm(X) is better than Proposition 4 indicates. 
It is complete with respect to an appropriate uniform structure ([Ke, Chap. 6]). Also, if X is second 
countable (hence metrizable), then Hm(X) is also second countable and metrizable.


2. One-Parameter Groups: Flows and Differential Equations 


The real number system R equipped with addition and its familiar topology is, as the reader 
may easily check, a topological group. 


DEFINITION. A one-parameter group of homeomorphisms of (the locally compact Hausdorff 
space) X is a continuous homomorphism 


(2.1) $\varphi: R \to \mathrm{Hm}(X)$.


It will be convenient to denote the image under $\varphi$ of t by $\varphi_t$ rather than $\varphi(t)$. Thus $\{\varphi_t\}$ is a
family of homeomorphisms of X satisfying the rule


(2.2) $\varphi_r \circ \varphi_s = \varphi_{r+s}$, $r, s \in R$.


Since for each t the map $\varphi_t$ acts on X, a one-parameter group of homeomorphisms of X is also
called an R-action on X, or an action by R on X.
Given a one-parameter group $\varphi_t$ of homeomorphisms of X, we can define a map


(2.3) $\Phi: R \times X \to X$,
$\Phi(t, x) = \varphi_t(x)$.
The fact that $t \to \varphi_t$ is a homomorphism is captured by the identities
(2.4) (i) $\Phi(0, x) = x$,
(ii) $\Phi(s, \Phi(t, x)) = \Phi(s + t, x)$.
The continuity of $\varphi$ is reflected in the continuity of $\Phi$. We state this fact formally. It will perhaps
also shed some light on the significance of the compact-open topology on Hm(X).


LEMMA 6. Let $\Phi: R \times X \to X$ be a map. For $t \in R$, define $\varphi_t: X \to X$ by formula (2.3).
Then $\{\varphi_t\}$ is a one-parameter group of homeomorphisms if and only if


(a) $\Phi$ satisfies identities (2.4) and


(b) $\Phi$ is continuous.


REMARK. According to this lemma, if our goal were simply to define a one parameter group of 
homeomorphisms in the quickest way, we could short-circuit the whole discussion of §1 and 
simply define a one-parameter group of homeomorphisms as a map ® satisfying the conditions of 
the lemma. However, that approach seemed unduly formalistic. 


Proof. It is a straightforward computation to verify that the identities (2.4) guarantee that for
each t the map $\varphi_t$ is in Bi(X) and $t \to \varphi_t$ is a homomorphism. Also it is obvious that the maps $\varphi_t$
will be in Hm(X) if and only if $\Phi$ is continuous in x for each fixed t. Thus the main thrust of the
lemma is that $t \to \varphi_t$ is continuous from R to Hm(X) if and only if $\Phi$ is jointly continuous in t
and x. Let us verify this.

Suppose $\Phi$ is continuous. Let $C \subseteq X$ be compact, and $O \subseteq X$ be open, with $C \subseteq O$. Choose
any $x \in C$. By identity (2.4)(i), the point $(0, x) \in R \times X$ is in $\Phi^{-1}(O)$. By continuity of $\Phi$ a
neighborhood of (0, x) is contained in $\Phi^{-1}(O)$. This means there is a neighborhood N of x in X,
and $\delta > 0$, depending on x, N, and O, such that $\Phi(t, y) \in O$ for $y \in N$ and $|t| < \delta$. In other words
$\varphi_t(y) \in O$ for $|t| < \delta$ and $y \in N$. Since C is compact, we can find a finite number of $x_i \in C$ such
that the associated neighborhoods $N_i$ cover C. Suppose then that $\Phi(t, y_i) \in O$ for $y_i \in N_i$ and
$|t| < \delta_i$. Set $\delta = \min \delta_i$. Then we have $\Phi(t, c) \in O$ for all $c \in C$ and $|t| < \delta$. In other words,
$\varphi_t \in U(C, O)$ for $|t| < \delta$. Clearly, by repeating this argument for any finite collection of compact
$C_i$'s and open $O_i$'s containing them, we can show that $\varphi_t \in U(\{C_i\}, \{O_i\})$ for all sufficiently
small t. This shows that $t \to \varphi_t$ is continuous at the origin in R. But now we appeal to the
following lemma.


LEMMA 7. Let $\varphi: G \to H$ be a homomorphism between topological groups. Then $\varphi$ is continuous if
and only if $\varphi$ is continuous at $1_G$.


The proof of this lemma is left as an exercise to the reader, who will recognize in it the same 
spirit that informs Lemmas 2, 3, and 5. 

To finish Lemma 6, we must show that the continuity of $t \to \varphi_t$ implies continuity of $\Phi$.
Choose $(t, x) \in R \times X$, and set $y = \Phi(t, x)$. Let V be a neighborhood of y. Since $\varphi_t$ is
continuous, we can find a neighborhood W of x, with compact closure $\bar{W}$, such that $\varphi_t(\bar{W}) \subseteq V$.
Since $\varphi_s$ is continuous in s, we can find $\epsilon > 0$ so that $\varphi_s \in U(\varphi_t(\bar{W}), V)$ for $|s| < \epsilon$. But then if
$(t', w) \in (t - \epsilon, t + \epsilon) \times W$, we have


$\Phi(t', w) = \Phi(t' - t, \Phi(t, w)) = \varphi_{t'-t}(\varphi_t(w)) \in \varphi_{t'-t}(\varphi_t(\bar{W})) \subseteq V$.


In other words $(t - \epsilon, t + \epsilon) \times W \subseteq \Phi^{-1}(V)$. Since V was an arbitrary neighborhood of y, we see
$\Phi$ is continuous at (t, x). Since (t, x) is arbitrary, we see $\Phi$ is continuous. ∎

Consider a one-parameter group $\varphi_t$ of homeomorphisms of X and the associated map $\Phi$
defined by formula (2.3). The map $\Phi$ is a function of two variables, t and x, and the maps $\varphi_t$ are
obtained from $\Phi$ by temporarily fixing t and letting x vary. If on the other hand we fix x and let t
vary, we get a map $t \to \Phi(t, x) = \varphi_t(x)$ which defines a continuous curve in X, traced by the
moving point $\varphi_t(x)$. Thus as t varies, each point of X moves continuously inside X, and various
points move in a coherent fashion, so that we can form a mental picture of them flowing through
X, each point along its individual path. For this reason, a one-parameter group of homeomorphisms
of X is also sometimes called a flow on X.
The notion of a flow is closely related to the theory of differential equations. Indeed, let
$X = R^n$, and write
$x = (x_1, \ldots, x_n)$, $x \in R^n$, $x_i \in R$.
Then
$\Phi(t, x) = \Phi(t, x_1, x_2, \ldots, x_n) = (\Phi_1(t, x_1, \ldots, x_n), \ldots, \Phi_n(t, x_1, \ldots, x_n))$


is a function from $R^{n+1}$ to $R^n$. Suppose $\Phi$ is not merely continuous, but differentiable. Define


$f: R^n \to R^n$
by

(2.5) $f(x) = \frac{\partial \Phi}{\partial t}(t, x) \Big|_{t=0}$.

If we differentiate (2.4) (ii) with respect to s, and set s = 0, we obtain

(2.6) $\frac{\partial \Phi(t, x)}{\partial t} = f(\Phi(t, x))$.


In other words, for fixed x, the map $y_x(t) = \Phi(t, x)$ is a solution of the system of differential
equations


(2.7) $\frac{dy}{dt} = f(y)$, that is, $\frac{dy_i}{dt} = f_i(y_1, \ldots, y_n)$ for $1 \le i \le n$.

The solution $y_x$ of (2.7) is the solution of (2.7) with initial condition $y_x(0) = x$.

The system (2.7) may be pictured geometrically as follows. At each point $y \in R^n$, one draws
the vector $f(y) = (f_1(y), f_2(y), \ldots, f_n(y))$. This gives a family of vectors which vary smoothly as
y varies; such a family is called a vector field. A solution of the system (2.7) is a parametrized
curve c(t) in $R^n$, such that at each point c(t) of the curve the tangent vector $c'(t)$ is the
pre-assigned vector f(c(t)). The 2-dimensional system


$f(x, y) = (-y, x)$


whose solutions are the circles


$(x(t), y(t)) = (a\cos(\theta + t), a\sin(\theta + t))$
is illustrated in Fig. 1.
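
As an illustration only, the 2-dimensional system above can be integrated numerically and compared with the circular solutions. The sketch below uses a classical fourth-order Runge-Kutta step, which is a choice made here and not part of the article; the names `rk4_step` and `integrate` and the sample values of a, θ and t are likewise assumptions.

```python
import math

def f(p):
    """Vector field of the 2-dimensional example: f(x, y) = (-y, x)."""
    x, y = p
    return (-y, x)

def rk4_step(p, h):
    """One classical Runge-Kutta step of size h for dp/dt = f(p) (illustrative choice)."""
    k1 = f(p)
    k2 = f((p[0] + 0.5 * h * k1[0], p[1] + 0.5 * h * k1[1]))
    k3 = f((p[0] + 0.5 * h * k2[0], p[1] + 0.5 * h * k2[1]))
    k4 = f((p[0] + h * k3[0], p[1] + h * k3[1]))
    return (p[0] + h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6,
            p[1] + h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6)

def integrate(p, t, steps=1000):
    """Follow the solution starting at p for time t using `steps` equal steps."""
    h = t / steps
    for _ in range(steps):
        p = rk4_step(p, h)
    return p

# Hypothetical sample data: radius a, starting angle theta, elapsed time t.
a, theta, t = 2.0, 0.3, 1.5
start = (a * math.cos(theta), a * math.sin(theta))
numeric = integrate(start, t)
exact = (a * math.cos(theta + t), a * math.sin(theta + t))
error = math.hypot(numeric[0] - exact[0], numeric[1] - exact[1])
```

The numerical endpoint should agree closely with the point on the circle of radius a at angle θ + t, which is what the closed-form solution predicts.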
Suppose on the other hand that for each x we have a solution $y_x(t)$ of the system (2.7) with
initial condition $y_x(0) = x$. For $s \in R$, consider the function
$y_{x,s}(t) = y_x(t + s)$.


Differentiation of $y_{x,s}$ shows it also is a solution of the system (2.7), evidently with initial value
$y_{x,s}(0) = y_x(s)$. The uniqueness part of the Existence and Uniqueness Theorem for ordinary
differential equations [L], [HS], [R], therefore implies that


(2.8) $y_x(s + t) = y_{x,s}(t) = y_{y_x(s)}(t)$.
If we then set
$\Phi(t, x) = y_x(t)$,


we find that identity (2.8) translates into identity (2.4) (ii). Of course, the initial condition
$y_x(0) = x$ is just identity (2.4) (i). It follows that $\varphi_t(x) = y_x(t)$ defines a one-parameter group of
homeomorphisms of $R^n$. For the system of Fig. 1, the map $\varphi_t$ is just rotation through an angle of t
radians.
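
The rotation flow gives a concrete check of identities (2.4) and of the definition (2.5). The following is a minimal sketch; the function names `flow` and `close` and the sample point are assumptions made for illustration.

```python
import math

def flow(t, p):
    """Phi(t, p): rotate the point p = (x, y) through t radians about the origin."""
    x, y = p
    c, s = math.cos(t), math.sin(t)
    return (c * x - s * y, s * x + c * y)

def close(p, q, tol=1e-12):
    """True when the points p and q agree coordinatewise within tol."""
    return all(abs(u - v) < tol for u, v in zip(p, q))

p = (2.0, 1.0)          # hypothetical sample point
s, t = 0.7, -1.9        # hypothetical sample times

# Identity (2.4)(i): time zero does nothing.
identity_holds = close(flow(0.0, p), p)

# Identity (2.4)(ii): flowing for t and then for s equals flowing for s + t.
group_law_holds = close(flow(s, flow(t, p)), flow(s + t, p))

# Definition (2.5): the velocity at t = 0 approximates f(x, y) = (-y, x).
h = 1e-6
q = flow(h, p)
velocity = ((q[0] - p[0]) / h, (q[1] - p[1]) / h)
```

The difference quotient `velocity` is only an approximation of the derivative, so it matches (-y, x) up to an error of order h.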

In summary, we have seen that a (smooth) one-parameter group of diffeomorphisms of R” 
yields solutions of a system of differential equations of the form (2.7), and, conversely, a solution 
(for all time and all x) of system (2.7) yields a one-parameter group. The two constructs, solutions 
of systems of ordinary differential equations, and one-parameter groups, thus provide two 
different points of view on the same mathematical phenomenon. In other words, the notion of 
one-parameter group provides a geometric and global way of looking at the solutions of a system 
of ordinary differential equations. As such, it suggests ways of attacking and obtaining informa- 
tion about ordinary differential equations, and it provides a link between systems of ordinary 
differential equations and more complex geometric objects such as the Lie groups and Lie 
algebras discussed in the following sections. 


3. One-Parameter Groups of Linear Transformations 


In this section, we show how one-parameter groups of linear transformations of a vector space 
can be described using the exponential map on matrices.

Let V be a finite dimensional real vector space. Let End(V) denote the algebra of linear maps 
from V to itself, and let GL(V) denote the group of invertible linear maps from V to itself. The 
usual name for GL(V) is the general linear group of V. If $V = R^n$, then End(V) = $M_n(R)$, the
$n \times n$ matrices, and GL(V) = $GL_n(R)$, the matrices with nonvanishing determinants.

Let || || be a norm on V (cf. [L], [N]). In the usual way there is induced an operator norm, also
denoted || ||, on End(V). We recall the definition:


(3.1) $\|A\| = \sup \{ \|A(v)\| / \|v\| : v \in V - \{0\} \}$, $A \in \mathrm{End}(V)$.


The norm on End V makes End(V) into a metric space. Since the determinant is a continuous
function on End(V), we know that GL(V) is an open subset of End V (see also (3.6) below), so it
also is a metric space.


DEFINITION. A one-parameter group of linear transformations of V is a continuous homomor- 
phism 
(3.2) M:R > GL(V). 
Thus M(t) is a collection of linear maps such that
(i) $M(0) = 1_V$, the identity of V,
(ii) $M(s)M(t) = M(s + t)$, $s, t \in R$,
(iii) M(t) depends continuously on t.


REMARKS. (a) The topology on GL(V) is easily verified to be a group topology as defined in §1.
Thus, for $A \in \mathrm{End}\, V$ and $r > 0$, set


(3.3) $\mathcal{B}_r(A) = \{A' \in \mathrm{End}\, V : \|A' - A\| < r\}$.
First, the basic formula [N, p. 76],
(3.4) $\|AB\| \le \|A\| \, \|B\|$


implies that left and right multiplication are continuous. Hence the topology is homogeneous.
Then the Neumann formula [N, p. 177],


(3.5) $(1_V - A)^{-1} = \sum_{n=0}^{\infty} A^n$,


valid for A with $\|A\| < 1$, shows that


(3.6) $(\mathcal{B}_r(1_V))^{-1} \subseteq \mathcal{B}_s(1_V)$

with $s = r/(1 - r)$ (for $r < 1$). Similarly the formula

(3.7) $(1_V + A)(1_V + B) = 1_V + A + B + AB$
shows


$\mathcal{B}_r(1_V) \, \mathcal{B}_s(1_V) \subseteq \mathcal{B}_{r+s+rs}(1_V)$.


Thus all the conditions of Lemma 3 are checked, and we have a group topology.

(b) Furthermore, it is not difficult to verify that the topology defined by the norm coincides
with the compact-open topology defined in §1 on GL(V) as a subgroup of Hm(V). This is left as
an exercise. Hence this definition of one-parameter group is a special case of the definition of §2.
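
The Neumann formula (3.5) and the estimate (3.6) of Remark (a) can be checked numerically on a small example. The sketch below is an illustration under stated assumptions: it takes V = R^2 with the sup norm, for which the induced operator norm is the largest absolute row sum, and the helper names and the sample matrix are hypothetical.

```python
def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def mat_add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def op_norm(a):
    """Operator norm induced by the sup norm on R^n: the largest absolute row sum."""
    return max(sum(abs(x) for x in row) for row in a)

def neumann_inverse(a, terms=200):
    """Partial sum 1 + A + A^2 + ... of (3.5), approximating (1 - A)^(-1) when ||A|| < 1."""
    n = len(a)
    total = identity(n)
    power = identity(n)
    for _ in range(1, terms):
        power = mat_mul(power, a)
        total = mat_add(total, power)
    return total

A = [[0.2, -0.1], [0.3, 0.1]]      # hypothetical sample with ||A|| < 1
I = identity(2)
r = op_norm(A)
inv = neumann_inverse(A)

# (1 - A) times the series should be the identity.
residual = op_norm(mat_sub(mat_mul(mat_sub(I, A), inv), I))

# Estimate behind (3.6): ||(1 - A)^(-1) - 1|| <= r / (1 - r).
deviation = op_norm(mat_sub(inv, I))
bound = r / (1 - r)
```

Any operator norm would do for the estimate; the row-sum norm is used only because it is easy to compute exactly.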

For $A \in \mathrm{End}\, V$, define


(3.8) $\exp(A) = \sum_{n=0}^{\infty} \frac{A^n}{n!}$.


Since $\|A^n\| \le \|A\|^n$, we see, by the standard estimates in the exponential series, that the series
defining exp A converges absolutely for all A and uniformly on any $\mathcal{B}_r(0)$. Hence exp defines a
smooth, in fact analytic, map from End(V) to itself. We will see shortly that in fact
$\exp A \in GL(V)$.
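
The series (3.8) can be summed directly for small matrices. The following is a minimal sketch in plain Python; truncating at 30 terms is an assumption that is adequate only for matrices of modest norm, and the function names are illustrative.

```python
import math

def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def mat_exp(a, terms=30):
    """Partial sum of (3.8): the sum of A^k / k! for k = 0, ..., terms - 1."""
    n = len(a)
    total = identity(n)
    term = identity(n)                       # holds A^k / k!
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, a)]
        total = [[s + t for s, t in zip(rs, rt)] for rs, rt in zip(total, term)]
    return total

# For a nilpotent N with N^2 = 0 the series stops after two terms: exp N = 1 + N.
nilpotent = [[0.0, 1.0], [0.0, 0.0]]
exp_nilpotent = mat_exp(nilpotent)

# For a diagonal matrix the series reduces to the scalar exponential on each entry.
diagonal = [[1.0, 0.0], [0.0, 2.0]]
exp_diagonal = mat_exp(diagonal)
expected_diagonal = [[math.e, 0.0], [0.0, math.e ** 2]]
```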


PROPOSITION 8. If A and B in End V commute with each other, then
(3.9) $\exp(A + B) = \exp A \exp B$.


Proof. Computing formally we have


$\exp A \exp B = \left( \sum_{m=0}^{\infty} \frac{A^m}{m!} \right) \left( \sum_{n=0}^{\infty} \frac{B^n}{n!} \right)$

$= \sum_{l=0}^{\infty} \left( \sum_{m+n=l} \frac{A^m B^n}{m! \, n!} \right)$

$= \sum_{l=0}^{\infty} \frac{1}{l!} \left( \sum_{k=0}^{l} \binom{l}{k} A^k B^{l-k} \right)$.


If A and B commute, the familiar binomial formula applies and says


$(A + B)^l = \sum_{k=0}^{l} \binom{l}{k} A^k B^{l-k}$.
Substituting this in our formula for exp A exp B, and noting that all manipulations are valid
because the series converge absolutely, we see the proposition follows. ∎
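
The role of the commutativity hypothesis in Proposition 8 can be seen on 2 × 2 examples. This is a sketch only: the matrices are hypothetical samples, `mat_exp` is a truncated version of the series (3.8), and the helper names are assumptions.

```python
def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def mat_add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def mat_exp(a, terms=30):
    """Truncated exponential series: the sum of A^k / k! for k < terms."""
    n = len(a)
    total = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in total]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, a)]
        total = mat_add(total, term)
    return total

def max_diff(a, b):
    """Largest absolute difference between corresponding entries."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

# A commuting pair: both matrices are of the form (scalar) * 1 + (scalar) * N.
A = [[0.3, 0.1], [0.0, 0.3]]
B = [[-0.2, 0.5], [0.0, -0.2]]
commuting_gap = max_diff(mat_exp(mat_add(A, B)), mat_mul(mat_exp(A), mat_exp(B)))

# A non-commuting pair: here exp(C + D) and exp(C) exp(D) differ.
C = [[0.0, 1.0], [0.0, 0.0]]
D = [[0.0, 0.0], [1.0, 0.0]]
noncommuting_gap = max_diff(mat_exp(mat_add(C, D)), mat_mul(mat_exp(C), mat_exp(D)))
```

For the second pair, exp C exp D has entries 2, 1, 1, 1, while exp(C + D) has entries cosh 1 and sinh 1, so the gap is far from rounding error.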


COROLLARY 9. For any $A \in \mathrm{End}\, V$, the map $t \to \exp(tA)$ is a one-parameter group of linear
transformations on V. In particular $\exp A \in GL(V)$ and $(\exp(A))^{-1} = \exp(-A)$.


Proof. Since for any real numbers s and t the matrices sA and tA commute with one another,
this corollary follows immediately from Proposition 8. ∎
The main result of this section is the converse of Corollary 9. 


THEOREM 10. Every one-parameter group M of linear transformations of V has the form 
(3.10) M(t) = exp(tA) 


for some $A \in \mathrm{End}\, V$.
The transformation A is called the infinitesimal generator of the group $t \to \exp(tA)$. The flow
illustrated in Figure 1 is in fact given by a one-parameter group with infinitesimal generator


$\begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$.
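
For this generator the exponential can be compared directly with the rotation matrices. The sketch below sums a truncated exponential series; the helper names, the truncation at 30 terms and the sample times are assumptions made for illustration.

```python
import math

def mat_mul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def mat_exp(a, terms=30):
    """Truncated exponential series: the sum of A^k / k! for k < terms."""
    n = len(a)
    total = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in total]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, a)]
        total = [[s + t for s, t in zip(rs, rt)] for rs, rt in zip(total, term)]
    return total

def max_diff(a, b):
    """Largest absolute difference between corresponding entries."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

A = [[0.0, -1.0], [1.0, 0.0]]            # generator of the flow of Fig. 1

def M(t):
    """The one-parameter group t -> exp(tA)."""
    return mat_exp([[t * x for x in row] for row in A])

def rotation(t):
    """Matrix of rotation through t radians."""
    return [[math.cos(t), -math.sin(t)], [math.sin(t), math.cos(t)]]

s, t = 0.8, 2.0                           # hypothetical sample times
rotation_gap = max_diff(M(t), rotation(t))
group_law_gap = max_diff(mat_mul(M(s), M(t)), M(s + t))

# A difference quotient at t = 0 should approximately recover the generator A.
h = 1e-6
Mh = M(h)
quotient = [[(Mh[i][j] - (1.0 if i == j else 0.0)) / h for j in range(2)] for i in range(2)]
generator_gap = max_diff(quotient, A)
```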
REMARK. Since for $v \in V$ we have
$(\exp tA)(v) = v + tA(v) + \sum_{n=2}^{\infty} \frac{t^n A^n(v)}{n!}$,
the infinitesimal generator of the one-parameter group M(t) = exp(tA) can be computed by the
formula

(3.11) $A(v) = \lim_{t \to 0} \frac{M(t)(v) - v}{t} = \frac{d}{dt} M(t)(v) \Big|_{t=0}$.

Thus the one-parameter group M(t) is associated by the discussion at the end of §2 to the system
of differential equations


(3.12) $\frac{dv}{dt} = A(v)$.


These equations are of course basic in the theory of linear systems, which is applied in electrical
engineering, economics, etc. If we know that M(t)(v) is differentiable, then the existence and
uniqueness theorem for differential equations implies Theorem 10, but we do not know a priori
that M(t) is differentiable. The burden of the proof of Theorem 10 is to get around this ignorance,
thereby establishing that a merely continuous map $t \to M(t)$ satisfying the group law (3.2) (ii) is
in fact analytic. This is a recurrent theme in Lie theory, and is also expressed in the main theorem
(Theorem 17) of this paper. It found its ultimate expression in Hilbert’s 5th Problem: to show that
if a topological group is locally (i.e., a neighborhood of every point is) homeomorphic to
Euclidean space, then the group is in fact an analytic manifold with analytic group law (a Lie
group). This problem was resolved positively in the early 1950’s by A. Gleason [G]. See also [Kal],
[MZ].

We take up now the proof of Theorem 10. It will require some preliminary results. 

Let $\mathcal{B}_r(A)$ be the open ball of radius r around A, as defined in formula (3.3).


PROPOSITION 11. For sufficiently small $r > 0$, the map exp takes $\mathcal{B}_r(0)$ bijectively onto an open
neighborhood of $1_V$ in GL(V). One has $\exp(\mathcal{B}_r(0)) \subseteq \mathcal{B}_s(1_V)$ where $s = e^r - 1$.


Proof. Let D exp_A be the differential of exp at A. It is a linear map from End(V) to End(V)
defined by

D\exp_A(B) = \lim_{t \to 0} \frac{\exp(A + tB) - \exp A}{t}.

From the definition (3.8) of exp, it is easy to compute that

D\exp_0(B) = B.

That is, D exp_0 is the identity map on End(V). In particular D exp_0 is invertible. Therefore the first
statement of the proposition follows from the Inverse Function Theorem [L], [R]. The inclusion
exp(B_r(0)) ⊆ B_s(1_V) follows from the obvious termwise estimation of exp(A) − 1_V.


REMARK. If one defines

(3.13) \log(1_V - A) = -\sum_{n=1}^{\infty} \frac{A^n}{n},

then just as for real numbers, one sees this series converges absolutely for ||A|| < 1. Further, for all
B ∈ B_1(1_V) one has

(3.14) exp(log B) = B.

Formula (3.14) is known in the scalar case, and this implies that in fact (3.14) is an identity in
absolutely convergent power series, whence it follows in the matrix case. The formulas (3.13) and
(3.14) allow an alternate proof of Proposition 11 which avoids appeal to the Inverse Function
Theorem and gives the explicit estimate that exp is 1-1 on B_{log 2}(0). However, this explicit value of
r is not needed, and we need in any case to appeal to the Inverse Function Theorem below in
Theorem 17, so this more explicit proof of Proposition 11 gives us no particular benefit.


PROPOSITION 12. Choose an r < log 2, and let T be in exp B_r(0), say T = exp A. Then the
transformation S = exp(A/2) is a square root of T; that is, S² = T. Moreover, S is the unique
square root of T contained in exp B_r(0).


Proof. That S² = T follows directly from Proposition 8. It is only necessary to prove the
uniqueness of S. From Proposition 11, we see that our restriction on r implies exp B_r(0) ⊆ B_1(1_V).
Hence it will suffice to show that if A, B are distinct linear maps of norm less than 1, then
(1_V + A)² ≠ (1_V + B)². Suppose the contrary. Then expanding the squares, cancelling the 1_V’s
and transposing, we find the equation

2(A - B) = B^2 - A^2 = B(B - A) + (B - A)A.

Taking norms yields

2\|A - B\| \le \|B\| \|B - A\| + \|B - A\| \|A\| = (\|B\| + \|A\|) \|B - A\|.

This implies either ||A − B|| = 0, which is false since A ≠ B, or ||A|| + ||B|| ≥ 2, which is false
since both ||A|| and ||B|| are less than 1. This contradiction establishes the uniqueness of S. ∎


Proof of Theorem 10. Let t → M(t) be a continuous one-parameter group in GL(V). Since
M(0) = 1_V, if we specify r > 0, we may by continuity and Proposition 11 find an ε > 0 such that
M(t) ∈ exp(B_r(0)) for |t| ≤ ε. We take r < log 2. Write

M(ε) = exp A_1,

for appropriate A_1 ∈ B_r(0). If we set

A = \frac{1}{\varepsilon} A_1,

then M(ε) = exp(εA). The transformations M(ε/2) and exp((ε/2)A) are then both square roots
of M(ε) lying in exp(B_r(0)). By Proposition 12 we conclude

M(ε/2) = exp((ε/2)A).

An obvious induction using Proposition 12 shows that

M(2^{-n} \varepsilon) = \exp(2^{-n} \varepsilon A)

for all positive integers n. Taking mth powers, we conclude

M(m 2^{-n} \varepsilon) = \exp(m 2^{-n} \varepsilon A)

for all integers m and n. Since the numbers m 2^{-n} ε are dense in R, Theorem 10 follows by
continuity. ∎


4. Properties of the Exponential Map 


The map exp is the basic link between the linear structure on End V and the multiplicative 
structure on GL(V). We will describe some salient properties of this link. 

Choose r with 0 < r < 1/2 such that exp is one-to-one on B_r(0). Choose r_1 < r so that if
A, B ∈ B_{r_1}(0), then exp A exp B is contained in exp B_r(0). Then we can write


(4.1) exp A exp B = exp C 


for some C ∈ B_r(0). The Inverse Function Theorem guarantees that C is a smooth (in fact
analytic) function of A and B. There is a beautiful formula, the Campbell-Hausdorff formula [J1], 
[Se], which expresses C as a universal power series in A and B. To develop this completely would 
take too long. We will just give the first two terms in the expression for C. These suffice for most 
purposes. 

For A, B ∈ End V, write

(4.2) [A, B] = AB − BA.


The quantity [A, B] is called the commutator of A and B, and will be seen later to provide the Lie 
bracket operation in the Lie algebras we construct. 


PROPOSITION 13. Suppose A, B, C have norm at most 1/2 and satisfy equation (4.1). Then we
have

(4.3) C = A + B + \frac{1}{2}[A, B] + S,

where the remainder term S satisfies

(4.4) \|S\| \le 65 (\|A\| + \|B\|)^3.


Proof. We have

(4.5) \exp C = 1_V + C + R_1(C),

where the remainder R_1(C) is

R_1(C) = \sum_{n=2}^{\infty} \frac{C^n}{n!}

and satisfies the obvious estimate

\|R_1(C)\| \le \|C\|^2 \sum_{n=2}^{\infty} \frac{\|C\|^{n-2}}{n!} \le \|C\|^2

when ||C|| ≤ 1, hence certainly when ||C|| ≤ 1/2.
Similarly we have

(4.6) \exp A \exp B = 1_V + A + B + R_1(A, B),

where by rearrangement of the double sum

R_1(A, B) = \sum_{n=2}^{\infty} \frac{1}{n!} \sum_{k=0}^{n} \binom{n}{k} A^k B^{n-k}.

Hence we have the estimate

\|R_1(A, B)\| \le (\|A\| + \|B\|)^2 \sum_{n=2}^{\infty} \frac{(\|A\| + \|B\|)^{n-2}}{n!} \le (\|A\| + \|B\|)^2

when ||A|| + ||B|| ≤ 1.
Comparing equations (4.5) and (4.6), we see that equation (4.1) implies

(4.7) C = A + B + R_1(A, B) - R_1(C).

Hence

\|C\| \le \|A\| + \|B\| + (\|A\| + \|B\|)^2 + \|C\|^2 \le 2(\|A\| + \|B\|) + \frac{1}{2}\|C\|

when A, B and C all have norm at most 1/2.
Thus

(4.8) \|C\| \le 4(\|A\| + \|B\|).

Returning to equation (4.7), we further find

(4.9) \|C - (A + B)\| \le \|R_1(A, B)\| + \|R_1(C)\| \le (\|A\| + \|B\|)^2 + (4(\|A\| + \|B\|))^2 = 17(\|A\| + \|B\|)^2.


We now refine these estimates to second order. In analogy with (4.5) we have

(4.10) \exp C = 1_V + C + \frac{1}{2} C^2 + R_2(C),

where

R_2(C) = \sum_{n=3}^{\infty} \frac{C^n}{n!}

is easily estimated by

(4.11) \|R_2(C)\| \le \frac{1}{3} \|C\|^3

when ||C|| ≤ 1.
If we substitute expression (4.3) for C in equation (4.10), we obtain

(4.12) \exp C = 1_V + A + B + \frac{1}{2}[A, B] + S + \frac{1}{2} C^2 + R_2(C)
             = 1_V + A + B + \frac{1}{2}[A, B] + \frac{1}{2}(A + B)^2 + T
             = 1_V + A + B + \frac{1}{2}(A^2 + 2AB + B^2) + T,

where

T = S + \frac{1}{2}(C^2 - (A + B)^2) + R_2(C).

On the other hand, we have

(4.13) \exp A \exp B = 1_V + A + B + \frac{1}{2}(A^2 + 2AB + B^2) + R_2(A, B),

where

R_2(A, B) = \sum_{n=3}^{\infty} \frac{1}{n!} \sum_{k=0}^{n} \binom{n}{k} A^k B^{n-k}

satisfies \|R_2(A, B)\| \le \frac{1}{3} (\|A\| + \|B\|)^3 when ||A|| + ||B|| ≤ 1.
Comparison of (4.12) and (4.13) in the light of (4.1) yields

S = R_2(A, B) + \frac{1}{2}((A + B)^2 - C^2) - R_2(C).
Taking norms, we find

\|S\| \le \|R_2(A, B)\| + \frac{1}{2} \|(A + B)(A + B - C) + (A + B - C)C\| + \|R_2(C)\|
     \le \frac{1}{3} (\|A\| + \|B\|)^3 + \frac{1}{2} (\|A\| + \|B\| + \|C\|) \|A + B - C\| + \frac{1}{3} \|C\|^3
     \le \frac{1}{3} (\|A\| + \|B\|)^3 + \frac{5}{2} (\|A\| + \|B\|) \cdot 17 (\|A\| + \|B\|)^2 + \frac{1}{3} (4(\|A\| + \|B\|))^3
     \le 65 (\|A\| + \|B\|)^3,

as was to be shown.

We will derive two main consequences of Proposition 13. These relate group operations in 
GL(V) to the linear operations in End(V), and are crucial ingredients in the proof of the main 
theorem (Theorem 17 in §5) that relates Lie algebras to Lie groups. Proposition 14 relates group 
multiplication in GL(V) to addition in End(V), and Proposition 15 relates the group commutator 
operation to the bilinear commutator bracket defined in equation (4.2). 


PROPOSITION 14 (Trotter Product Formula). For A, B ∈ End V, one has

(4.14) \exp(A + B) = \lim_{n \to \infty} (\exp(A/n) \exp(B/n))^n.


Proof. For n large enough, A/n and B/n will be close enough to the origin that formula (4.3)
applies. We then have

\exp(A/n) \exp(B/n) = \exp C_n,

where by estimate (4.9)

\|C_n - (A + B)/n\| \le 17 ((\|A\| + \|B\|)/n)^2.

Hence as n → ∞, we see that nC_n → A + B. Since exp nC_n = (exp C_n)^n, equation (4.14) follows.
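
The convergence in (4.14) can be observed numerically. The sketch below is only an illustration: the two non-commuting 2 × 2 matrices, the helper names and the max-entry error measure are choices made here, not part of the text. By the estimate (4.9), the error is expected to shrink roughly like 1/n.

```python
def mat_mul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def identity(n):
    return [[float(i == j) for j in range(n)] for i in range(n)]

def scaled(c, X):
    return [[c * entry for entry in row] for row in X]

def mat_exp(X, terms=40):
    """Partial sum of the exponential series."""
    result, term = identity(len(X)), identity(len(X))
    for k in range(1, terms):
        term = scaled(1.0 / k, mat_mul(term, X))
        result = [[a + b for a, b in zip(r, s)] for r, s in zip(result, term)]
    return result

def mat_pow(X, n):
    """X^n for an integer n >= 0, by repeated squaring."""
    result, base = identity(len(X)), X
    while n > 0:
        if n & 1:
            result = mat_mul(result, base)
        base = mat_mul(base, base)
        n >>= 1
    return result

def max_diff(X, Y):
    return max(abs(a - b) for r, s in zip(X, Y) for a, b in zip(r, s))

# Two matrices that do not commute (an arbitrary choice for the demo).
A = [[0.0, 1.0], [0.0, 0.0]]
B = [[0.0, 0.0], [1.0, 0.0]]
target = mat_exp([[a + b for a, b in zip(r, s)] for r, s in zip(A, B)])

def trotter_error(n):
    """Distance between (exp(A/n) exp(B/n))^n and exp(A + B)."""
    step = mat_mul(mat_exp(scaled(1.0 / n, A)), mat_exp(scaled(1.0 / n, B)))
    return max_diff(mat_pow(step, n), target)

for n in (1, 10, 100, 1000):
    print(n, trotter_error(n))
```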


Recall that the (linear) commutator [A, B] is defined in equation (4.2). Recall also that if g, h
are elements of a group, then the group commutator of g and h, written (g : h), is the expression

(g : h) = g h g^{-1} h^{-1}.

PROPOSITION 15 (Commutator formula). For A, B ∈ End V, one has

(4.15) \exp[A, B] = \lim_{n \to \infty} (\exp(A/n) \exp(B/n) \exp(-A/n) \exp(-B/n))^{n^2}
                 = \lim_{n \to \infty} ((\exp(A/n) : \exp(B/n)))^{n^2}.


Proof. As in Proposition 14, for large n we have

\exp(A/n) \exp(B/n) = \exp C_n = \exp\left( (A + B)/n + \frac{1}{2} \frac{[A, B]}{n^2} + S_n \right),

where

\|S_n\| \le 65 \frac{(\|A\| + \|B\|)^3}{n^3}.

Similarly

\exp(-A/n) \exp(-B/n) = \exp\left( -(A + B)/n + \frac{1}{2} \frac{[A, B]}{n^2} + S'_n \right) = \exp C'_n

with also

\|S'_n\| \le 65 \frac{(\|A\| + \|B\|)^3}{n^3}.

Hence

(\exp(A/n) : \exp(B/n)) = \exp C_n \exp C'_n = \exp E_n,

where

E_n = C_n + C'_n + \frac{1}{2}[C_n, C'_n] + T_n
    = \frac{[A, B]}{n^2} + \frac{1}{2}[C_n, C'_n] + S_n + S'_n + T_n,

where T_n is the term S in equation (4.3) if A = C_n and B = C'_n.
It will suffice to show that there is a number γ, depending on A and B, such that

\left\| E_n - \frac{[A, B]}{n^2} \right\| \le \frac{\gamma}{n^3}.

For then

(\exp E_n)^{n^2} = \exp([A, B] + U_n)

with ||U_n|| ≤ γ/n, and equation (4.15) follows. In turn, it will suffice to show that the 2nd, 3rd, 4th
and 5th terms in the expression for E_n are each less than a constant times n^{-3}. For S_n, S'_n and T_n,
this follows from Proposition 13. Thus we need only worry about [C_n, C'_n]. We compute

[C_n, C'_n] = \left[ \frac{1}{n}(A + B) + \frac{1}{2n^2}[A, B] + S_n, \; -\frac{1}{n}(A + B) + \frac{1}{2n^2}[A, B] + S'_n \right]
            = \frac{1}{n^3}[A + B, [A, B]] + \frac{1}{n}[A + B, S_n + S'_n] + \frac{1}{2n^2}[[A, B], S'_n - S_n] + [S_n, S'_n].

Using Proposition 13, we see that each of the four terms in this last sum is bounded by a constant
times n^{-3}. (In fact, all terms except the first are bounded by a constant times n^{-4}.)
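
The commutator formula (4.15) can be checked numerically in the same spirit. The sketch below is illustrative only: the matrices A and B, the helper names and the error measure are assumptions of this example. Since the proof bounds the correction U_n by a constant times 1/n, convergence is slow and the test values of n are kept modest.

```python
def mat_mul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def identity(n):
    return [[float(i == j) for j in range(n)] for i in range(n)]

def scaled(c, X):
    return [[c * entry for entry in row] for row in X]

def mat_sub(X, Y):
    return [[a - b for a, b in zip(r, s)] for r, s in zip(X, Y)]

def mat_exp(X, terms=40):
    """Partial sum of the exponential series."""
    result, term = identity(len(X)), identity(len(X))
    for k in range(1, terms):
        term = scaled(1.0 / k, mat_mul(term, X))
        result = [[a + b for a, b in zip(r, s)] for r, s in zip(result, term)]
    return result

def mat_pow(X, n):
    """X^n for an integer n >= 0, by repeated squaring."""
    result, base = identity(len(X)), X
    while n > 0:
        if n & 1:
            result = mat_mul(result, base)
        base = mat_mul(base, base)
        n >>= 1
    return result

def max_diff(X, Y):
    return max(abs(a - b) for r, s in zip(X, Y) for a, b in zip(r, s))

def bracket(X, Y):
    """Linear commutator [X, Y] = XY - YX, as in (4.2)."""
    return mat_sub(mat_mul(X, Y), mat_mul(Y, X))

A = [[0.0, 1.0], [0.0, 0.0]]
B = [[0.0, 0.0], [1.0, 0.0]]
target = mat_exp(bracket(A, B))

def commutator_error(n):
    """Distance between the group commutator raised to n^2 and exp[A, B]."""
    step = identity(2)
    for sign, M in ((1, A), (1, B), (-1, A), (-1, B)):
        step = mat_mul(step, mat_exp(scaled(sign / n, M)))
    return max_diff(mat_pow(step, n * n), target)

for n in (5, 10, 20, 40):
    print(n, commutator_error(n))
```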

There is one further concept involving the exponential map that is basic to Lie theory. It
involves conjugation, which is generally referred to as the “adjoint action.” For g ∈ GL(V) and
A ∈ End V, we can form the conjugate

(4.16) Ad g(A) = g A g^{-1}.

The following proposition is easily verified and left as an exercise.
PROPOSITION 16. (i) Ad g(aA + bB) = a Ad g(A) + b Ad g(B) for A, B ∈ End V; a, b ∈ R;
and g ∈ GL(V).
(ii) Ad g(AB) = Ad g(A) Ad g(B).
(iii) Ad g_1g_2(A) = Ad g_1(Ad g_2(A)).
Formulas (i) and (ii) say Ad g is an algebra automorphism of End V, and Formula (iii) says the
map Ad: g → Ad g is a group homomorphism from GL(V) to the automorphism group of
End(V). The map Ad is called the adjoint action of GL(V) on End(V).
Formula (iii) implies in particular that if exp tA is a one-parameter subgroup of GL(V), then
Ad exp tA is a one-parameter group of linear transformations on End V. Hence Ad exp tA has
infinitesimal generator \mathcal{A} ∈ End(End V). We can compute \mathcal{A} by the formula

\mathcal{A}(B) = \lim_{t \to 0} \frac{(\exp tA) B (\exp(-tA)) - B}{t}
              = \frac{d}{dt} \left( (\exp tA) B (\exp(-tA)) \right) \Big|_{t=0}
              = \left( A (\exp tA) B (\exp(-tA)) + (\exp tA) B (-A) (\exp(-tA)) \right) \Big|_{t=0}
              = AB - BA = [A, B].

Here we have used the fact that

\frac{d}{dt} (\exp(tA)) = A \exp(tA).


This formula may be verified by direct calculation from the definition of exp(tA). Hence if we
define

ad A: End V → End V

by

ad A(B) = [A, B],

we have the following formula.
PROPOSITION 17. For A ∈ End V,
(4.17) Ad(exp A) = exp(ad A).
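
Formula (4.17) says that conjugating B by exp A agrees with applying the series for exp(ad A) to B, that is, B + [A, B] + [A, [A, B]]/2! + ... . The following sketch compares the two sides numerically; the sample matrices, the helper names and the 40-term truncation are assumptions made for this illustration.

```python
def mat_mul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def scaled(c, X):
    return [[c * entry for entry in row] for row in X]

def mat_add(X, Y):
    return [[a + b for a, b in zip(r, s)] for r, s in zip(X, Y)]

def bracket(X, Y):
    """ad X applied to Y, that is [X, Y] = XY - YX."""
    return mat_add(mat_mul(X, Y), scaled(-1.0, mat_mul(Y, X)))

def mat_exp(X, terms=40):
    """Partial sum of the exponential series."""
    n = len(X)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = scaled(1.0 / k, mat_mul(term, X))
        result = mat_add(result, term)
    return result

def conjugate_by_exp(A, B):
    """Left side of (4.17) applied to B: (exp A) B (exp(-A))."""
    return mat_mul(mat_mul(mat_exp(A), B), mat_exp(scaled(-1.0, A)))

def exp_ad(A, B, terms=40):
    """Right side of (4.17) applied to B: sum over k of (ad A)^k(B) / k!."""
    result, term = B, B
    for k in range(1, terms):
        term = scaled(1.0 / k, bracket(A, term))
        result = mat_add(result, term)
    return result

def max_diff(X, Y):
    return max(abs(a - b) for r, s in zip(X, Y) for a, b in zip(r, s))

A = [[0.3, -1.2], [0.5, 0.1]]
B = [[1.0, 2.0], [-0.5, 0.7]]
print(max_diff(conjugate_by_exp(A, B), exp_ad(A, B)))
```

The printed difference should be at the level of floating-point rounding.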


5. The Lie Algebra of a Matrix Group 


By a matrix group we mean a closed subgroup of GL(V) for some vector space V. This section 
shows a matrix group is a Lie group. What that means is expressed in Theorem 17. Most, though 
not all, Lie groups can be realized as matrix groups. This article discusses only matrix groups. 


EXAMPLES. (i) GL(V) itself. 


(ii) SL_n(R), the special linear group of n × n matrices of determinant 1.


(iii) O_{p,q}, the “pseudo-orthogonal groups,” consisting of all matrices in GL_{p+q}(R) that
preserve the indefinite inner product

(x, x')_{p,q} = \sum_{i=1}^{p} x_i x'_i - \sum_{i=p+1}^{p+q} x_i x'_i, \quad x, x' \in \mathbb{R}^{p+q}.


(iv) Sp_{2n}(R), the real symplectic group, consisting of all matrices in SL_{2n}(R) that preserve
the skew-symmetric bilinear form

\langle x, x' \rangle = \sum_{i=1}^{n} (x_i x'_{i+n} - x_{i+n} x'_i), \quad x, x' \in \mathbb{R}^{2n}.


(v) The group P(U) of transformations that preserve a subspace U of V. For instance, if
V = R^n, and U_m = R^m = {(x_1, x_2, ..., x_m, 0, 0, ..., 0)}, where m ≤ n, then

P(U_m) = \left\{ \begin{pmatrix} A & X \\ 0 & B \end{pmatrix} : A \in GL_m(\mathbb{R}), \; B \in GL_{n-m}(\mathbb{R}), \; X \in M_{m,n-m}(\mathbb{R}) \right\}.

Here M_{m,n-m}(R) is the space of m × (n − m) real matrices.

(vi) Any intersection of matrix groups is a matrix group. For instance, the intersection
\bigcap_{m=1}^{n} P(U_m) of the groups P(U_m) of example (v) is the group of invertible upper
triangular matrices.

(vii) The group preserving some closed subgroup, not necessarily a subspace, of V. For
example, let Z^n ⊆ R^n be the discrete subgroup of vectors with integral entries. Set

GL_n(Z) = {A ∈ GL_n(R) : A(Z^n) = Z^n}.

Then GL_n(Z) can also be shown to consist of matrices with integer entries and
determinant ±1.

(viii) The group commuting with some family {T_i} of operators on V is a matrix group. For
example, we can identify C^n with R^{2n} by letting x_{2i-1} and x_{2i} be the real and imaginary
parts of the coordinate z_i of z = (z_1, z_2, ..., z_n) ∈ C^n. If we do so, the operation of
multiplication by a complex scalar becomes some (real) linear operator on R^{2n}. Further,
the group GL_n(C) becomes identified with the subgroup of GL_{2n}(R) formed by
elements which commute with the multiplications by complex scalars.

(ix) If G_i is a matrix group in GL(V_i), i = 1, 2, then G_1 × G_2 is a matrix group in
GL(V_1 ⊕ V_2) in the obvious way.

(x) If G is a matrix group, then G^0, the connected component of the identity in G, is a
matrix group.

(xi) The normalizer in GL(V) of a matrix group is a matrix group. 


The main result of this section is the essential phenomenon behind Lie theory: a matrix group 
has naturally attached to it a Lie algebra. Before showing this we recall what a Lie algebra is. 


DEFINITION. A real Lie algebra g is a real vector space equipped with a product

(5.1) [ , ] : g × g → g

satisfying the identities (5.2):

(i) (Bilinearity). For a, b ∈ R and x, y, z ∈ g,
[ax + by, z] = a[x, z] + b[y, z],
[z, ax + by] = a[z, x] + b[z, y].

(ii) (Skew symmetry). For x, y ∈ g,
[x, y] = −[y, x].

(iii) (Jacobi Identity). For x, y, z ∈ g,
[x, [y, z]] + [z, [x, y]] + [y, [z, x]] = 0.


The first main example of a Lie algebra is End V equipped with the bracket operation [ ,] of 
commutator, as given in equation (4.2). It is left as an exercise to verify that this satisfies the 
correct identities. Any subspace of End V which is closed under [ ,] will become a Lie algebra in its 
own right. Since our main theorem will provide us with such a subspace for each matrix group, we 
will postpone a more explicit discussion of examples. 

Consider a matrix group G ⊆ GL(V). Let exp^{-1}(G) ⊆ End V be the inverse image of G under
exp. Since exp(nA) = (exp A)^n, it is clear that exp^{-1}(G) is closed under scalar multiplication by
integers. Set

g = \{A \in \mathrm{End}\, V : \exp tA \in G \text{ for all } t \in \mathbb{R}\} = \bigcap_{t \in \mathbb{R}^{\times}} t \exp^{-1}(G).


Observe that g is the collection of infinitesimal generators of one-parameter subgroups of G. We 
call g the Lie algebra of G. 


THEOREM 17. (a) The Lie algebra g of a matrix group G is a Lie algebra.
(b) The map exp: g → G maps a neighborhood of 0 in g bijectively onto a neighborhood of 1_V
in G.


REMARKS. (i) Part (b) of Theorem 17 implies G is locally homeomorphic to Euclidean space. In
fact it is not hard to refine part (b) and show that G has the structure of a smooth manifold, such
that the group multiplication is smooth, but we will not do that here.

(ii) Theorem 17 provides a geometric picture of the relation between g and G. If a one-parameter
group exp(tA) is regarded as a curve inside the vector space End V, then this curve passes
through the identity 1_V at time t = 0. By differentiating the formula for exp tA, we see the tangent
vector at the point 1_V to this curve is just A. Thus, as we have defined it, g consists simply of all
tangent vectors to the curves defined by one-parameter groups in G. But Theorem 17 asserts that
these tangent vectors actually fill out some linear subspace (namely g) of End V, and further, if we
make the smooth change of coordinates A → exp A, then this linear subspace g is bent in such a
way that it lies entirely in G, and fills up G around 1_V. In other words, G is shown to be a smooth
multidimensional surface inside End V, and g is simply its tangent space at the point 1_V.

The main burden of the proof of Theorem 17 is carried by the following technical result. 


LEMMA 18. Suppose {A_n} is a sequence in exp^{-1}(G), and ||A_n|| → 0. Let s_n be a sequence of real
numbers. Then any cluster point of s_n A_n is in g.


Proof. Let B be the cluster point. By passing to a subsequence if necessary we may assume that
s_n A_n converges to B. Fix a number t ∈ R. Let m_n be an integer such that |m_n − t s_n| ≤ 1. Then
m_n A_n converges to tB; for we have

\|m_n A_n - tB\| = \|(m_n - t s_n) A_n + t(s_n A_n - B)\|
                \le |m_n - t s_n| \|A_n\| + |t| \|s_n A_n - B\|
                \le \|A_n\| + |t| \|s_n A_n - B\|,

which converges to zero as n → ∞, by our assumptions on A_n and B. Since m_n A_n ∈ exp^{-1}(G),
and exp^{-1}(G) is closed, we see that tB ∈ exp^{-1}(G). Since t was arbitrary in R, we see that
B ∈ g. ∎
Proof of Theorem 17. We first show g is a subspace of End V. Since g is by definition closed
under scalar multiplication, we need only show it is closed under addition. Take A, B ∈ g. Then
as in Proposition 14 we know that for large enough n

exp(A/n) exp(B/n) = exp C_n,

where ||C_n|| → 0, and nC_n → A + B. Hence Lemma 18 implies A + B ∈ g.


Next we show that if A, B ∈ g, then also [A, B] ∈ g. As in Proposition 15 we know that for
large n we have

(exp(A/n) : exp(B/n)) = exp E_n

with E_n → 0 and n²E_n → [A, B]. Another application of Lemma 18 says [A, B] ∈ g. This
concludes part (a) of Theorem 17.
We know g is a linear subspace of End(V). Let Y ⊆ End(V) be a complementary subspace of
g, so that End V = g ⊕ Y. Let p_1 and p_2 be the projections of End V on g and Y, respectively,
with respective kernels Y and g. Define a map E: End V → GL(V) by

E(A) = exp(p_1(A)) exp(p_2(A)).

By use of Proposition 13, we can compute that

\frac{d}{dt} \left( \exp(p_1(tA)) \exp(p_2(tA)) \right) \Big|_{t=0} = p_1(A) + p_2(A) = A.

This says that the differential of E at 0 is the identity map on End V, so that E takes small
neighborhoods of 0 to neighborhoods of 1_V bijectively, by the Inverse Function Theorem. Choose
a small ball B_r(0) ⊆ End V, and suppose exp(B_r(0) ∩ g) does not cover a neighborhood of 1_V in
G. Then we can find a sequence B_n ∈ exp^{-1}(G) such that B_n → 0, but B_n ∉ g. When B_n is close
enough to 0, we may write

exp B_n = E(A_n)

for some A_n. We will have A_n → 0 as B_n → 0. Then

\exp(p_2(A_n)) = \exp(p_1(A_n))^{-1} \exp B_n

is also in G, and p_2(A_n) is nonzero by our assumption on B_n. Since A_n → 0, p_2(A_n) → 0 also. The
sequence ||p_2(A_n)||^{-1} p_2(A_n) will have cluster points, and these must be in g by Lemma 18. On the
other hand, p_2(A_n) ∈ Y, so all cluster points must be in Y. This contradicts the fact that Y was
chosen complementary to g, so statement (b) of Theorem 17 follows. ∎


EXAMPLES. We will describe below the Lie algebras of some of the groups listed at the 
beginning of this section. The verification that the indicated Lie algebras are indeed the Lie 
algebras of the stated groups is left as an exercise. 


(i) The Lie algebra of GL(V) is of course End(V). 

(ii) The Lie algebra of SL_n(R) is the space sl_n(R) of n × n matrices of trace zero.

(iii) Let β be a bilinear form on V. The isometry group of β is the group of invertible operators
A such that

β(Au, Av) = β(u, v) for all u, v ∈ V.

The Lie algebra of this group is the space of operators B such that

β(Bu, v) + β(u, Bv) = 0.

In particular the Lie algebra o_n(R) of the orthogonal group O_n(R) of isometries of the
standard inner product on R^n is the space of skew-symmetric matrices.

(iv) The Lie algebra of the subgroup of GL(V) of maps commuting with given operators {T_i}
is the subalgebra of End V commuting with the T_i.

(v) The Lie algebra of the group P({V_i}) of invertible transformations which preserve each
of the subspaces V_i of V is the subalgebra of all transformations which preserve the V_i. In
particular, the Lie algebra of the group of invertible upper triangular matrices is the
vector space of all upper triangular matrices.

(vi) The Lie algebra of G_1 ∩ G_2, for matrix groups G_i, is g_1 ∩ g_2.

(vii) A matrix group G and its identity component G^0 have the same Lie algebra.
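
One direction of examples (ii) and (iii) is easy to test numerically: the exponential of a trace-zero matrix should have determinant 1, and the exponential of a skew-symmetric matrix should be orthogonal. The sketch below does this for one sample matrix of each kind; the matrices and helper names are arbitrary choices for this illustration, and a numerical check of this sort is of course not a proof.

```python
def mat_mul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def identity(n):
    return [[float(i == j) for j in range(n)] for i in range(n)]

def mat_exp(X, terms=40):
    """Partial sum of the exponential series."""
    result, term = identity(len(X)), identity(len(X))
    for k in range(1, terms):
        term = [[entry / k for entry in row] for row in mat_mul(term, X)]
        result = [[a + b for a, b in zip(r, s)] for r, s in zip(result, term)]
    return result

def transpose(X):
    return [list(column) for column in zip(*X)]

def max_diff(X, Y):
    return max(abs(a - b) for r, s in zip(X, Y) for a, b in zip(r, s))

# A trace-zero matrix: its exponential should lie in SL_2(R).
X = [[0.4, 1.3], [-0.7, -0.4]]
g = mat_exp(X)
det_g = g[0][0] * g[1][1] - g[0][1] * g[1][0]
print(det_g)

# A skew-symmetric matrix: its exponential should lie in O_3(R).
Y = [[0.0, -0.5, 1.1], [0.5, 0.0, -0.3], [-1.1, 0.3, 0.0]]
h = mat_exp(Y)
orthogonality_defect = max_diff(mat_mul(transpose(h), h), identity(3))
print(orthogonality_defect)
```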


After its existence, the second most important feature of g is that it is natural (in the sense of 
category theory). This is the content of our next theorem. 
Let g, h be real Lie algebras. A homomorphism from g to h is a linear map

L: g → h

satisfying

(5.3) L([x, y]) = [Lx, Ly], x, y ∈ g.

Let V, U be real vector spaces.


THEOREM 19. Let G ⊆ GL(V) be a matrix group with Lie algebra g. Let φ: G → GL(U) be a
continuous homomorphism. Then there is a homomorphism of Lie algebras

(5.4) dφ: g → End U

such that

(5.5) exp(dφ(A)) = φ(exp A).


Proof. If A ∈ g, then exp tA is a one-parameter subgroup of G, so φ(exp(tA)) is a one-parameter
subgroup of φ(G) ⊆ GL(U). Hence by Theorem 10 we may write φ(exp(tA)) = exp(tB) for
some B ∈ End U. If we define

dφ(A) = B,

then equation (5.5) will obviously be satisfied. To prove this theorem, it suffices to show that dφ is
a homomorphism of Lie algebras. But this follows directly from Propositions 14 and 15, which
show that the Lie algebra operations in g are determined by operations in G.


EXAMPLE. The formula (4.17) shows that g is an invariant subspace of End V under the
operators Ad g, g ∈ G. The restriction of Ad g to g is again denoted by Ad g, and the resulting
action of G on g is still called the adjoint action. In terms of Theorem 19, the formula (4.17) has
the interpretation 


(5.6) d(Ad) = ad. 
An immediate consequence of Theorem 19 is: 


COROLLARY 20. If G_1 ⊆ GL(V) and G_2 ⊆ GL(U) are isomorphic matrix groups, then their Lie
algebras g_1 and g_2 are isomorphic as Lie algebras.


Proof. Let φ: G_1 → G_2 be a continuous isomorphism with continuous inverse φ^{-1}. Then in
particular φ is a continuous homomorphism from G_1 to GL(U), and φ^{-1} is a continuous
homomorphism from G_2 to GL(V). Theorem 19 therefore provides us with associated Lie algebra
homomorphisms dφ and d(φ^{-1}). It follows from the definition of the Lie algebra of a matrix
group and formula (5.5) that in fact dφ(g_1) ⊆ g_2, and similarly d(φ^{-1})(g_2) ⊆ g_1. It further
follows from formula (5.5) that since φ^{-1} ∘ φ is the identity on G_1, then also d(φ^{-1}) ∘ dφ is the
identity on g_1. In other words d(φ^{-1}) = (dφ)^{-1}, so dφ is in fact a Lie algebra isomorphism from
g_1 to g_2. ∎


The converse of Corollary 20, that groups with isomorphic Lie algebras are isomorphic, is false. 
For example the rotation group 


SO_2 = \left\{ \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix} : \theta \in \mathbb{R} \right\}


and the diagonal group 


p, = {| °| -aeER, a > oh 


both have Lie algebra isomorphic to R, but SO, is homeomorphic to a circle, while D, is 
homeomorphic to R, so they are certainly not isomorphic. 

However, the converse of Corollary 20 is in a sense almost true, so that the bracket operation 
on g almost determines G as a group. After the existence of the Lie algebra, this fact is the most 
remarkable in Lie theory. Its precise formulation is known as Lie’s Third Theorem. It is in proving 
a suitable version of Lie’s Third Theorem that Lie theory begins to get involved, so we will leave 
the story here. Precise treatments of these issues can be found in [A], [Ch], [He], [Sel]. 


6. Loose Ends and Further Developments 


In §§3, 4, and 5 we have shown that to each matrix group G, there is associated in a close and 
natural way, a Lie algebra g, the two being connected via one parameter groups and the 
exponential map. These facts constitute an important part of the foundations of Lie theory. We 
will describe briefly what we have omitted from the standard account. 

First, we have not treated Lie groups as abstract things-in-themselves, but have only dealt with
them as subgroups of a standard group, GL(V). We could not have discussed abstract Lie groups
without assuming the standard language of differentiable manifolds. Our approach allowed us to
bring to the fore the remarkable Theorem 17, which asserts that merely the requirements of being
closed and being a group inside GL(V) (or any Lie group) suffice to make the group a smooth
manifold. This indicates what a strong regularity condition the group property is. Research over
the past decades has continued to underscore this theme [BT], [Ma], [Mo].
Second, we have not demonstrated how complete and mutual is the relationship between Lie
groups G and their Lie algebras g. It is in this direction that the principal technical complications
of the theory lie. For example, although we have shown how to attach a Lie algebra to every
matrix group, we have not tried to attach a group to every Lie subalgebra of End V. Indeed, this is
not possible if one sticks to matrix groups; the one-parameter groups obtained by exponentiating
elements in a given Lie algebra g will generate a group which in a suitable sense has g as its Lie
algebra but this group will not always be closed in GL(V). The simplest example is probably the
one-parameter group exp tA_x in GL_4(R), where

A_x = \begin{pmatrix} 0 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & x \\ 0 & 0 & -x & 0 \end{pmatrix}


and x is any irrational number. Also the question of the relation of two matrix groups which have 
isomorphic Lie algebras, essentially the question of the converse of Corollary 20, involves the 
notion of covering space and fundamental group [Ms] and is beyond the scope of this discussion. 
Interestingly enough, both these questions are most vexed for the most simple-minded case: 
abelian Lie groups and their Lie algebras. We close these brief remarks by pointing out that, when 
G is fairly nonabelian, especially if the center of G is discrete, the existence of the adjoint action 
and formula (5.6) in particular go a long way toward showing that G is nearly determined by g. 
After the foundations comes the rather extensive development of the structure theory of Lie 
algebras, with direct consequences for the groups. Several fine accounts of the theory of Lie 
algebras are available, for example [J], [Hu]. Beyond the theory of Lie groups and algebras in 
themselves lies the vast domain of their applications. We have mentioned a few of these in the 
introduction and in §7. Some representative references for applications are [BC], [HP], [Hr], [Ko], 
[Lo].

Our treatment in §§3, 4, 5 has been concrete in that we worked only inside End V, but it was 
also abstract in that it was coordinate free. We record here some common terminology used when 
bases are introduced. Let g ⊆ End V be a Lie subalgebra. Let {y_i}, 1 ≤ i ≤ dim g, be a basis for g.
Then the fact that g is a Lie algebra amounts to the statement that the commutators [y_i, y_j] are
again linear combinations of the y_k’s. Thus we have equations

(6.1) [y_i, y_j] = \sum_{k} c_{ij}^{k} y_k,

where the c_{ij}^{k} are real numbers. The equations (6.1) are called the commutation relations of the y_i’s
and the c_{ij}^{k} are called the structure constants of g with respect to the y_i.


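For example, set

e^+ = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \quad e^- = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}, \quad h = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.

The matrices e^+, e^-, and h form a basis for sl_2, the 2 × 2 traceless matrices, one of the most
fundamental Lie algebras. It is easy to compute that their commutation relations are

[h, e^+] = 2e^+, \quad [h, e^-] = -2e^-, \quad [e^+, e^-] = h.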
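
The commutation relations of sl_2 can be confirmed by exact integer arithmetic. In the sketch below the constant names E_PLUS, E_MINUS, H and the helper bracket are labels chosen for this example; the nonzero structure constants can be read off from the right-hand sides.

```python
def mat_mul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def bracket(X, Y):
    """Commutator [X, Y] = XY - YX, as in (4.2)."""
    XY, YX = mat_mul(X, Y), mat_mul(Y, X)
    return [[a - b for a, b in zip(r, s)] for r, s in zip(XY, YX)]

def scaled(c, X):
    return [[c * entry for entry in row] for row in X]

E_PLUS = [[0, 1], [0, 0]]
E_MINUS = [[0, 0], [1, 0]]
H = [[1, 0], [0, -1]]

print(bracket(H, E_PLUS) == scaled(2, E_PLUS))     # [h, e+] = 2e+
print(bracket(H, E_MINUS) == scaled(-2, E_MINUS))  # [h, e-] = -2e-
print(bracket(E_PLUS, E_MINUS) == H)               # [e+, e-] = h
```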
7. Relations with the Standard Curriculum 


In this section we give some examples of how Lie theory makes contact with current staples of 
undergraduate mathematics. We must of course be very restrictive and brief. 


1. Many of the standard theorems of linear algebra are of course also part of the fabric of Lie 
theory, and gain coherence when considered in that light. For example, several of the standard 
canonical forms, e.g., Jordan form, the diagonalization of the (skew) Hermitian matrices, amount 
to classification of the conjugacy classes (orbits under the adjoint action) in a Lie algebra. Jordan 
form describes conjugacy classes in End(C^n) ≅ gl_n(C), and diagonalization of Hermitian matrices


1983] VERY BASIC LIE THEORY 621 


describes conjugacy classes in U_n, the n × n unitary group. We cannot explain this interpretation
of these results in detail, but encourage the reader to explore it by further reading. 

Also, SL_n(R) or SL_n(C) are examples of an extremely important class of Lie groups called
semisimple groups, and several well-known results in linear algebra are special cases for SL_n(R) or
SL_n(C) of structure theorems for semisimple groups. (Since GL_n and SL_n are so similar, we state
the results for GL_n.) The polar decomposition or singular value decomposition [St] says that any
A ∈ GL_n(R) may be written in the form

A = OS = O_1 D O_2,

where O, O_1, and O_2 are orthogonal matrices, S is symmetric, and D is diagonal with positive
entries. This is the specialization to GL_n(R) of what is known as the Cartan decomposition [He] in
the context of semisimple Lie groups. Also, the Gram-Schmidt orthonormalization procedure [St]
says, in group-theoretical terms, that any A ∈ GL_n(R) may be written in the form

A = OB = ODU,

where O is orthogonal, B is upper triangular, D is diagonal with positive entries, and U is upper
triangular with diagonal entries all equal to 1. For general semisimple groups, this is known as the
Iwasawa decomposition [He].

Various basic features in the elimination theory, including the “LU factorization” [St] of a 
generic matrix into the product of an upper triangular and a lower triangular matrix, and the 
“reduced row-echelon form” [DN] are aspects of a different kind of decomposition of semisimple 
groups, known as the Bruhat decomposition [Bo]. 


2. The cross product on R³ defines a Lie algebra structure on R³. This is in fact isomorphic to
o_3, the Lie algebra of O_3, the 3 × 3 skew symmetric matrices. The isomorphism is accomplished
by

(x, y, z) \mapsto \begin{pmatrix} 0 & -x & -y \\ x & 0 & -z \\ y & z & 0 \end{pmatrix}.



The generalization of this correspondence to higher dimensions leads to the theory of spinors and 
Clifford algebras [J2]. 
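
That the map sending (x, y, z) to the skew-symmetric matrix with rows (0, -x, -y), (x, 0, -z), (y, z, 0) carries the cross product to the commutator can be checked with exact integer arithmetic. In the sketch below, hat and cross are names chosen for this illustration; since both sides are bilinear, checking the three basis pairs already suffices, and the extra pair of vectors is only a sanity check.

```python
def hat(v):
    """Skew-symmetric matrix assigned to the vector v = (x, y, z)."""
    x, y, z = v
    return [[0, -x, -y], [x, 0, -z], [y, z, 0]]

def cross(u, v):
    """Usual cross product on R^3."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def mat_mul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def bracket(X, Y):
    """Commutator [X, Y] = XY - YX."""
    XY, YX = mat_mul(X, Y), mat_mul(Y, X)
    return [[a - b for a, b in zip(r, s)] for r, s in zip(XY, YX)]

e1, e2, e3 = (1, 0, 0), (0, 1, 0), (0, 0, 1)
pairs = [(e1, e2), (e2, e3), (e3, e1), ((1, 2, 3), (-4, 5, 6))]
for u, v in pairs:
    print(hat(cross(u, v)) == bracket(hat(u), hat(v)))
```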


3. The fact that second mixed partial derivatives are equal is a reflection of the fact that R^n is
an abelian Lie group.


4. The theory of Fourier series and Fourier transform is best understood group-theoretically. 
See [Gr] for a discussion. 


5. It is fairly routine in quantum mechanics courses, in conjunction with the Schrödinger
equation for the hydrogen atom and angular momentum, to introduce “raising and lowering
operators” [Me]. The operators belong to the complexification of o_3, which is isomorphic to
sl_2(C). The commutation relations of the Lie algebra figure importantly in the computations. The
harmonic oscillator is also susceptible to a Lie-theoretic treatment. The Canonical Commutation
Relations themselves are the laws for a bracket relation on a Lie algebra, known as the Heisenberg
Lie algebra [Ca], [Ho]. The relations of this algebra with quantum mechanics, and physics
generally, are deep and extensive.


6. Perhaps the part of standard undergraduate mathematics that is pedagogically most 
compatible with Lie theory is differential equations. We have already discussed in §2 how the 
notion of one-parameter group is a geometrization of the solution of a system of differential 
equations. And in §3 we noted that one-parameter groups of linear transformations were 
associated with the very important class of linear, constant coefficient systems. Indeed, the 
exponential map and linear algebra techniques are often explicitly used in treating these systems 
[Br]. 
Many of the important classical differential equations are related with Lie theory. Indeed much
of the theory of special functions may be considered a branch of Lie theory [Mi], [V]. Below I 
state, always by way of example, some exercises which I have given to students in differential 
equations courses and which were favorably received. 


A.(i) Let P,Q and the identity operator J span a Lie algebra, with commutation relations 
[P,Q] =I, and of course [ P, J] = [Q, I] = 0. (These are the Canonical Commutation Relations.) 
Define L = (P — I)QP, and A, = (P — I)"Q" (so L = A,P). Show that 


(a) [Q,(P-1)"]=-n(P-1)"", 

(6) Anyi = (4, + 2)A4,, 

(c) [L,A,] = L— Aj, and 

(d) L(A, +n)=(A,+n)L4+(L+n)-(A, +7). 


(ii) Suppose $v_n$ is an eigenvector for $L$, with eigenvalue $-n$, so that $(L + n)v_n = 0$. Show from
(d) above that $(A_1 + n)v_n$ is an eigenvector for $L$, with eigenvalue $-(n + 1)$. Conclude from (b)
that if $v_0$ is an eigenvector of $L$ with eigenvalue 0, then $A_n v_0 = v_n$ is an eigenvector with eigenvalue
$-n$.


(iii) Show that if $P = d/dx$ and $Q$ = multiplication by $x$, then $P$ and $Q$ satisfy the relations
above. Show also that

\[ e^{x}\,\frac{d}{dx}\,e^{-x} = P - I. \]

Conclude that a solution to the Laguerre equation $xy'' + (1 - x)y' + ny = 0$ is

\[ e^{x}\left(\frac{d}{dx}\right)^{n}\left(e^{-x}x^{n}\right) = (P - I)^{n}Q^{n}(1); \]

here 1 is the constant function on $\mathbf{R}$.

B.(i) Take $P$, $Q$ and $I$ as in A(i). Suppose $Pv_0 = 0$, and set $v_n = Q^n(v_0)$. Show inductively that
$Pv_n = nv_{n-1}$. Conclude that $v_n$ is an eigenvector of eigenvalue $n$ for $QP$.


(ii) Put $P = d/dx$, $Q = (d/dx) + x$. Verify that these satisfy the correct commutation relations,
and show that

\[ Q = e^{-x^{2}/2}\,\frac{d}{dx}\,e^{x^{2}/2}. \]

Show that solutions of Hermite’s equation $y'' + xy' - ny = 0$ are given by

\[ y = e^{-x^{2}/2}\left(\frac{d}{dx}\right)^{n} e^{x^{2}/2} = Q^{n}(1). \]


In addition to Rodrigues-type formulas such as the above, one can deduce in a purely formal 
manner recursion relations and other properties of the Hermite, Laguerre, Legendre, Bessel, and 
many other classical families of functions. 
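Exercises A and B can be spot-checked mechanically. The following Python sketch, which is not part of the exercises, represents a polynomial as a list of integer coefficients (index = power of x), builds $(P - I)^n Q^n(1)$ with the operators of A(iii) and $Q^n(1)$ with those of B(ii), and evaluates the left sides of the Laguerre and Hermite equations as stated above. All function names are hypothetical, and a check for a few small n is no substitute for the inductive arguments the exercises ask for.

```python
def deriv(p):
    """Derivative of a polynomial given as a coefficient list."""
    return [k * c for k, c in enumerate(p)][1:] or [0]

def times_x(p):
    """Multiplication by x: shift every coefficient up one power."""
    return [0] + list(p)

def add(p, q, sign=1):
    """Return p + sign * q, padding the shorter list with zeros."""
    m = max(len(p), len(q))
    p = list(p) + [0] * (m - len(p))
    q = list(q) + [0] * (m - len(q))
    return [a + sign * b for a, b in zip(p, q)]

def scale(c, p):
    return [c * a for a in p]

def is_zero(p):
    return all(c == 0 for c in p)

def laguerre(n):
    """(P - I)^n Q^n applied to 1, with P = d/dx, Q = multiplication by x."""
    y = [0] * n + [1]                  # Q^n(1) = x^n
    for _ in range(n):
        y = add(deriv(y), y, sign=-1)  # (P - I) y = y' - y
    return y

def hermite(n):
    """Q^n applied to 1, with Q = d/dx + x as in B(ii)."""
    y = [1]
    for _ in range(n):
        y = add(deriv(y), times_x(y))  # Q y = y' + x y
    return y

def laguerre_residual(n):
    """Coefficients of x y'' + (1 - x) y' + n y for y = laguerre(n)."""
    y = laguerre(n)
    d1, d2 = deriv(y), deriv(deriv(y))
    return add(add(add(times_x(d2), d1), times_x(d1), sign=-1), scale(n, y))

def hermite_residual(n):
    """Coefficients of y'' + x y' - n y for y = hermite(n)."""
    y = hermite(n)
    d1, d2 = deriv(y), deriv(deriv(y))
    return add(add(d2, times_x(d1)), scale(n, y), sign=-1)
```

For instance, `laguerre(2)` is the coefficient list of $2 - 4x + x^2$ and `hermite(2)` that of $1 + x^2$; both residual functions return lists of zeros for each n tried.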


I would like to thank Kenneth Gross for painstaking efforts to improve the readability of this paper. 
holds. Summation extends over all $d$, $d \mid (n, r)$. $u$ is the greatest common square-free unitary
divisor of $n$ and $r$. (A divisor $d$ of $c$ is called unitary if $(d, c/d) = 1$.) $\phi$ ($\mu$) is the Euler totient
[Möbius function].


Solution by Chun-Nip Lee, student, Massachusetts Institute of Technology. The stated formula
contains a misprint. A correct formula is

\[ \sum_{d \mid (n,r)} \phi(nr/d^{2})\, d\, \mu(d) = \phi(n/u)\,\phi(r/u)\,\phi_{2}(u^{2}). \]

(Editor’s note. The greatest common square-free unitary divisor (g.c.s.f.u.d.) of $p$ and $p^{a}$,
$a \ge 2$, is 1; this is not the same as the greatest square-free unitary divisor of the greatest common
divisor of two numbers.)


Proof by induction on the number of distinct prime factors of $(n, r)$. If $(n, r) = 1$, then each side
of the equation equals $\phi(n)\phi(r)$ by multiplicativity of $\phi$. Now suppose that the formula is true if
$(n, r)$ contains at most $k$ distinct prime factors, $k \ge 0$, and that $(n, r) = p^{\eta}e$, where $e$ contains $k$
distinct prime factors, $\eta \ge 1$, and $(p, e) = 1$. Write $n = p^{\nu}N$ and $r = p^{\rho}R$, with $(p, N) = (p, R) = 1$.
Let $U$ denote the g.c.s.f.u.d. of $N$ and $R$ and $u$ the g.c.s.f.u.d. of $n$ and $r$.

Let $\Sigma$ denote the sum on the left side of the formula. We can express $\Sigma$ as a sum over the
divisors of $e$ plus a sum over the remaining divisors of $pe$, since $\mu(d) = 0$ if $d$ is divisible by the
square of a prime. We obtain by multiplicativity

\[ \Sigma = \sum_{d \mid p^{\eta}e} \phi(p^{\nu+\rho}NR/d^{2})\, d\, \mu(d)
 = \bigl(\phi(p^{\nu+\rho}) - p\,\phi(p^{\nu+\rho-2})\bigr) \sum_{d \mid e} \phi(NR/d^{2})\, d\, \mu(d). \]

The induction hypothesis now gives

\[ \Sigma = \bigl(\phi(p^{\nu+\rho}) - p\,\phi(p^{\nu+\rho-2})\bigr)\, \phi(N/U)\,\phi(R/U)\,\phi_{2}(U^{2}). \]

If $\nu = \rho = 1$, then $u = pU$ and

\[ \Sigma = (p^{2} - 2p)\,\phi(N/U)\,\phi(R/U)\,\phi_{2}(U^{2}) = \phi(n/u)\,\phi(r/u)\,\phi_{2}(u^{2}), \]

since $\phi_{2}$ is multiplicative and $\phi_{2}(p^{2}) = p^{2} - 2p$.
If one of $\nu$, $\rho$ exceeds 1, then $u = U$ and

\[ \Sigma = \bigl(p^{\nu+\rho} - 2p^{\nu+\rho-1} + p^{\nu+\rho-2}\bigr)\, \phi(N/U)\,\phi(R/U)\,\phi_{2}(U^{2})
 = \phi(p^{\nu})\,\phi(p^{\rho})\,\phi(N/U)\,\phi(R/U)\,\phi_{2}(U^{2})
 = \phi(n/u)\,\phi(r/u)\,\phi_{2}(u^{2}). \]

This completes the induction.
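The corrected formula can also be checked numerically for small arguments. The Python sketch below is an illustration only, with hypothetical function names. Since this excerpt does not define $\phi_2$ in general, the code uses only the two facts quoted in the proof, namely that $\phi_2$ is multiplicative and that $\phi_2(p^2) = p^2 - 2p$; these determine $\phi_2(u^2)$ because $u$ is square-free. The g.c.s.f.u.d. is computed as the product of the primes that divide both $n$ and $r$ to exactly the first power, in line with the editor’s note.

```python
from math import gcd

def factorize(m):
    """Prime factorization of m as a dict {prime: exponent} (trial division)."""
    f = {}
    p = 2
    while p * p <= m:
        while m % p == 0:
            f[p] = f.get(p, 0) + 1
            m //= p
        p += 1
    if m > 1:
        f[m] = f.get(m, 0) + 1
    return f

def phi(m):
    """Euler totient."""
    result = m
    for p in factorize(m):
        result -= result // p
    return result

def mobius(m):
    """Moebius function."""
    f = factorize(m)
    if any(e > 1 for e in f.values()):
        return 0
    return -1 if len(f) % 2 else 1

def gcsfud(n, r):
    """Greatest common square-free unitary divisor of n and r."""
    fn, fr = factorize(n), factorize(r)
    u = 1
    for p in fn:
        if fn[p] == 1 and fr.get(p) == 1:
            u *= p
    return u

def lhs(n, r):
    """Sum over d | (n, r) of phi(n r / d^2) * d * mu(d)."""
    g = gcd(n, r)
    return sum(phi(n * r // (d * d)) * d * mobius(d)
               for d in range(1, g + 1) if g % d == 0)

def rhs(n, r):
    """phi(n/u) * phi(r/u) * phi_2(u^2), with phi_2(p^2) = p^2 - 2p."""
    u = gcsfud(n, r)
    phi2 = 1
    for p in factorize(u):
        phi2 *= p * p - 2 * p
    return phi(n // u) * phi(r // u) * phi2
```

For example, `lhs(3, 3)` and `rhs(3, 3)` both equal 3, while for `n = 2`, `r = 4` the divisor `u` is 1 and both sides equal 2.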


ANSWERS TO PHOTOS ON PAGE 624 


Top: A. G. Kurosh (algebra, and in particular, group theory), taken in 1958. Bottom: 
L. Schwartz (analysis, and in particular, distributions) taken in 1969. 
of Carleton, Macalester, and St. Olaf Colleges. Books submitted for review

Special Codes:


T: Textbook                  13-18: Grade Level
S: Supplementary Reading     1-4: Time in Semesters
P: Professional Reading      *, **: Special Emphasis
General, S(15-17), P**, L. Judgment Under Uncertainty: Heuristics and Biases. Ed: Daniel Kahneman,
Paul Slovic, Amos Tversky. Cambridge U Pr, 1982, xiii + 555 pp, $44.50; $14.95 (P). [ISBN: 0-521- 
24064-6; 0-521-28414-7] Very interesting collection of articles, dealing primarily with psychologi- 
cal studies of how people estimate probabilities and the types of biases produced by these heuristic 
methods. A "blemished portrait of human capabilities...emerges from this work," that has serious 
implications for decision making. Good set of references. RSK 


General, S(15-18), P, L*. Winning Ways for Your Mathematical Plays, V. 1: Games in General. Elwyn
R. Berlekamp, John H. Conway, Richard K. Guy. Academic Pr, 1982, xxxii + 437 pp, $58.50 (P). [ISBN: 
0-12-091101-9] Serious mathematics in whimsical dress: strategic analyses (loaded with non-stop 
outrageous puns) of several dozen variations on games such as nim, hackenbush, and kayles. The 
focus of Volume 1 is on the Grundy theory of two-person positional games and their variations 
(sums, urgent unions, misère) as encoded in Conway’s "surreal" numbers (On Numbers and Games, 1976,
TR, November 1976). Both terminology (especially puns) and notation are highly specialized (the 
index lists over 100 symbols that are virtually unique to this book), making casual reading quite 
difficult. LAS 


General, S*, P*, L**. Powers of Ten. Philip and Phylis Morrison. WH Freeman, 1982, 159 pp, 
$29.95. [ISBN: 0-7167-1409-4] A stunning re-creation on paper of the famous short film "Powers of 
Ten": each right-hand page contains a view of a picnic in Chicago at different distances, from 10^25
meters (1 billion light years) to 10^-16 meters (0.1 fermi), while the left pages offer commentary
and other illustrations at each scale. This new still film is framed and enriched by an opening 
essay on matters of size, concluding commentary, and references. This is the first volume in the 
new Scientific American Library, a series of books for the general reader. LAS 


General, S*, L*. Problem-Solving Through Problems. Loren C. Larson. Prob. Books in Math. 
Springer-Verlag, 1983, xi + 332 pp, $34. [ISBN: 0-387-90803-X] An anthology of over 700 problems, 
mostly from the Monthly and Putnam exams, classified by method of solution and embedded in a succession
of sections each featuring the power of a single method. Intended especially for students
preparing for mathematical competitions, the book is also well suited to self-study by any undergraduate
mathematics major who wants to hone the tools of his trade. LAS


Precalculus, T(13: 1), S, L. Precalculus Mathematics, A Functions Approach. Floyd E. Helton, Mar- 
garet L. Lial. Scott Foresman, 1983, xi + 514 pp, $19.95. [ISBN: 0-673-15507-2] The central theme 
is function--definition, graphs, standard precalculus examples: polynomial, log and exponential, 
trigonometric. Other topics: coordinate geometry, systems of equations, complex numbers, sequences 
and series. Nice feature: effective use of examples to motivate definitions and development. Some 
applications. PZ 


History, S, L**. James Clerk Maxwell, A Biography. Ivan Tolstoy. U of Chicago Pr, 1981, viii + 
184 pp, $6.95 (P). [ISBN: 0-226-80787-8] A short, highly readable account of Maxwell’s life--
conservative, unassuming, quietly religious--and professional work--daring, revolutionary, powerfully
mathematical. Tolstoy, an applied mathematician, conveys clearly both the context and significance
of Maxwell’s electromagnetic theory without resort to any equations or symbols: he stresses
instead the role of analogy in Maxwell’s work, illustrated by extensive expository quotations from
Maxwell’s important papers. LCL


History, S(13-16), L*. A Calculating People: The Spread of Numeracy in Early America. Patricia
Cline Cohen. U of Chicago Pr, 1982, x + 271 pp, $22.50. [ISBN: 0-226-11283-7] An intriguing
account of the role of arithmetical and statistical reasoning in eighteenth and early nineteenth
century America, from publication of Bills of Mortality and impassioned argument (in 1722) about the
merits of smallpox inoculation to slavery debates a century later engendered by suspect data on
insanity in the 1840 census. Contains many insights into the early American history of statistics
(originally meaning statements, not necessarily quantitative, about affairs of state), including the
founding of the American Statistical Society. LAS


Number Theory, P. Multiple Trigonometric Sums. G.I. Arhipov, A.A. Karacuba, V.N. Cubarikov. 
Proc. of Steklov Inst. of Math., Tom 151. AMS, 1982, viii + 126 pp, $42 (P). [ISBN: 0-8218-3067-8] 
Theorems about multiple trigonometric sums applied to problems concerning the number of solutions of 
complicated systems of Diophantine equations. LAS 


Algebra, T(18), P. Modules and Rings. F. Kasch, D.A.R. Wallace. London Math. Society Mono., V. 17. 
Academic Pr, 1982, xiii + 372 pp, $63. [ISBN: 0-12-400350-8] Starting with the fundamental concepts 
of the theory of rings and modules (early emphasis on the importance of projective and injective 
modules) the author proceeds to develop a number of important themes that heretofore have not 
appeared in textbook form (e.g., rings with perfect duality and quasi-Frobenius rings). Numerous 
exercises. LCL 


Finite Mathematics, T(13-14: 1, 2). Applied Finite Mathematics, Third Edition. Howard Anton, Ber- 
nard Kolman. Academic Pr, 1982, xi + 593 pp. [ISBN: 0-12-059566-4] Minor modifications, including 
a few new exercises, and chapter review summaries. Added appendix on algebra review. (First Edi- 
tion, TR, January 1975; Second Edition, TR, June-July 1978.) LCL 


Calculus, S(13). Solutions Manual to Accompany Stein’s Calculus and Analytic Geometry, Third Edi- 
tion. Anthony Barcellos. McGraw-Hill, 1982, $10 (P). [ISBN: 0-07-061155-6] Complete solutions to 
odd-numbered exercises. LCL 


Real Analysis, T**(15: 2, 3), S, L*. Intermediate Real Analysis. Emanuel Fischer. Undergraduate
Texts in Math. Springer-Verlag, 1983, xiv + 770 pp, $28. [ISBN: 0-387-90721-1] Assuming (correctly) 
that students learn little from slick proofs, the author has produced an extraordinarily complete 
treatment of the first course in real analysis (144 pages until Cauchy sequences). Examples and 
exercises abound. It would take (at least) a year to cover the material, but the students would 
know the reasoning as well as the statements of the major ideas in analysis. TAV 


Real Analysis, T(17: 2). Real Analysis, Second Edition. Serge Lang. Addison-Wesley, 1983, xiv + 
533 pp, $23.95. [ISBN: 0-201-14179-5] In six parts--general topology, Banach spaces and the cal- 
culus, functional analysis, integration in measured spaces, integration on locally compact and 
Euclidean spaces, and global analysis--gives the scope and flavor of this text. Carefully written
with extensive problem sets, this contains much more analysis than the title suggests. TAV 


Real Analysis, T(15-16: 2). Principles of Real Analysis, Revised Edition. S.C. Malik. Halsted Pr,
1980, viii + 379 pp, $19.95. Typical topics--sets, sequences, functions, integrals, derivatives-- 
clear exposition, a limited number of exercises. The printing is quite poor on grey paper. TAV 


Complex Analysis, T(15-16: 1), S, L. Complex Variables with Applications. A. David Wunsch. 
Addison-Wesley, 1983, viii + 439 pp, $26.95. [ISBN: 0-201-08885-1] A mathematically relatively ele- 
mentary undergraduate text. Real calculus ideas are reviewed, some proofs simplified, e.g., 
Cauchy-Goursat via Green’s theorem. Notable for the number and depth of physical applications, 
especially to electricity. List of topics quite standard; order, emphasis, notation less so, con- 
sistent with emphasis on applications. E.g., strong sections on residue integrals, integral 
transforms. Interesting problems, many applied. PZ 


Complex Analysis, T*(18: 1, 2), S, P*. Function Theory on Planar Domains: A Second Course in Com- 
plex Analysis. Stephen D. Fisher. Wiley, 1983, xiii + 269 pp, $34.95. [ISBN: 0-471-87314-4] The 
Dirichlet problem, Hardy spaces, inner functions, the corona theorem, and other function-theoretic 
topics are developed for multiply-connected planar domains, usually with real-analytic boundaries.
While there are many references to additional readings, the text is self-contained (at the second- 
year graduate level). With exercises following each chapter. An excellent text or self-study 
manual. PZ 


Complex Analysis, P. Lecture Notes in Mathematics-978: Quasiconformal Mappings in the Plane: 
Parametrical Methods. Julian Ławrynowicz, Jan Krzyż. Springer-Verlag, 1983, vi + 177 pp, $10.50
(P). [ISBN: 0-387-11989-2] From the foreword: "An exposition of analytic properties of quasiconformal
mappings in the plane..., a detailed and systematic study of the parametrical method with complete
proofs..., and a brief account of variational methods." PZ


Differential Equations, T(17-18: 1, 2), S, P. Shock Waves and Reaction-Diffusion Equations. Joel 
Smoller. Springer-Verlag, 1983, xxi + 581 pp, $39. [ISBN: 0-387-90752-1] A comprehensive introduc- 
tion to partial differential equations, emphasizing, but not limited to, the nonlinear topics of the 
title. Part I, on linear partial differential equations, could serve as an introductory graduate 
text (but note: no exercise sets). Focus on topological aspects: Part IV treats Morse theory and 
the Conley index. Most parts should be readable by most mathematicians. PZ 


Functional Analysis, S(18), P. Functional Analysis, Holomorphy, and Approximation Theory. Ed: Guido
I. Zapata. Lect. Notes in Pure & Appl. Math., V. 83. Dekker, 1983, viii + 458 pp, $55 (P). [ISBN: 
0-8247-1634-5] Proceedings of the August 1979 International Seminar on Functional Analysis, Holomor- 
phy, and Approximation Theory at Rio de Janeiro. 19 papers, mostly on functional analysis topics, 
at the research and advanced expository level. For the nonspecialist: an interesting essay by 
Dieudonné on the history of functional analysis. PZ 
Functional Analysis, T*(17: 1, 2), S, L. First Course in Functional Analysis, Second Edition.
Casper Goffman, George Pedrick. Chelsea Pub, 1983, xi + 284 pp, $15.95. [ISBN: 0-8284-0319-8] A 
second edition (with revised bibliography) of the 1965 first-year graduate text. Unusually well- 
integrated, concrete point of view, convincingly presented: "Analysis is basic...abstract 
theories...are primarily of interest as tools...in treating problems in analysis." Many analysis 
topics motivate and illustrate--Lebesgue theory, reproducing kernels, Haar measure, Fourier series. 
PZ 


Analysis, S(17-18), P. Lecture Notes in Mathematics-977: On Global Univalence Theorems. T.
Parthasarathy. Springer-Verlag, 1983, viii + 106 pp, $8.50 (P). [ISBN: 0-387-11988-4] The problem
studied is to give conditions for differentiable R^n-valued functions on subsets of R^n to be globally
univalent. The first two brief chapters collect rudiments of the theory from real analysis and
matrix theory; the remainder of the monograph consists of history, old and new results, applications,
and open problems. PZ


Algebraic Geometry, P. The Birational Geometry of Degenerations. Ed: Robert Friedman, David R. 
Morrison. Progress in Math., V. 29. Birkhäuser Boston, 1983, ix + 386 pp, $27.50. [ISBN: 3-7643-3111-9]
Nine papers from the Summer Algebraic Geometry Seminar, 1981, at Harvard, studying the
geometry of birational equivalence classes of certain completions of families of compact complex
manifolds over the punctured disk. The first paper reviews earlier work in the field and surveys
the results in the remainder of the volume. PZ 


Algebraic Geometry, P. The Curves Seminar at Queen’s, Volume II. Ed: Anthony V. Geramita. Papers 
in Pure & Appl. Math., No. 61. Queen’s U, 1982, 216 pp, (P). Four research articles (intersection 
theory, Hilbert functions) and two expository articles (graded rings, ordinary singularities of 
curves). LCL 


Differential Geometry, P. Nonlinear Analysis on Manifolds. Monge-Ampère Equations. Thierry Aubin.
Grund. der math. Wissenschaften, B. 252. Springer-Verlag, 1982, xii + 204 pp, $29.50. [ISBN: 0-387-90704-1]
Introduces and treats nonlinear problems on Riemannian manifolds, emphasizing interplay
between analysis and geometry. First four chapters are foundational material on analysis of
Riemannian manifolds; last four are applications, e.g., to real and complex Monge-Ampère equations.
"A reference and...an introduction to research." PZ 


Geometry?? Solutions to the Three Historical Problems by Compass and Straightedge. Delvin J. Johnson.
Vantage Pr, 1982, v + 30 pp, $6.95. [ISBN: 533-05050-2] A worthless collection of strangely
worded "theorems" (e.g., "Given the opportunity, an arc for an acute angle will give the chord for
any given angle arc reduced to 1/3.") with three to seven line "proofs." JNC


Geometry, T(14-16: 1), S, L. Foundations of Euclidean and Non-Euclidean Geometry. Richard L.
Faber. Pure & Appl. Math., V. 73. Dekker, 1983, xi + 329 pp, $49.75. [ISBN: 0-8247-1748-1] The 
foundations of Euclidean and non-Euclidean geometry are treated with a strong historical flavor and 
great pedagogical concern. Half the volume is concerned with a relatively novel detailed develop- 
ment of hyperbolic geometry, its models and its relevance to physics. Many good exercises. SS 


Geometry, T(14-16: 1, 2), S, L. Invitation to Geometry. Z.A. Melzak. Wiley, 1983, ix + 225 pp, 
$29.95. [ISBN: 0-471-09209-6] An unusual and interesting potpourri of geometric topics appropriate 
for college-level students. Focus is on problems--pure and applied--from Euclidean geometry of two 
and three dimensions. Not only are most of the problems fascinating, but so are the methods of 
solution, many of which are developed as general problem-solving techniques for the reader. Rich 
sets of exercises. SS


Topology, S(15-18), P. Shape Theory: The Inverse System Approach. S. Mardešić, J. Segal. Math.
Lib., V. 26. Elsevier North-Holland, 1982, xv + 378 pp, $81.50. [ISBN: 0-444-86286-2] Chapter one 
presents the shape category of topological spaces with special emphasis on inverse limits and reso- 
lutions of spaces. The second chapter treats the algebraic topology of shape theory while chapter 
three contains a number of surveys of selected areas of shape theory. Many sections of biblio- 
graphic notes interspersed throughout the text. There is an appendix on polyhedra and one on 
Borsuk’s approach to shape. Extensive bibliography. List of special symbols. Subject index. RJA


Operations Research, T(17-18: 1), P. Mathematical Aspects of Scheduling and Applications. R. Bell- 
man, A.O. Esogbue, I. Nabeshima. Modern Appl. Math. & Computer Sci., V. 4. Pergamon Pr, 1982, xiv 
+ 329 pp, $35. [ISBN: 0-08-026477-8] A unified treatment of scheduling problems and their solutions. 
State-of-the-art solution techniques are presented. Some familiarity with dynamic programming and
integer programming is assumed. AO


Probability, T(17: 2), P. Approximating Countable Markov Chains. David Freedman. Springer-Verlag, 
1983, x + 140 pp, $20. [ISBN: 0-387-90804-8] The third in a "trilogy"--follows the author’s Markov
Chains and Brownian Motion and Diffusion (TR, June-July 1983). Constructive in approach, the text 
presumes familiarity with separability, and emphasizes the general (countable) process. A reprint 
of the 1972 Holden Day edition (TR, January 1973). TAV 


Probability, T(18: 2). Stochastic Calculus and Applications. Robert J. Elliott. Applic. of 
Math., No. 18, Springer-Verlag, 1982, ix + 302 pp, $42. [ISBN: 0-387-90763-7] Presumes probability, 
measure, and a course in stochastic processes. The author attempts to follow the modern French
theory of random processes to a point where applications to system theory and control may be
understood. The development makes use of Martingale methods in stochastic control. Contains an
extensive bibliography. TAV


Probability, P. Regular and Stochastic Motion. A.J. Lichtenberg, M.A. Lieberman. Appl. Math. 
Sci., No. 38. Springer-Verlag, 1983, xxi + 499 pp, $36. [ISBN: 0-387-90707-6] Treats stochastic 
motion in nonlinear oscillator systems. The main emphasis is on the intrinsic stochasticity of systems
when the stochastic motion is generated by the dynamics itself, along with an introduction to
chaotic motion in dissipative systems. Contains a comprehensive bibliography. TAV 


Probability, T(16: 1), S, L. Studies in the Theory of Random Processes. A.V. Skorokhod. Dover
Pub, 1982, viii + 199 pp, $4.50 (P). [ISBN: 0-486-64240-2] In three parts: random processes and stochastic
integrals; stochastic differential equations; limit theorems for Markov processes. Differs
from the usual treatment in its probabilistic rather than analytic approach. A nice book, an excellent
buy. TAV


Probability, T(17). Markov Chains. David Freedman. Springer-Verlag, 1983, xiv + 382 pp, $28. 
[ISBN: 0-387-90808-0] The first in a "trilogy" on Markov processes, this text goes well beyond the 
usual treatment. Presumes familiarity with first principles of the theory. Contains numerous examples,
no exercises. Both the discrete and continuous chains are investigated. A reprint of the
1971 Holden-Day edition (TR, November 1971). TAV 


Probability, P. Probabilistic Metric Spaces. B. Schweizer, A. Sklar. Ser. in Prob. & Appl. Math. 
Elsevier Sci Pub, 1983, xvi + 275 pp, $39. [ISBN: 0-444-00666-4] By replacing the normal metric on a 
space with a probability that two points are "close," one can define the PMS of the title. This 
book traces the development of the theory, various approaches to a triangle inequality and the 
resulting topological results. Contains a useful and extensive bibliography. TAV 


Probability, T(17: 1), P. Statistical Estimation for Stochastic Processes. K. Nanthi. Papers in 
Pure & Appl. Math., No. 62. Queen’s U, 1983, vii + 269 pp, (P). Most texts in stochastic processes 
avoid the problem of determining the values of the parameters used in the models. This survey fills 
that gap nicely and could be used as a follow-up to a one semester stochastic processes course. TAV 


Probability, P. Application of the Theory of Boundary Value Problems in the Analysis of a Queueing 
Model with Paired Services. J.P.C. Blanc. Math. Centre Tracts, No. 153. Math Centrum, 1982, 244 
pp, Dfl. 33 (P). [ISBN: 90-6196-247-1]

Probability, P. Queues and Point Processes. Peter Franken, et al. Wiley, 1982, 208 pp, $29.95. 
[ISBN: 0-471-10074-9] Using the recent (1976) concept of embedded marked point processes, the 
authors attempt to give a unified treatment to several basic problems in queueing systems theory. A 
variety of generalized arrival processes is considered as well as state-dependent service times. 
TAV 


Statistics, P. Lecture Notes in Statistics-17: Asymptotic Optimal Inference for Non-ergodic Models.
Ishwar V. Basawa, David John Scott. Springer-Verlag, 1983, xiii + 170 pp, $15 (P). [ISBN: 0-387-90810-2]


Statistics, T(13-14: 1, 2). A First Course in Business Statistics, Second Edition. James T.
McClave, P. George Benson. Dellen Pub, 1983, xvii + 546 pp, $24.95. [ISBN: 0-89517-043-4] Presupposes
no college mathematics. The usual topics plus treatments of various probability distributions
and a chapter on decision analysis using prior information. Many examples and exercises deal with 
real data. FLW 


Statistics, T*(13: 1). Basic Statistics: An Inferential Approach. Frank H. Dietrich, II, Thomas J. 
Kearns. Dellen Pub, 1983, xiv + 731 pp, $26.95. [ISBN: 0-89517-044-2] Clearly written elementary
text, but with more emphasis on inference-making than most. Topics covered are mostly standard,
including regression and correlation, analysis of variance (completely randomized design only),
chi-square tests, and non-parametric statistics. Extensive problem sets and sample quizzes (both
chapter and cumulative). RSK 


Computer Programming, T(13: 1). BASIC Programming, Second Edition. Van Court Hare, Jr. HarBrace 
J, 1982, xv + 407 pp, $16.95 (P). [ISBN: 0-15-505002-8] The text is divided into three parts: introductory
material, including sample start-up procedures; minimal BASIC statements; commonly available
extensions of BASIC. Changes from the First Edition (TR, April 1972) include: omission of most of 
the historical discussion and translations from BASIC to Fortran; replacement of more difficult 
exercises and examples; new material especially designed for users of personal computers. JRG 


Computer Science, S(15-18), P. Lecture Notes in Computer Science-141: GAG: A Practical Compiler
Generator. Uwe Kastens, Brigitte Hutt, Erich Zimmermann. Springer-Verlag, 1982, iv + 156 pp, $10
(P). [ISBN: 0-387-11591-9] A GAG is a compiler generator based on attributed grammars. Contains
chapters on GAG systems; ALADIN--a language for attributed definitions; an AG for a Pascal-Analyzer;
and generating efficient compiler front-ends. Two appendices: one on an AG for Pascal and one on results
of the usage of GAG. References. RJA 


Computer Science, S(15-18), P. Lecture Notes in Computer Science-139: An Attribute Grammar for the
Semantic Analysis of Ada. J. Uhl, et al. Springer-Verlag, 1982, ix + 511 pp, $22 (P). [ISBN: 0-387-11571-4]
An attribute grammar (AG) is a tool used to specify formally the semantics of a
programming language and also a compiler for the language. Divided into three parts: (1) the
development of the AG; (2) a rationale for the AG; (3) the AG itself. References. RJA


Computer Science, P. Lecture Notes in Computer Science-145: Theoretical Computer Science. Ed: A.B. 
Cremers, H.P. Kriegel. Springer-Verlag, 1982, x + 365 pp, $16.50 (P). [ISBN: 0-387-11973-6] Con- 
tains the contents of the 3 invited lectures plus the 28 contributed papers presented at the 6th 
Biannual GI Symposium on Theoretical Computer Science, held in Dortmund, Germany, January 5-7, 1983 
and sponsored by the Special Interest Group for Automata and Formal Language Theory of the 
Gesellschaft für Informatik. RJA


Computer Science, P, L. Deterministic Top-Down and Bottom-Up Parsing: Historical Notes and 
Bibliographies. Anton Nijholt. Math Centrum, 1983, 118 pp, Dfl. 16,50 (P). [ISBN: 90-6196-245-5] 
Three bibliographies, each introduced with a brief historical and definitional survey: 406 items on 
top down parsing of LL grammars (Left to right using Left-most derivations); 561 items on LR parsing 
(Left to right using Right-most derivations); and 251 items on precedence parsing. LAS 


Applications (Artificial Intelligence), S(16-18), P, L. The Handbook of Artificial Intelligence, 
Volume III. Ed: Paul R. Cohen, Edward A. Feigenbaum. William Kaufmann, 1982, xviii + 639 pp, $45.
[ISBN: 0-86576-007-1] This volume includes chapters on models of cognition, automatic deduction,
vision, learning and inductive inference, planning and problem solving. Each chapter begins with a
subject overview that includes historical and referential information. List of contributors.
Bibliography for Volume III is cumulative for three volumes. RJA


Applications (Engineering), T*(15: 1). Reliability Evaluation of Engineering Systems: Concepts and 
Techniques. Roy Billinton, Ronald N. Allan. Plenum Pr, 1983, x + 349 pp, $42.50. [ISBN: 0-306-41296-9]
Presuming no prior experience with probability, the authors develop the binomial and Poisson
distributions as they apply to reliability, then consider graph theoretic and Markov chain
applications. Easily read. An excellent source of examples of an important application of probability.
TAV


Applications (Physics), P, L*. Academician Andrei Dmitrievich Sakharov: Collected Scientific Works. 
Ed: D. ter Haar, D.V. and G.V. Chudnovsky. Dekker, 1982, xvi + 303 pp, $27.50. [ISBN: 0-8247-1714-7]
24 of Sakharov’s most important papers from 1947 through 1980, arranged in three groups (plasma
physics, cosmology, and field theory), each group introduced by commentary by Sakharov and concluded
by commentaries by other physicists. Also includes the text of Sakharov’s 1975 Nobel Peace Prize
lecture, and lists of Sakharov’s "divertissements"--problems in mathematics and physics "undertaken
as a domestic pasttime," including Ramsey theory, approximate trisections, number theory algorithms.
LAS


Applications (Physics), T(15-16: 1), P, L. A Symmetry Primer for Scientists. Joe Rosen. Wiley, 
1983, xiv + 192 pp, $26.95. [ISBN: 0-471-87672-0] First coherent textbook treatment of symmetry and 
its application to science. Introduction to group theory precedes statement and rigorous derivation
of six symmetry principles: the equivalence and symmetry principles for processes in isolated physi- 
cal systems; the special and general symmetry evaluation principles; the special and general sym- 
metry evolution principles. Some examples and problems assume understanding of quantum theory. JRG 


Reviewers 


RJA: Richard J. Allen, St. Olaf; PB: Peder Bolstad, St. Olaf; RB: Richard Brown, Carleton; JNC:
Judith N. Cederberg, St. Olaf; CEC: Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Carleton;
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programming language and also a compiler for the language. Divided into three parts: (1) the 
development of the AG; (2) a rationale for the AG; (3) the AG itself. References. RJA 


Computer Science, P. Lecture Notes in Computer Science-145: Theoretical Computer Science. Ed: A.B. 
Cremers, H.P. Kriegel. Springer-Verlag, 1982, x + 365 pp, $16.50 (P). [ISBN: 0-387-11973-6] Con- 
tains the contents of the 3 invited lectures plus the 28 contributed papers presented at the 6th 
Biannual GI Symposium on Theoretical Computer Science, held in Dortmund, Germany, January 5-7, 1983 
and sponsored by the Special Interest Group for Automata and Formal Language Theory of the 
Gesellschaft flr Informatik. RJA 


Computer Science, P, L. Deterministic Top-Down and Bottom-Up Parsing: Historical Notes and 
Bibliographies. Anton Nijholt. Math Centrum, 1983, 118 pp, Dfl. 16,50 (P). [ISBN: 90-6196-245-5] 
Three bibliographies, each introduced with a brief historical and definitional survey: 406 items on 
top down parsing of LL grammars (Left to right using Left~most derivations); 561 items on LR parsing 
(Left to right using Right-most derivations); and 251 items on precedence parsing. LAS 


Applications (Artificial Intelligence), $(16-18), P, L. The Handbook of Artificial Intelligence, 
Volume III. Ed: Paul R. Cohen, Edward A. Feigenbaum. William Kaufmann, 1982, xviii + 639 pp, $45. 
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subject overview that includes historical and referential information. List of contributors. 
Bibliography for Volume ITI is cumulative for three volumes. RJA 


Applications (Engineering), T*(15: 1). Reliability Evaluation of Engineering Systems: Concepts and 
Techniques. Roy Billinton, Ronald N. Allan. Plenum Pr, 1983, x + 349 pp, $42.50. [ISBN: 0-306- 
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son distributions as they apply to reliability, then considers graph theoretic and Markov chain 
applications. Easily read. An excellent source of examples of an important application of proba- 
bility. TAV 


Applications (Physics), P, L*. Academician Andrei Dmitrievich Sakharov: Collected Scientific Works. 
Ed: D. ter Haar, D.V. and G.V. Chudnovsky. Dekker, 1982, xvi + 303 pp, $27.50. [ISBN: 0-8247-1714~- 
7] 24 of Sakharov’s most important papers from 1947 through 1980, arranged in three groups (plasma 
physics, cosmology, and field theory), each group introduced by commentary by Sakharov and concluded 
by commentaries by other physicists. Also includes the text of Sakharov’s 1975 Nobel Peace Prize 
lecture, and lists of Sakharov’s “divertissements"--problems in mathematics and physics "undertaken 
as a domestic pasttime,” including Ramsey theory, approximate trisections, number theory algorithms. 
of six symmetry principles: the equivalence and symmetry principles for processes in isolated physi-
cal systems; the special and general symmetry evaluation principles; the special and general sym- 
metry evolution principles. Some examples and problems assume understanding of quantum theory. JRG 
Invited Addresses:


"Grammatical Representations in Natural Language Processing: Some Mathematical Results," by Ara- 
vind K. Joshi, University of Pennsylvania. 

"Artificial Intelligence: Basic Principles and Philosophical Issues," by Jaime G. Carbonell, 
Carnegie-Mellon University. 

"Logic and Its Uses in Artificial Intelligence and Databases," by Jack Minker, University of Mary- 
land.

"Expert Systems," by Casimir A. Kulikowski, Rutgers University.


Wisconsin Section 


The fifty-first annual meeting of the Wisconsin Section was held April 15-16, 1983 at the University
of Wisconsin Center, Washington County, West Bend, Wisconsin. 


Invited Addresses: 
"What To Do Till the Computer Scientist Goes Away," by Paul Halmos, Indiana University. 
"Topological, Combinatorial and Geometric Fractals," by James Cannon, University of Wisconsin at 
Madison. 


"Products of Involutions: Is 2 x 2 Always 4?" by Paul Halmos, Indiana University. 


Short Presentations: 


"An Anthology of Non-Measurable Sets," by Jonathan Lewin, University of Wisconsin at Oshkosh. 

"Computer Languages for Distributive Processing," by David Hayes (Student, Ripon College). 

"A Case Study in the Development of Microcomputer Software for Mathematics," by Ed Gade, Univer- 
sity of Wisconsin at Oshkosh.

"Independence of Axioms," by John Oman, University of Wisconsin at Oshkosh. 

"Solving Max-Min Problems Using Inequalities," by Robert Horton, University of Wisconsin at White- 
water. 

"Inverted Plane Curves," by Orville Bierman, University of Wisconsin at Eau Claire. 

"An Introduction to Secret Codes," by Linda Deneen, Beloit College. 

"Probabilistic Arguments in Queueing Models and Their Regulation," by Nasser Hadidi, University of 
Wisconsin at Stout.

"On Inequalities," by Steve Krevisky, University of Wisconsin Center, Washington County. 

"Constructing Projective Modules," by Andrew Machett, University of Wisconsin at LaCrosse. 

"Analytic Function Theory for Operators," by James Walker, University of Wisconsin at Eau Claire. 

"An Empirical Laboratory Course to Accompany Calculus I," by Dan Kalman, University of Wisconsin 
at Green Bay. 


Panel Discussion: 


"Mathematicians Teaching Computer Science," by Lois Brualdi, Edgewood College; Tom Napps, Lawrence 
University; Steve Senger, University of Wisconsin at LaCrosse; Bhagat Singh, University of 
Wisconsin Center at Manitowoc; Larry Thorsen, St. Norbert College. 


Northeastern Section 


The Spring meeting of the Northeastern Section was held at Bowdoin College in Brunswick, Maine 
on June 17-18, 1983. Approximately 90 participants attended the meeting. 


Invited Addresses: 


"Philosophers Who Think About Themselves: In Defense of Recreational Mathematics," by Raymond M. 
Smullyan, Lehman College, CUNY, Indiana University.
"New Directions--New Challenges in College Mathematics," by Donald L. Kreider, Dartmouth College.


Short Presentations: 


* "Designing an Introductory Course in Applied Mathematics," by Igor Najfeld, Tufts University. 

* "Mathematical Modeling: The Missing Component," by William E. Boyce, Rensselaer Polytechnic 
Institute.

* "How Are We Doing With the Next Generation?," by Florence D. Jacobson, Director of the Women and 
Mathematics Program for Connecticut, and Professor Emerita, Albertus Magnus College. 


Computer Use Workshops: 


"BASIC for Beginners," by Patricia Strauss, Rhode Island College. 

"PASCAL for the Non-Believer," by James W. Uebelacker, University of New Haven. 

"MAT LAB," by I. Gary Rosen, Bowdoin College. 

"muMATH," by David Ryan, Ansonia High School and University of Hartford. 

"SCRIPSIT," by Raymond J. McGivney, University of Hartford, and Edwin A. Rosenberg, Western Con- 
necticut State University. 
Contributed Papers:


"Computer-Generated Stereographic Images for Calculus of Several Variables," by Kenneth G. Hamilton,
Colby College.

"Fear of Calculus," by Stephanie F. Troyer, University of Hartford. 

"Calculus and Computer Modeling: Not Calculus as Usual," by Kenneth Hoffman, Hampshire College. 


Metropolitan New York Section 


The forty-second annual meeting of the Metropolitan New York section was held at St. John’s 
University on May 7, 1983. Approximately 150 persons were in attendance. 


Invited Presentations:


"Iteration of Complex Quadratic Polynomials," by Dennis Sullivan, Einstein Professor of Mathematics,
Queens College.

"Alternatives for the Freshman Mathematics Curriculum: Discrete Mathematics vs. the Calculus," by 
Ronald Douglas, SUNY at Stony Brook, and Anthony Ralston, SUNY at Buffalo. 


Panel Discussion:


"Meeting the Needs of Mathematically Talented High School Students," by Robert Cowen (Moderator),
Queens College. Panelists: Carl Goodman, Cardozo High School; Irwin Kaufman, New York City 
Board of Education; David Kelly, Hampshire College; Edwin Moise, Queens College; and Mark Saul, 
Bronx High School of Science. 


Short Presentations: 


"Microcomputers and Student Questionnaires in a Beginning Course in Statistics," by Geoffrey Akst, 
Manhattan Community College. 

"The Arkin-Hoggatt Game," by J. Arkin, Spring Valley, New York, and V.E. Hoggatt, San Jose, Cali- 
fornia. 

"A Comparison of the Distribution of Women’s and Men’s Calculus Grades," by Sherry Blackman, The 
College of Staten Island. 

"Convergence and Divergence of Infinite Series--A Probability Approach Through the St. Petersburg 
Paradox," by Allan J. Ceasar, U.S. Merchant Marine Academy.

"Hamilton’s Number Couples," by Alan Chutsky, Queensborough Community College.

"Statistics through the Eye of the Computer," by Florence S. Gordon, Adelphi University.

"Solved and Unsolved Problems in Axiomatic Geometry," by Martha Harrell, St. John’s University.

"Real Variable Inversion of Laplace Transforms," by Harvey J. Hinden, Hi-Tech Editorial, Inc. 

"Quantitative Modelling Techniques for Crime Prediction," by Alan Hoenig, John Jay College of 
Criminal Justice. 

"On the Distribution of Planar and Parabolic Points on Developable Surfaces," by Martin Lewinter, 
SUNY at Purchase. 

"A Matrix Equation for the Group Inverse," by Maurice Machover, St. John’s University. 

"The Right Student in the Right Mathematics Course: Placement with the CUNY Mathematics Test or An 
Instructor-Developed Test," by Seymour W. Pustilnik, New York City Technical College.

"A Classical Tensor Theory of Gravitation which Arises from the Principle of Equivalence," by 
David Shelupsky, The City College. 

"Math Enrichment for Elementary School Girls, A Pilot Study," by Helga Schwartz, Queensborough 
Community College. 

"Using Microcomputers to Improve Statistics Teaching," by Lawrence Sher, Borough of Manhattan Com- 
munity College. 

"A Poisson Process with a Markovian Intensity Function," by Frederick Solomon, SUNY at Purchase. 

"A Doubly Stochastic Poisson Process with Intensity Function Generated by Birth and Death Pro- 
cess," by Frederick Solomon, SUNY at Purchase. 


Awards: 


Noam D. Elkies of Columbia University received the Section award for the highest regional score 
in the William Putnam Mathematical Competition. 


Douglas Jongreis of G.W. Hewlett High School, Michael Reid of Brooklyn Technical High School,
and David Zuckerman of Stuyvesant High School received the Charles Salkind Awards. These three
students tied for the highest regional score in the MAA High School Mathematics Contest.


Texas Section 


The annual meeting of the Texas section was held at North Texas State University in Denton, 
Texas on April 8-9, 1983. Registered meeting attendance was 266. 
Invited Lectures:


"Paradoxical Coverings of the Real Line," by Ivan Niven, University of Oregon and President of 
MAA.

"Duality in Mathematics (The Role of Compact Hausdorff Spaces in Mathematics)," by S. Kakutani, 
Yale University. 

"Perceptions Regarding Discrete Mathematics and Calculus for the Freshman Year," by R.D. Anderson, 
Louisiana State University. 

"The State of School Mathematics in Texas," by Barbara Montalto, Texas Education Agency. 


Contributed Papers:


"Coloring Graphs and Combinatorial Geometries," by Joseph Kung, North Texas State University.

"A Note on $D_x(x^r)$ for Rational r," by Peter Lindstrom, North Lake College.

"Some Theory and Application of Geometric Inequalities," by J.M. Stark, Lamar University.

"Some Mappings of Sequence Spaces," by David F. Dawson, North Texas State University.

"Commutativity and Norm Conditions in Banach Algebras," by Victor A. Belfi, Texas Christian
University.

"Approximating a Function by Means of an M-polynomial," by Frank N. Huggins, University of Texas at
Arlington.

"S-essentially T, Spaces," by Charles Dorsett, Louisiana Tech University. 

"Local Weights in Metric Spaces," by Jim Bradford, Abilene Christian University. 

"Approximating the Length of Material on a Cylindrical Roll," by Frank H. Mathis and Danny W. 
Turner, Baylor University. 

"Symmetric Positive Systems with Nonlinear Boundary Conditions," by Alfonso Castro, Southwest 
Texas State University. 

"Expanding Mappings and Fixed Points," by A.A. Gillespie and B.B. Williams, University of Texas at 
Arlington. 

"Differential Equations with Piecewise Constant Delays," by Joseph Wiener, Pan American Univer- 
sity. 

"An Application of Discrete Cross-correlation," by Philip R. Sanfilippo, Farmers Branch, Texas. 

"Cloud and Precipitation Analysis," by C. Bandy and R. Torrejon, Southwest Texas State University. 

"TeX at Texas A&M University," by Norman W. Naugle, Texas A&M University. 

"Scientific Output Problems in Microcomputers," by Roy Dean Alston, Stephen F. Austin State 
University.

"Obtaining Sommerfeld’s Diffraction Integral via Fourier Transform," by E. Dennis Huthnance, 
Midwestern State University.

"Some Recent Experiences with Logo," by Norman W. Naugle, Texas A&M University. 

"Computer Literacy: A Call for the Problem-solving Approach," by Michael Murphy and Nancy Rich, 
University of Houston-Downtown College.

"A Proof-teaching Computer System," by Michael K. Jones, Fort Worth, Texas. 

"Teaching Mathematics in Africa," by Neal Hart, Sam Houston State University. 

"Secondary Mathematics Dilemma: An Elementary Solution," by Bill Aslan, East Texas State Univer- 
sity. 

"Another Approach in Teaching Arithmetic and Algebraic Fractions," by Bella Wiener, Pan American 
University. 

"The Fascinating Fur Farmer," by G. Marvin Eargle, Appalachian State University, Boone, North 
Carolina. 

"Injecting Vigor as a Prelude to Rigor or How to Teach College Algebra Like a Real Math Course," 
by G. Edgar Parker, Pan American University. 

"Preradicals Induced by Homomorphisms," by Ed Oxford, Baylor University. 

"The Splitting of the Kunneth Sequence," by Andre Deutz, Texas A&M University at Galveston. 

"Solving Radical Equations," by John Huber, Pan American University. 

"Axioms for C*-Algebras: A Survey from 1943-1983," by Robert S. Doran, Texas Christian University. 

"The Dunford-Pettis Property," by Tommy Leavelle, North Texas State University. 

"The Existence of Nonzero Maps from Injectives to Projectives," by Jeffrey W. Neslen, Texas Chris- 
tian University. 

"On G-finite Algebras," by Joseph M. Szucs, Texas A&M University at Galveston. 


Student Papers: 


"A Linear Analysis of Liniger and Ruegsegger’s Mathematical Model of Fibrinolysis," by Joseph F. 
Gerda, Jr., University of Texas at Arlington.

"Approximation of $e^x$ Using Maclaurin Series and Range Reduction," by Robert F. Jones, Sam Houston
State University. 

"Oscillation Properties of Solutions of First Order Linear Difference Equations," by Errol Mon- 
criffe, Texas Southern University. 

"Schensted’s Algorithm and Young Tableaux," by David C. Sutherland, North Texas State University. 

"On the Domain of Existence of First Order Partial Differential Equations," by M.E. Brewster and 
R. Kannan, University of Texas at Arlington. 

"Group Representation Theory with Applications," by Brice McIntyre, East Texas State University. 

"Unconditional and Absolute Convergence," by Catherine Abbott, North Texas State University. 


A "premeeting" Developmental Mathematics Workshop was sponsored by Addison-Wesley Publishing
Company. The speaker for this session was F. Demana of Ohio State University who spoke on "Articu- 
lation Between High Schools and Colleges." 


WHEN IS $R^2$ A DIVISION ALGEBRA?


STEVEN C. ALTHOEN AND LAWRENCE D. KUGLER
Department of Mathematics, The University of Michigan-Flint, Flint, MI 48503 


Under usual complex multiplication, the vector space $R^2$ is a division algebra, which is to say that
for all $u$, $v$, and $w$

(i) $u \cdot (v + w) = u \cdot v + u \cdot w$ and $(u + v) \cdot w = u \cdot w + v \cdot w$;

(ii) $a(u \cdot v) = (au) \cdot v = u \cdot (av)$ for every real number $a$ (making $R^2$ an algebra); and

(iii) for every $v$ and $u \neq 0$ in $R^2$, the equations $u \cdot x = v$ and $y \cdot u = v$ have unique solutions
in $R^2$ (making $R^2$ a division algebra).

Condition (iii) is equivalent to
(iii’)* $u \cdot v = 0$ implies $u = 0$ or $v = 0$.


The equivalence of conditions (iii) and (iii’) follows from the invertibility of left and right
translation (both linear transformations) by nonzero elements of the algebra.

Are there other ways of defining multiplication to make $R^2$ a division algebra? There are, of
course, division algebras of higher dimension, for example, the quaternions and Cayley numbers. 
It is well known that multiplication in these systems is completely determined by a multiplication 
table for a basis and that both are noncommutative, while the Cayley numbers are nonassociative. 

These facts suggest two things about an investigation of different multiplications for $R^2$. First,
it suffices to study basis multiplication since the multiplicative structure of an algebra is
completely determined by a multiplication table for any basis. The proof of this statement
involves a straightforward application of properties (i) and (ii) above. Second, the investigation
should include algebras that are noncommutative, nonassociative, or even lack a unit. Indeed, the 
following theorem shows that without easing conditions on the multiplication the search ends 
abruptly. 


THEOREM 1. If a 2-dimensional real division algebra either has a unit or is associative, it is C, the
complex numbers. 


Proof. Suppose the algebra has a unit, $e$. Extend $\{e\}$ to a basis $\{e, u\}$ for $R^2$. Let $v = u^2$. The
multiplication table


        e   u
    e   e   u
    u   u   v


determines the algebra. It is obvious that multiplication is commutative on the basis and it is not 
hard to check that it is associative there as well. It is also easy to verify that if the basis elements 
of a finite dimensional algebra commute (associate), then the algebra is commutative (associative). 
By a famous theorem of Frobenius (see [3], [6]) this algebra is C. 


S. C. Althoen: I received my Ph.D. in algebraic topology from the City University of New York. I then taught 
two years at Hofstra University. For the last seven years I have been at The University of Michigan-Flint. My 
nonmathematical interests include running and genealogy. Of the many individuals who have influenced my 
mathematical development, I wish to mention Robert M. McLeod at Kenyon College and my thesis advisor, Eldon 
Dyer. 

L. D. Kugler: I received my doctorate from the University of California, Los Angeles, where I was a student of 
Abraham Robinson, working in nonstandard analysis. I have been at The University of Michigan-Flint since 1966, 
where I now am Dean of the College of Arts and Sciences. To maintain sanity, I continue to teach and also play the 
bass clarinet. 


*Some authors replace (iii’) with the existence of multiplicative inverses. This is equivalent only in the 
associative, finite-dimensional case (see [1]). Also, solvability of just one equation in (iii) is, in fact, equivalent to 
(iii’). 
Suppose the algebra is associative. Since it is a division algebra, its nonzero elements form a
semigroup for which all equations $u \cdot x = v$ and $y \cdot u = v$ have unique solutions. Such a
semigroup must be a group [2], so that the algebra has a unit and must be C as before. ∎


The associativity assumption in Theorem 1 can be weakened. For example, the same result
holds for alternative algebras in which
$u \cdot (u \cdot v) = (u \cdot u) \cdot v$ and $u \cdot (v \cdot v) = (u \cdot v) \cdot v$


for all $u$ and $v$. To see this, investigate the product $(u + v) \cdot [(u + v) \cdot w]$.
Commutativity alone does not force a 2-dimensional division algebra to be C. Consider the
multiplication table


        u    v
    u   2u   v
    v   v    -u


This defines a division algebra, for if $a = a_1 u + a_2 v \neq 0$, $b = b_1 u + b_2 v$, and $x = x_1 u + x_2 v$,
then the equation
$a \cdot x = b$
yields the system
$2a_1 x_1 - a_2 x_2 = b_1$
$a_2 x_1 + a_1 x_2 = b_2$


which has a unique solution for any vector $b$ since the determinant $2a_1^2 + a_2^2$ is nonzero. This
algebra is not associative: $(u \cdot u) \cdot v = 2v$ while $u \cdot (u \cdot v) = v$; hence it cannot be C.
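
The claims about this example can be checked mechanically. The following is a minimal sketch, not part of the original argument: it assumes the structure constants $u \cdot u = 2u$, $u \cdot v = v \cdot u = v$, $v \cdot v = -u$ (the values forced by the linear system above), and the names `TABLE`, `mul`, and `solve_left` are illustrative only.

```python
from fractions import Fraction

# Assumed structure constants, read off the linear system in the text:
# u*u = 2u, u*v = v*u = v, v*v = -u.
# TABLE[i][j] is the (u, v) coordinate pair of (basis i) * (basis j).
TABLE = [[(2, 0), (0, 1)],
         [(0, 1), (-1, 0)]]

def mul(x, y, table=TABLE):
    """Bilinear product of coordinate pairs x and y in the basis {u, v}."""
    out = [0, 0]
    for i in range(2):
        for j in range(2):
            for k in range(2):
                out[k] += x[i] * y[j] * table[i][j][k]
    return tuple(out)

def solve_left(a, b):
    """Solve a*x = b by Cramer's rule; a must be nonzero, entries integers."""
    a1, a2 = a
    b1, b2 = b
    det = 2 * a1 * a1 + a2 * a2  # the determinant 2*a1^2 + a2^2 from the text
    return (Fraction(a1 * b1 + a2 * b2, det),
            Fraction(2 * a1 * b2 - a2 * b1, det))

u, v = (1, 0), (0, 1)
left = mul(mul(u, u), v)    # (u*u)*v
right = mul(u, mul(u, v))   # u*(u*v)
x = solve_left((3, -2), (5, 7))
```

Here `left` and `right` differ, which is the failure of associativity noted above, while `mul((3, -2), x)` reproduces the right-hand side `(5, 7)`.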

Now that we have an example of a 2-dimensional division algebra that is not C, it is natural to 
ask: Can we find and classify all 2-dimensional real division algebras? 


1. Idempotents. We adopt the standard approach to the classification problem and consider 
the general multiplication table for a 2-dimensional division algebra 


TABLE 1
        u                         v
    u   $a_{11}u + b_{11}v$   $a_{12}u + b_{12}v$
    v   $a_{21}u + b_{21}v$   $a_{22}u + b_{22}v$


where $\{u, v\}$ is a basis and $a_{ij}$, $b_{ij}$ are real numbers. Classification will be easier if we can simplify
this table. Complex multiplication suggests the possible simplification $a_{11} = 1$, $b_{11} = 0$. Thus, we
are led to consider idempotents: nonzero elements $w$ for which $w^2 = w$.

Conditions for an idempotent are easily obtained. Set $w = xu + yv$, with $x, y$ in $R$.


Since $\{u, v\}$ is a basis, the equation $w^2 = w$ yields


(1) $x = a_{11}x^2 + (a_{12} + a_{21})xy + a_{22}y^2$
    $y = b_{11}x^2 + (b_{12} + b_{21})xy + b_{22}y^2$.
Nontrivial solutions of these conic equations correspond to idempotents. A conic is determined by 
five points no three of which are collinear. Thus, one expects these conics to intersect at 0, 1, 2, 3, 
or infinitely many points other than the origin, yielding algebras with the corresponding number 
of idempotents. Each of these cases actually occurs. Examples i = 0, 1, 2, and 3 are algebras with 
exactly i idempotents. Example 4 is an algebra with infinitely many idempotents. 


EXAMPLE 0. The trivial algebra: all products zero. 
EXAMPLE 1. C.
EXAMPLE 2.


EXAMPLE 3.


        u            v
    u   11u + 6v     4u + 7v
    v   6u + 5v      2u + 5v


The idempotents are: $2u - 3v$, $-u + 2v$, and $(1/23)(u + v)$.
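
The three idempotents of Example 3 can be confirmed with exact rational arithmetic. This is a small illustrative check, assuming the table entries $u \cdot u = 11u + 6v$, $u \cdot v = 4u + 7v$, $v \cdot u = 6u + 5v$, $v \cdot v = 2u + 5v$; the names `EX3`, `mul`, and `is_idempotent` are hypothetical helpers, not notation from the paper.

```python
from fractions import Fraction

# Example 3 as structure constants: EX3[i][j] = (u, v) coordinates of
# (basis i) * (basis j), with index 0 for u and 1 for v.
EX3 = [[(11, 6), (4, 7)],
       [(6, 5), (2, 5)]]

def mul(x, y, table):
    """Bilinear product of coordinate pairs x and y in the basis {u, v}."""
    out = [0, 0]
    for i in range(2):
        for j in range(2):
            for k in range(2):
                out[k] += x[i] * y[j] * table[i][j][k]
    return tuple(out)

def is_idempotent(w, table):
    """True when w is nonzero and w*w == w."""
    return any(w) and mul(w, w, table) == tuple(w)

candidates = [(2, -3), (-1, 2), (Fraction(1, 23), Fraction(1, 23))]
results = [is_idempotent(w, EX3) for w in candidates]
```

Each entry of `results` is true, whereas a basis vector such as `(1, 0)` fails the test because $u \cdot u = 11u + 6v \neq u$.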


EXAMPLE 4. 


Here $tu + (1 - t)v$ is idempotent for all real numbers $t$.
If we now restrict our attention to division algebras, how many idempotents can there be? 


THEOREM 2. Every 2-dimensional real division algebra has at least one idempotent. 


Proof. In Table 1, if $b_{11} = 0$, then $a_{11} \neq 0$, for otherwise $u^2 = 0$, contradicting condition (iii’).
Then $(1/a_{11})u$ is idempotent. Similarly, if $a_{22} = 0$, then $(1/b_{22})v$ is idempotent. These idempotents
correspond to an intersection of the conic equations (1) at a point on either the $x$ or $y$ axis. If
$b_{11} \neq 0$ and $a_{22} \neq 0$, then perhaps a rotation would yield a new basis $\{u', v'\}$ such that $(u')^2 = fu'$,
from which we would get the idempotent $(1/f)u'$. To this end, let


$u' = cu + sv$
$v' = -su + cv$,
where $c$ and $s$ are the cosine and sine of the rotation angle. Then
(2) $u' \cdot u' = f(c, s)u' + g(c, s)v'$,
where $f$ and $g$ are the homogeneous cubic polynomials

$f(x, y) = a_{11}x^3 + (a_{12} + a_{21} + b_{11})x^2 y + (b_{12} + b_{21} + a_{22})xy^2 + b_{22}y^3$,

$g(x, y) = b_{11}x^3 + (b_{12} + b_{21} - a_{11})x^2 y - (a_{12} + a_{21} - b_{22})xy^2 - a_{22}y^3$.
Let $r$ be a real root of the cubic equation $g(1, y) = 0$ and set

$c = 1/\sqrt{r^2 + 1}$, $s = r/\sqrt{r^2 + 1}$.
Since $g$ is homogeneous, $g(c, s) = 0$. It follows from (2) that
$u' \cdot u' = f(c, s)u'$,


where $f(c, s) \neq 0$ because $(u')^2 \neq 0$. Thus, $u'/f(c, s)$ is an idempotent. ∎
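
The construction in this proof is effective, and a numerical sketch may make it concrete. The code below is only an illustration under stated assumptions: it locates a real root of $g(1, y)$ by bisection (the proof only requires that some real root exists, so the root-finding method is our choice), it assumes $a_{22} \neq 0$, and it is run on the table of Example 3, where the construction goes through because $f(c, s) \neq 0$. The function name `idempotent_by_rotation` is hypothetical.

```python
import math

def idempotent_by_rotation(a11, b11, a12, b12, a21, b21, a22, b22):
    """Return coordinates (x, y) of an idempotent x*u + y*v for Table 1."""
    A, B = a12 + a21, b12 + b21

    def f(x, y):
        return (a11 * x**3 + (A + b11) * x**2 * y
                + (B + a22) * x * y**2 + b22 * y**3)

    def g(x, y):
        return (b11 * x**3 + (B - a11) * x**2 * y
                - (A - b22) * x * y**2 - a22 * y**3)

    if a22 == 0:
        raise ValueError("a22 = 0: handle this case directly, as in the proof")

    # Cauchy bound: every real root of the cubic g(1, y) lies in (-bound, bound),
    # so g(1, .) takes opposite signs at the two endpoints.
    bound = 1 + max(abs(b11), abs(B - a11), abs(A - b22)) / abs(a22)
    lo, hi = -bound, bound
    sign_lo = g(1, lo) > 0
    for _ in range(200):
        mid = (lo + hi) / 2
        g_mid = g(1, mid)
        if g_mid == 0:
            lo = hi = mid
            break
        if (g_mid > 0) == sign_lo:
            lo = mid
        else:
            hi = mid
    r = (lo + hi) / 2

    # Rotation with cosine c and sine s, so that u' = c*u + s*v.
    norm = math.hypot(1.0, r)
    c, s = 1 / norm, r / norm
    scale = f(c, s)
    if scale == 0:
        raise ValueError("f(c, s) = 0: (u')^2 = 0, so this is not a division algebra")
    return (c / scale, s / scale)

# Example 3: a11, b11, a12, b12, a21, b21, a22, b22
w = idempotent_by_rotation(11, 6, 4, 7, 6, 5, 2, 5)
```

For Example 3 the bisection lands on the root $r = 1$, and `w` agrees with the idempotent $(1/23)(u + v)$ up to rounding. A different bracketing strategy could find one of the other two roots instead.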


COROLLARY. Every 2-dimensional real division algebra D contains a subalgebra isomorphic to R. 


Proof. The injection $i: R \to D$ given by $i(r) = ru$ is an isomorphism for any idempotent $u$. ∎
Theorem 2 implies that every 2-dimensional real division algebra has a multiplication table of
the form
TABLE 2
        u                         v
    u   u                         $a_{12}u + b_{12}v$
    v   $a_{21}u + b_{21}v$   $a_{22}u + b_{22}v$


In what follows we assume that the multiplication table is in this form. 

The table of Example 4 is already in the form of Table 2. However, this algebra with infinitely
many idempotents is not a division algebra since $(u - v)^2 = 0$.

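This example points out the need for an effective way to determine whether an algebra is a
division algebra. Paralleling our earlier example, let

$a = a_1 u + a_2 v \neq 0$, $b = b_1 u + b_2 v$, $x = x_1 u + x_2 v$.
Then by Cramer’s Rule, the equation
$a \cdot x = b$

always has a solution if and only if

$Q(a_1, a_2) = \begin{vmatrix} a_1 + a_{21}a_2 & a_{12}a_1 + a_{22}a_2 \\ a_2 b_{21} & a_1 b_{12} + a_2 b_{22} \end{vmatrix} \neq 0$

for all pairs of real numbers $(a_1, a_2) \neq (0,0)$. If we set

$\Delta_1 = \begin{vmatrix} a_{12} & b_{12} \\ a_{21} & b_{21} \end{vmatrix}$, $\Delta_2 = \begin{vmatrix} a_{21} & b_{21} \\ a_{22} & b_{22} \end{vmatrix}$,

the quadratic form $Q(a_1, a_2) = b_{12}a_1^2 + (b_{22} - \Delta_1)a_1 a_2 + \Delta_2 a_2^2$ has discriminant
$(b_{22} - \Delta_1)^2 - 4b_{12}\Delta_2$.
Therefore, $Q(x, y)$ is nonzero for all $(x, y) \neq (0,0)$ if and only if this discriminant is negative.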
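THEOREM 3. An algebra described by Table 2 is a division algebra if and only if


(3) $(b_{22} - \Delta_1)^2 < 4b_{12}\Delta_2$,
where
$\Delta_1 = \begin{vmatrix} a_{12} & b_{12} \\ a_{21} & b_{21} \end{vmatrix}$, $\Delta_2 = \begin{vmatrix} a_{21} & b_{21} \\ a_{22} & b_{22} \end{vmatrix}$.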
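
Condition (3) is easy to apply by machine. The sketch below is illustrative only: it takes the six coefficients of a Table 2 presentation, with $\Delta_1 = a_{12}b_{21} - a_{21}b_{12}$ and $\Delta_2 = a_{21}b_{22} - a_{22}b_{21}$ expanded from the determinants above, and also evaluates the determinant $Q(a_1, a_2)$ directly as a cross-check. The function names are hypothetical, and exact (integer or rational) inputs are assumed so that the strict inequality is meaningful.

```python
def deltas(a12, b12, a21, b21, a22, b22):
    """The two 2x2 determinants Delta_1 and Delta_2 of Theorem 3."""
    delta1 = a12 * b21 - a21 * b12
    delta2 = a21 * b22 - a22 * b21
    return delta1, delta2

def is_division_algebra(a12, b12, a21, b21, a22, b22):
    """Condition (3): (b22 - Delta_1)^2 < 4 * b12 * Delta_2."""
    delta1, delta2 = deltas(a12, b12, a21, b21, a22, b22)
    return (b22 - delta1) ** 2 < 4 * b12 * delta2

def Q(a1, a2, a12, b12, a21, b21, a22, b22):
    """Determinant of the linear system a*x = b for a = a1*u + a2*v."""
    return ((a1 + a21 * a2) * (a1 * b12 + a2 * b22)
            - (a12 * a1 + a22 * a2) * (a2 * b21))

# The complex numbers with u = 1, v = i: u*v = v*u = v, v*v = -u.
COMPLEX = (0, 1, 0, 1, -1, 0)
# A table with b12 = 0 (here u*v = 0), which cannot be a division algebra.
DEGENERATE = (0, 0, 0, 1, -1, 0)
```

For the complex numbers, `Q(a1, a2, *COMPLEX)` reduces to $a_1^2 + a_2^2$, which vanishes only at the origin, in agreement with `is_division_algebra(*COMPLEX)`.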


An example of a 2-dimensional division algebra with only one idempotent is C. Can a 
2-dimensional division algebra have two or more idempotents? 

If the algebra has two distinct idempotents $u$ and $v$, then $\{u, v\}$ forms a basis. Otherwise,
$u = av$, $a \neq 0$ and


$av = u = u^2 = (av)^2 = a^2 v^2 = a^2 v$


so that $a = a^2$, $a = 1$ and $u = v$. Table 1 now takes the form


TABLE 3
        u                         v
    u   u                         $a_{12}u + b_{12}v$
    v   $a_{21}u + b_{21}v$   v

LEMMA. In a 2-dimensional real division algebra with multiplication table given by Table 3,
$(a_{12} + a_{21})(b_{12} + b_{21}) \neq 1$.
Proof. Let $A = a_{12} + a_{21}$, $B = b_{12} + b_{21}$, $C = a_{12}b_{21}$, $D = a_{21}b_{12}$. If $AB = 1$, then

(4) $(a_{12} + a_{21})(b_{12} + b_{21}) = 1$.

The discriminant condition (3) becomes
(5) $C^2 + D^2 - 2CD - 2C - 2D + 1 < 0$.


From (5) we see that neither $C$ nor $D$ is zero. Thus
$b_{21} = C/a_{12} \neq 0$ and $b_{12} = D/a_{21} \neq 0$.
If we substitute these expressions into (4) and let $E = a_{12}/a_{21}$, we obtain
$DE^2 + (C + D - 1)E + C = 0$.
Thus $E$ is a solution to a quadratic; so the discriminant
(6) $C^2 + D^2 - 2CD - 2C - 2D + 1 \geq 0$.
Since (5) and (6) are contradictory, the lemma is proved. ∎


THEOREM 4. A 2-dimensional real division algebra has exactly one, two, or three idempotents. 


Proof. If the algebra has 2 idempotents $u$ and $v$, then it can be presented as in Table 3. For
such a table, the system (1) of conics reduces to


$x = x^2 + Axy$
$y = Bxy + y^2$
so that
$x = 0$ or $x = -Ay + 1$,
$y = 0$ or $y = -Bx + 1$.


If $A = 1$ or $B = 1$, then since $AB \neq 1$, these degenerate conics intersect at precisely three points:
$(0, 0)$, $(1, 0)$, and $(0, 1)$, yielding exactly two idempotents $u$ and $v$. Now suppose $A \neq 1$ and $B \neq 1$.
The lemma implies that the lines $x = -Ay + 1$ and $y = -Bx + 1$ are neither parallel nor
coincident. Thus, they will intersect to determine the unique third idempotent

$\frac{A - 1}{AB - 1}u + \frac{B - 1}{AB - 1}v$.
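
The formula for the third idempotent can be tried on a concrete Table 3 algebra. The sketch below is an illustration only: the algebra with $u \cdot v = v \cdot u = u + v$ (that is, $a_{12} = b_{12} = a_{21} = b_{21} = 1$) is a hypothetical example chosen here, not one taken from the paper, and integer coefficients are assumed so that `Fraction` can be used. It satisfies condition (3) with $a_{22} = 0$, $b_{22} = 1$, since $(1 - 0)^2 < 4 \cdot 1 \cdot 1$.

```python
from fractions import Fraction

def third_idempotent(a12, b12, a21, b21):
    """Coordinates of the third idempotent of a Table 3 algebra, or None.

    Returns None when A = 1 or B = 1, the two-idempotent case. By the lemma,
    A*B - 1 is nonzero for a division algebra, so the division is safe there.
    """
    A, B = a12 + a21, b12 + b21
    if A == 1 or B == 1:
        return None
    denom = A * B - 1
    return (Fraction(A - 1, denom), Fraction(B - 1, denom))

def square_table3(w, a12, b12, a21, b21):
    """w*w for w = x*u + y*v, using u*u = u, v*v = v and the Table 3 products."""
    x, y = w
    return (x * x + x * y * (a12 + a21),
            x * y * (b12 + b21) + y * y)

# Hypothetical example: u*v = v*u = u + v.
w = third_idempotent(1, 1, 1, 1)
```

Here `w` is $(u + v)/3$, and `square_table3(w, 1, 1, 1, 1)` returns `w` again, as the proof predicts.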

How can algebras with two or three idempotents be distinguished from those with only one? 

The function $g(1, y)$ in the proof of Theorem 2 does this. First, note that $g(1, y)$ is defined even if
$b_{11} = 0$. Now for algebras with Table 2 we have


$g(1, y) = -y[a_{22}y^2 + (A - b_{22})y + (1 - B)]$.
Distinct real roots of this polynomial yield distinct cosines, $c$, which in turn produce distinct
idempotents $u'$. The root 0 corresponds to the idempotent $u$. The quadratic factor of $g(1, y)$ will
have no real roots if and only if its discriminant is negative. Thus, we have proved the following
proposition.


PROPOSITION. An algebra has exactly one idempotent if and only if
(7) $(A - b_{22})^2 < 4a_{22}(1 - B)$,
where $A = a_{12} + a_{21}$ and $B = b_{12} + b_{21}$. ∎
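
Condition (7) is a one-line test on the coefficients of Table 2. The following sketch is illustrative (the function name is hypothetical, and exact inputs are assumed); it is tried on the complex numbers and on a Table 3 algebra with $u \cdot v = v \cdot u = u + v$, a hypothetical example in which $v$ is a second idempotent by construction.

```python
def has_single_idempotent(a12, b12, a21, b21, a22, b22):
    """Condition (7): the quadratic factor of g(1, y) has no real root."""
    A = a12 + a21
    B = b12 + b21
    return (A - b22) ** 2 < 4 * a22 * (1 - B)

# The complex numbers with u = 1, v = i (Table 2 coefficients).
single = has_single_idempotent(0, 1, 0, 1, -1, 0)
# Hypothetical Table 3 algebra: u*v = v*u = u + v, v*v = v.
several = has_single_idempotent(1, 1, 1, 1, 0, 1)
```

The first call is true, the second false, matching the fact that C has 1 as its only idempotent.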


2. The Two-Idempotent Case. An algebra with exactly two idempotents $u$ and $v$ can be
presented by Table 3 where either $A = 1$ or $B = 1$ but not both. Interchanging basis elements
switches $A$ and $B$, so we may take $A = 1$ and $B \neq 1$. Since automorphisms preserve idempotents,
a two-idempotent algebra is uniquely determined by a table with $A = 1$.


THEOREM 5. Let $\mathcal{A}$ and $\mathcal{A}'$ be 2-dimensional real division algebras, each with exactly two
idempotents. Then $\mathcal{A}$ and $\mathcal{A}'$ are isomorphic if and only if their presentations by Table 3 with $A = 1$
are identical. ∎


Example 2 above defines a 2-dimensional real division algebra with exactly two idempotents. 


3. The Three-Idempotent Case. For an algebra with three idempotents $u$, $v$, and $w$, any pair,
say $\{u, v\}$, may be used as a basis for a multiplication table:


        u                         v
    u   u                         $a_{12}u + b_{12}v$
    v   $a_{21}u + b_{21}v$   v


The third idempotent, $w$, was given in the proof of Theorem 4.
It is straightforward to construct the multiplication tables based on $\{u, w\}$ and $\{v, w\}$:


        u                         w
    u   u                         $c_{12}u + b_{12}w$
    w   $c_{21}u + b_{21}w$   w

        v                         w
    v   v                         $c_{12}v + a_{21}w$
    w   $c_{21}v + a_{12}w$   w


where


$c_{12} = \frac{a_{12}b_{21} - (b_{12} - 1)(a_{21} - 1)}{AB - 1}$
and $c_{21}$ is obtained by interchanging subscripts. The classification theorem for three-idempotent
algebras now follows from the fact that an algebra isomorphism preserves idempotents and is
completely determined by its effect on a basis.


THEOREM 6. Let $\mathcal{A}$ and $\mathcal{A}'$ be 2-dimensional real division algebras, each with three idempotents.
Then $\mathcal{A}$ and $\mathcal{A}'$ are isomorphic if and only if there is a correspondence between the idempotents for
which the appropriate multiplication tables are identical. ∎


4. The Single-Idempotent Case. For a single-idempotent division algebra, the coefficient $a_{22}$ in
Table 2 must be nonzero, for otherwise $(1/b_{22})v$ is another idempotent. One approach to
simplification is a basis change to make $b_{22}$ vanish. In fact, the next theorem shows when it is
possible to simplify Table 2 to


TABLE 4
        u                         v
    u   u                         $a_{12}u + b_{12}v$
    v   $a_{21}u + b_{21}v$   $\pm u$


THEOREM 7. Let $\mathcal{A}$ be a single-idempotent 2-dimensional real division algebra given by Table 2.
If $b_{12} + b_{21} = B \neq 0$, then $\mathcal{A}$ has a basis $\{u, w\}$ for which $w \cdot w = \pm u$. Furthermore, $w$ is unique up
to sign.


Proof. We must find $\alpha$ and $\beta \neq 0$ so that $w = \alpha u + \beta v$ satisfies $w \cdot w = \pm u$. Calculating $w \cdot w$ by
Table 2 and equating coefficients yields

$\alpha^2 + \alpha\beta A + \beta^2 a_{22} = \pm 1$

$\alpha\beta B + \beta^2 b_{22} = 0$,

where $A = a_{12} + a_{21}$.
If $b_{22} = 0$, then since $B \neq 0$, we have $\alpha = 0$ and $\beta = \pm 1/\sqrt{|a_{22}|}$ so that $w = \pm v/\sqrt{|a_{22}|}$.
If $b_{22} \neq 0$, we obtain


$\alpha^2 (1 - AB/b_{22} + B^2 a_{22}/b_{22}^2) = \pm 1$,
which can be solved uniquely up to sign for $\alpha$ by selecting the sign on the right to match the sign
of the coefficient of $\alpha^2$. ∎
Here is an example: 


EXAMPLE 5.
        u        v
    u   u        v
    v   u + v    -u


The apparent table dependence of the condition $B \neq 0$ seems to complicate the classification
problem. But a straightforward investigation of the basis change from $\{u, v\}$ to $\{u, w = \alpha u + \beta v\}$,
$\beta \neq 0$, using Table 2 shows that


(8) $u \cdot w = (\alpha + a_{12}\beta - b_{12}\alpha)u + b_{12}w$,
(9) $w \cdot u = (\alpha + a_{21}\beta - b_{21}\alpha)u + b_{21}w$,
which proves the following proposition.


PROPOSITION. In a 2-dimensional real algebra with idempotent $u$, the coefficients $b_{12}$ and $b_{21}$ in
Table 2 are invariant under change of basis from $\{u, v\}$ to $\{u, w\}$, as is $B = b_{12} + b_{21}$. ∎


The invariance of $B$ and the fact that the complex numbers satisfy the condition $B \neq 0$
motivate the next definition.


DEFINITION. A 2-dimensional real division algebra given by Table 2 is quasicomplex if $B \neq 0$.


Real 2-dimensional quasicomplex division algebras (including C) are classified in the next 
theorem. 


THEOREM 8. Let $\mathcal{A}$ and $\mathcal{A}'$ be quasicomplex 2-dimensional real division algebras, each with one
idempotent, presented by multiplication tables $T$ and $T'$ in the form of Table 4 on the bases $\{u, v\}$ and
$\{u', v'\}$, respectively. Let $T''$ be the table for $\mathcal{A}'$ based on $\{u', -v'\}$. Then $\mathcal{A}$ and $\mathcal{A}'$ are isomorphic if
and only if the coefficients of $T$ are identical to those of $T'$ or $T''$.


Proof. If $\varphi: \mathcal{A} \to \mathcal{A}'$ is an isomorphism, then $\varphi(u) = u'$ since the algebras have only one
idempotent, and $(\varphi(v))^2 = \pm u'$. By the uniqueness of $v'$ up to sign, $\varphi(v) = \pm v'$. Since $\{u, v\}$
and $\{u', \varphi(v)\}$ are bases, the tables must have the same coefficients. ∎


Another approach to the simplification of Table 2 is to find a basis so that the table will
resemble the complex numbers in having $a_{12} = a_{21} = 0$. This will be particularly useful if the
algebra is not quasicomplex, but has independent interest. Such a basis change can be studied as
before by investigating the effect on Table 2, but there is another way. A vector $w = \alpha u + \beta v$,
$\beta \neq 0$, for which $u \cdot w = k_1 w$ and $w \cdot u = k_2 w$ is a solution to two simultaneous eigenvector
problems. In fact, let $R_u$ and $L_u$ be the linear transformations on $R^2$ defined by $R_u(z) = u \cdot z$ and
$L_u(z) = z \cdot u$. Then $w$ must satisfy

$R_u(w) = k_1 w$,
$L_u(w) = k_2 w$.


The matrices for $R_u$ and $L_u$ with respect to the basis $\{u, v\}$ are, respectively,


$\begin{pmatrix} 1 & a_{12} \\ 0 & b_{12} \end{pmatrix}$ and $\begin{pmatrix} 1 & a_{21} \\ 0 & b_{21} \end{pmatrix}$

with eigenvalues 1, corresponding to the eigenvector $u$, and $b_{12} = k_1$ and $b_{21} = k_2$, corresponding to
$w$. The simultaneous eigenvector equations are


(10) $(1 - b_{12})\alpha + a_{12}\beta = 0$
     $(1 - b_{21})\alpha + a_{21}\beta = 0$
which will have nontrivial solutions if and only if the determinant


$\begin{vmatrix} 1 - b_{12} & a_{12} \\ 1 - b_{21} & a_{21} \end{vmatrix} = \Delta_1 - (a_{12} - a_{21})$


is zero. The solvability of the simultaneous eigenvector problem is independent of the basis vector
$v$ in Table 2, so if $\Delta_1 = a_{12} - a_{21}$ for one basis as in Table 2, the same relation holds for any other
basis as in Table 2. This invariance makes possible the following definition, suggested by the fact
that for a commutative algebra $\Delta_1 = 0 = a_{12} - a_{21}$.


DEFINITION. A 2-dimensional real division algebra given by Table 2 is quasicommutative if
$\Delta_1 = a_{12} - a_{21}$, where


$\Delta_1 = \begin{vmatrix} a_{12} & b_{12} \\ a_{21} & b_{21} \end{vmatrix}$.


THEOREM 9. A quasicommutative 2-dimensional real division algebra given by Table 2 in which
$b_{12}$ and $b_{21}$ are not both 1 has a basis $\{u, v\}$ with multiplication table


TABLE 5
        u             v
    u   u             $b_{12}v$
    v   $b_{21}v$   $a_{22}u + b_{22}v$


where $a_{22} = \pm 1$.


Proof. As we have seen, quasicommutativity guarantees the existence of the simultaneous
eigenvector, and since either $b_{12} \neq 1$ or $b_{21} \neq 1$, there is a solution of (10) with $\beta \neq 0$. (If $b_{12}$ or
$b_{21}$ is 1, it follows immediately from the definition of quasicommutativity that the corresponding
$a_{12}$ or $a_{21}$ is zero.) Since $a_{22} \neq 0$ and $v$ is an eigenvector, the pair $\{u, v/\sqrt{|a_{22}|}\}$ is a basis of
eigenvectors whose multiplication table has $a_{22} = \pm 1$. ∎


Here is an example. 


EXAMPLE 6.
        u     v
    u   u     v
    v   -v    u + v


The classification of nonquasicomplex algebras begins with the quasicommutative case. 


THEOREM 10. Let $\mathcal{A}$ and $\mathcal{A}'$ be nonquasicomplex, quasicommutative 2-dimensional real division
algebras. Then each has a unique multiplication table in the form of Table 5. The algebras $\mathcal{A}$ and $\mathcal{A}'$
are isomorphic if and only if the coefficients in these tables are identical.


Proof. Since $B = b_{12} + b_{21} = 0$, $b_{12}$ and $b_{21}$ cannot both equal 1. Thus, by Theorem 9, $\mathcal{A}$ and
$\mathcal{A}'$ have Table 5 presentations. These are unique because the idempotent and the normalized
solution to the eigenvector problem are unique. Since an isomorphism must preserve both, the
coefficients agree.


Note that a nonquasicomplex, quasicommutative division algebra is not commutative, since
$b_{12} = b_{21} \neq 0$ is ruled out.

The only remaining algebras are those which are neither quasicommutative nor quasicomplex. 
Apparently, the best Table 2 simplification available is to make $a_{12} = a_{21} = 1$.


THEOREM 11. A nonquasicommutative, nonquasicomplex 2-dimensional real division algebra with
exactly one idempotent has a unique multiplication table in the form of Table 6.


TABLE 6
        u                 v
    u   u                 $u + b_{12}v$
    v   $u - b_{12}v$   $a_{22}u + b_{22}v$


Two such algebras are isomorphic if and only if the respective coefficients in Table 6 for each algebra
are identical.


Proof. Setting the coefficients of u in equations (8) and (9) equal to 1 and noting that
$b_{21} = -b_{12}$, we obtain

$$\begin{aligned} (1 - b_{12})\alpha + a_{12}\beta &= 1, \\ (1 + b_{12})\alpha + a_{21}\beta &= 1. \end{aligned} \tag{11}$$

Since the algebra is not quasicommutative,

$$\begin{vmatrix} 1 - b_{12} & a_{12} \\ 1 + b_{12} & a_{21} \end{vmatrix} \neq 0,$$

so (11) has a unique solution. By Cramer’s Rule, $\beta = 0$ in this solution if and only if $b_{12} = 0$. But
if $b_{12} = 0$, then $u \cdot (a_{12}u - v) = 0$, so that $a_{12}u - v = 0$ by (i), contradicting the fact that
$\{u, v\}$ is a basis. Thus, this unique solution will yield Table 6. Since the table is unique, the
isomorphism result is immediate. ∎


Here is an example. 


EXAMPLE 7.

        |  u          v
    ----+------------------------
     u  |  u          u + v
     v  |  u − v      2u + v


5. Summary. Given a 2-dimensional real division algebra, one may select a basis so that the 
multiplication table assumes one of the following four forms: 


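TABLE 3
(Two or three idempotents)

        |  u                      v
    ----+------------------------------------------------
     u  |  u                      $a_{12}u + b_{12}v$
     v  |  $a_{21}u + b_{21}v$    v


TABLE 4
(Quasicomplex; includes C)

        |  u                      v
    ----+------------------------------------------------
     u  |  u                      $a_{12}u + b_{12}v$
     v  |  $a_{21}u + b_{21}v$    $\pm u$


TABLE 5
(Quasicommutative, not quasicomplex)

        |  u              v
    ----+----------------------------------------
     u  |  u              $b_{12}v$
     v  |  $b_{21}v$      $\pm u + b_{22}v$


TABLE 6
(Neither quasicommutative nor quasicomplex)

        |  u                 v
    ----+------------------------------------------
     u  |  u                 $u + b_{12}v$
     v  |  $u - b_{12}v$     $a_{22}u + b_{22}v$
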
Tables 5 and 6 are uniquely determined by the algebra, and Table 4 has only a sign ambiguity in
v. A two-idempotent algebra is uniquely determined by Table 3 with $a_{12} + a_{21} = 1$. A three-idempotent
algebra has 3 tables like Table 3, but each is readily obtained from any other. Note
that these tables do not automatically define division algebras by virtue of their form. In fact,
setting $b_{12} = 0$ in any one of them results in an algebra with zero divisors. It is therefore necessary
to verify that condition (3) is satisfied.
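
The need for such a verification can be illustrated numerically. The following is a minimal sketch, not the paper's condition (3) itself (which is stated earlier and is not reproduced here): it assumes the standard fact that a finite-dimensional algebra is a division algebra exactly when left multiplication by every nonzero element $x = \alpha u + \beta v$ is invertible, and tests whether the determinant of that map, a quadratic form $A\alpha^2 + B\alpha\beta + C\beta^2$, has no nontrivial real zero. The function name and argument order are hypothetical.

```python
def is_division_algebra(a11, b11, a12, b12, a21, b21, a22, b22):
    """Test a 2-dimensional real algebra with basis {u, v} and products
        u*u = a11 u + b11 v,   u*v = a12 u + b12 v,
        v*u = a21 u + b21 v,   v*v = a22 u + b22 v.
    The determinant of left multiplication by x = alpha*u + beta*v is
    A*alpha^2 + B*alpha*beta + C*beta^2; the algebra has no zero divisors
    exactly when this form vanishes only at (0, 0), i.e. when B^2 < 4AC.
    """
    A = a11 * b12 - a12 * b11
    B = a11 * b22 + a21 * b12 - a12 * b21 - a22 * b11
    C = a21 * b22 - a22 * b21
    return B * B < 4 * A * C


# Table 6 with b12 = 1, a22 = 2, b22 = 1 (the coefficients of Example 7).
example7 = is_division_algebra(1, 0, 1, 1, 1, -1, 2, 1)

# Same shape of table but with b12 = 0: u*(u - v) = 0, so zero divisors appear.
degenerate = is_division_algebra(1, 0, 1, 0, 1, 0, 2, 1)

# The complex numbers with u = 1, v = i.
complex_numbers = is_division_algebra(1, 0, 0, 1, 0, 1, -1, 0)

print(example7, degenerate, complex_numbers)
```

For the Example 7 coefficients the quadratic form is $\alpha^2 + 3\alpha\beta + 3\beta^2$, whose discriminant is negative, so the check succeeds; with $b_{12} = 0$ the leading coefficient vanishes and the check fails.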

The classification assigns algorithmically to every 2-dimensional real division algebra a 
canonical multiplication table by successively examining three properties: number of idempotents, 
quasicomplexity, and quasicommutativity. It is noteworthy that for some division algebras there 
exist tables of more than one canonical form. For example, a nonquasicommutative algebra may 
be presented by Table 4 as well as Table 6, provided b,, # b,,. 


6. Concluding Remarks. Although all our results depend upon first locating an idempotent, it 
is easy to generalize Theorem 3 so that it applies directly to Table 1. 


THEOREM 12. An algebra described by Table 1 is a division algebra if and only if


(A, - A,) < 44,43, 


where A, and A, are as before and 


Qyy Dy» 


The tables for a three-idempotent algebra can also be determined without actually finding any
idempotents. A slightly deeper investigation of the rotation described in Theorem 2 yields
functions


F(x) = ayyx? + (ay, — Ay, — by) x* — (ay, + dj, — by) x + by, 
G(x) = ayx? + (ay, — Aa) — Bg,) x? — (ay, + By, — By) x + Dy, 


which can be used to generate table coefficients as follows. Let r;, i = 1,2,3 denote the three real 
roots of g(1, y) = 0. Let 


F(7;) G(r) 
an = 9 V5; = . 
fC, 7) fd, 7;,) 
Then all tables based on idempotents are given by 
u v 
u u r\jUu + 1, ;0 
DO} hut Pr, ;0 D) 


It is also possible to give coordinate-free characterizations of two important concepts in this 
paper. 


PROPOSITION. An algebra $\mathcal{A}$ is not quasicomplex if and only if there exists a real-valued function
$k: \mathcal{A} \to \mathbf{R}$ such that

$$uv + vu = k(v)u$$

for all $v \in \mathcal{A}$. ∎

PROPOSITION. An algebra $\mathcal{A}$ with idempotent u is quasicommutative if and only if

$$(uv)u = u(vu)$$

for all $v \in \mathcal{A}$. ∎


Finally, we note that the discussion preceding Theorem 3 implies that $\mathbf{C}^2$ is never a division
algebra since by the Fundamental Theorem of Algebra, the quadratic form Q will always have
nontrivial zeros. In particular, no multiplication on $\mathbf{C}^2$ will make it the quaternions.


7. Historical Note. The study of linear algebras was initiated by Benjamin Peirce, who by 1870 
had worked out 115 multiplication tables [7]. In 1908 J. H. M. Wedderburn [10, p. 111] gave what 
we believe to be the first example of a 2-dimensional division algebra without identity (although it 
was over the integers modulo 2, not R). In 1954 Segre [8] used Bezout’s Theorem to prove 
Theorem 2 for all finite-dimensional real division algebras. In 1958 Luchian [4] classified 
2-dimensional real algebras with divisors of zero. Thus, this paper together with ours gives a
classification of all 2-dimensional real algebras. In 1960 Markus [5] classified commutative
2-dimensional real algebras to help in the study of quadratic differential equations. Finally, in 
1970 Wallace [9] gave a classification of 2-dimensional power associative real linear algebras. 


The research for this project was supported (in part) by a grant from the Faculty Development Fund of the 
University of Michigan-Flint. 
1. Introduction. As observed by Hardy and Wright [2, p. 300], the problem of characterizing
the natural numbers which are representable by sums of two squares reduces ultimately to proof
of the following classical theorem of Fermat.


THEOREM 1. Any rational prime p of the form 4m + 1 can be expressed as a sum of two squares.


The adjective “rational” is here required to distinguish between the domain of “ordinary”
integers and the domain of Gaussian integers, a setting in which Theorem 1 can be proved.
However, it is certainly desirable to construct proofs of this important theorem which are
independent of the theory of Gaussian integers. Accordingly, we propose in this note to deduce
Theorem 1 as a simple consequence of well-known partition identities. Details of the proof are
supplied in Section 2. In this the leading role is played by the celebrated Gauss-Jacobi triple-product
identity:


$$\prod_{n=1}^{\infty} (1 - x^{2n})(1 + a x^{2n-1})(1 + a^{-1} x^{2n-1}) = \sum_{n=-\infty}^{\infty} a^n x^{n^2}, \tag{1}$$


valid for each pair of complex numbers a, x such that $a \neq 0$ and $|x| < 1$. For an accessible proof
of (1) see this MONTHLY [1, pp. 270-272]. As a matter of fact, we do not require the full force of
(1). Rather, we need the following special case which results from the substitution $a \to -1$: viz.,


$$\prod_{n=1}^{\infty} (1 - x^{2n})(1 - x^{2n-1})^2 = 1 + 2\sum_{n=1}^{\infty} (-1)^n x^{n^2}. \tag{2}$$


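2. Proof of Theorem 1. For the sake of brevity let F(x) denote the right side of (2), then take
the logarithmic derivative and multiply the resulting identity by x to get:

$$\begin{aligned}
x\,\frac{F'(x)}{F(x)} &= x D_x\{\log F(x)\} \\
&= x D_x\Big\{\sum_{n=1}^{\infty} \log(1 - x^{n}) + \sum_{n=1}^{\infty} \log(1 - x^{2n-1})\Big\} \\
&= -\sum_{n=1}^{\infty} \frac{n x^{n}}{1 - x^{n}} - \sum_{n=1}^{\infty} \frac{(2n-1)x^{2n-1}}{1 - x^{2n-1}} \\
&= -\sum_{n=1}^{\infty}\sum_{k=1}^{\infty} n x^{nk} - \sum_{n=1}^{\infty}\sum_{k=1}^{\infty} (2n-1) x^{(2n-1)k}.
\end{aligned}$$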

The following definition naturally simplifies our discussion. 


DEFINITION. For each positive integer n, $\sigma(n)$ denotes the sum of all positive divisors of n, the
nonnegative integer b(n) denotes the exponent of the highest power of 2 dividing n, and $\theta(n)$ is
then defined by the equation $n = 2^{b(n)}\theta(n)$. Hence, $\theta(n)$ is odd. For convenience we define an
auxiliary arithmetical function $\omega$ by the equation $\omega(n) = \sigma(n) + \sigma(\theta(n))$.

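We now state our result, as follows:

$$\Big\{\sum_{n=1}^{\infty} \omega(n) x^{n}\Big\} F(x) = -xF'(x),$$

or equivalently,

$$\Big\{\sum_{n=1}^{\infty} \omega(n) x^{n}\Big\}\Big\{1 + 2\sum_{n=1}^{\infty} (-1)^{n} x^{n^2}\Big\} = -2\sum_{n=1}^{\infty} (-1)^{n} n^{2} x^{n^2}.$$

Expanding the left side of the foregoing identity we have:

$$\sum_{n=1}^{\infty} x^{n}\Big\{\omega(n) + 2\sum_{j \ge 1} (-1)^{j}\omega(n - j^{2})\Big\} = -2\sum_{n=1}^{\infty} (-1)^{n} n^{2} x^{n^2}.$$

On the left side of this identity summation of the inner sum is extended over all values of j for
which $n - j^2$ is positive. Equating coefficients of like powers we then have:

$$\omega(n) + 2\sum_{j \ge 1} (-1)^{j}\omega(n - j^{2}) = \begin{cases} 2(-1)^{r-1} r^{2}, & \text{if } n = r^{2}, \\ 0, & \text{otherwise.} \end{cases}$$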


For n odd, $\omega(n) = \sigma(n) + \sigma(\theta(n)) = 2\sigma(n)$. Hence, for such n we divide the foregoing
recurrence by 2 to get:

$$\sigma(2m+1) - \sum_{j \ge 1} \omega\big(2m+1-(2j-1)^{2}\big) + 2\sum_{j \ge 1} \sigma\big(2m+1-(2j)^{2}\big) = \begin{cases} r^{2}, & \text{if } 2m+1 = r^{2}, \\ 0, & \text{otherwise.} \end{cases} \tag{3}$$

Now let there be given a prime p = 4m + 1, for some integer m. Hence $\sigma(p) = p + 1 = 4m + 2$,
and recurrence (3) becomes:

$$4m + 2 - \sum_{j \ge 1} \omega\big(4m+1-(2j-1)^{2}\big) + 2\sum_{j \ge 1} \sigma\big(4m+1-(2j)^{2}\big) = 0,$$

or equivalently,

$$2m + 1 - \tfrac{1}{2}\sum_{j \ge 1} \omega\big(4m+1-(2j-1)^{2}\big) + \sum_{j \ge 1} \sigma\big(4m+1-(2j)^{2}\big) = 0.$$


Next

$$\omega(n) = \sigma\big(2^{b(n)}\theta(n)\big) + \sigma(\theta(n)) = 2^{b(n)+1}\sigma(\theta(n)),$$

owing to the multiplicativity of $\sigma$. Hence, 2 divides $\omega(n)$; and, for n even, 4 divides $\omega(n)$. From
this it follows that the sum $\sum \sigma(4m + 1 - (2j)^2)$ is odd, whence at least one of the summands, say
$\sigma(4m + 1 - (2k)^2)$, is odd. But, as is well known (and easy to prove), $\sigma(n)$ is odd if and only if n
is a square or twice a square. In the present case we must have: $4m + 1 - (2k)^2 = (2i + 1)^2$, for
some integer i. Thus,

$$p = (2k)^{2} + (2i + 1)^{2}.$$
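
The argument above is constructive enough to be tried on small cases. The following sketch is an illustration only, with hypothetical function names and a naive divisor sum: it checks the recurrence for $\omega$ numerically and then, for a prime p = 4m + 1, searches for an even number 2k such that $\sigma(p - (2k)^2)$ is odd, which by the proof forces $p - (2k)^2$ to be an odd square.

```python
from math import isqrt


def sigma(n):
    """Sum of all positive divisors of n (n >= 1), by trial division."""
    return sum(d + n // d if d != n // d else d
               for d in range(1, isqrt(n) + 1) if n % d == 0)


def theta(n):
    """Odd part of n, i.e. n divided by the highest power of 2 dividing it."""
    while n % 2 == 0:
        n //= 2
    return n


def omega(n):
    """Auxiliary function omega(n) = sigma(n) + sigma(theta(n))."""
    return sigma(n) + sigma(theta(n))


def recurrence_lhs(n):
    """omega(n) + 2 * sum over j >= 1 with n - j^2 > 0 of (-1)^j omega(n - j^2)."""
    total = omega(n)
    j = 1
    while j * j < n:
        total += 2 * (-1) ** j * omega(n - j * j)
        j += 1
    return total


def recurrence_rhs(n):
    """2 (-1)^(r-1) r^2 if n = r^2, and 0 otherwise."""
    r = isqrt(n)
    return 2 * (-1) ** (r - 1) * r * r if r * r == n else 0


def two_squares(p):
    """For a prime p = 4m + 1, return (2k, 2i + 1) with p = (2k)^2 + (2i + 1)^2."""
    for k in range(1, isqrt(p) // 2 + 1):
        rest = p - (2 * k) ** 2
        # rest is odd, so sigma(rest) odd means rest is a perfect square.
        if rest > 0 and sigma(rest) % 2 == 1:
            return 2 * k, isqrt(rest)
    return None


print(all(recurrence_lhs(n) == recurrence_rhs(n) for n in range(1, 300)))
print([(p, two_squares(p)) for p in (5, 13, 17, 29, 37, 41)])
```

The search in `two_squares` mirrors the parity argument: the proof guarantees that at least one odd summand exists, and the code simply returns the first one it meets.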


REMARKS. In [3, pp. 446-448] Uspensky and Heaslet discuss an arithmetical function T which
is related to our function $\omega$ by: $2T(n) = \omega(n)$. These authors, following Liouville, then establish
the desired recursive formula for T; and, as an application of the recurrence they prove Fermat’s
theorem by use of an argument quite similar to ours. However, their discussion depends on a very
lengthy prior discussion of the methods of Liouville, whereas our discussion begins with the
triple-product identity and is thereafter self-contained.
NECESSARY AND SUFFICIENT CONDITIONS FOR OSCILLATIONS


G. LADAS, Y. G. SFICAS* AND I. P. STAVROULAKIS**
Department of Mathematics, University of Rhode Island, Kingston, R.I. 02881 


1. Introduction. Our aim in this paper is to obtain a necessary and sufficient condition under
which all solutions of the retarded differential equation

$$x'(t) + \sum_{i=1}^{n} p_i x(t - \tau_i) = 0, \qquad t \ge 0, \tag{1}$$

oscillate, where $p_i$ are positive numbers and $\tau_i$ nonnegative numbers, i = 1, 2, ..., n.


*On sabbatical leave from the University of Ioannina, Greece.
**On leave from the University of Ioannina, Greece.
Retarded differential equations provide mathematical models for physical systems in which the
rate of change of the system depends not only on its present state, but also on its past history.
Such equations appear in biology, control theory, ecology, economics, number theory, physics, etc.

The Oscillation Theory of retarded differential equations has been extensively developed
during the past few years. See, for example, [1]-[8] and the references cited therein. Of particular
importance has been the study of oscillations which are caused by the retarded arguments. Note
that when all delays $\tau_i$ in Equation (1) are zero, that is, in the case of a first order ordinary
differential equation, the nontrivial solutions are not oscillatory. However, for retarded differential
equations, under appropriate hypotheses, it is possible to have nontrivial oscillatory solutions.
The behavior of solutions is radically altered by the introduction of delays.

It is known that in the case of stability the roots of the characteristic equation determine the 
stability character of the solutions. Our conjecture was that the oscillatory character of the 
solutions was also determined by the roots of the characteristic equation. This conjecture is in fact 
true and the proof requires only elementary tools. 

Our main result is the following theorem. 


THEOREM. Consider the retarded differential equation

$$x'(t) + \sum_{i=1}^{n} p_i x(t - \tau_i) = 0, \qquad t \ge 0, \tag{1}$$

where $p_i$ are positive numbers and $\tau_i$ nonnegative numbers, i = 1, ..., n. Then all solutions of (1)
oscillate if and only if

$$-\lambda + \sum_{i=1}^{n} p_i e^{\lambda \tau_i} > 0 \qquad \text{for all } \lambda > 0. \tag{2}$$

As is customary, a function is said to oscillate if it has arbitrarily large zeros. We shall assume
without loss of generality that $\tau_n = \max\{\tau_1, \ldots, \tau_n\}$; solutions of (1) are then continuous functions
x defined on $[-\tau_n, \infty)$ that satisfy (1). As usual, we shall use the term “eventually” to mean “for
some $T \in (0, \infty)$ and all $t \in [T, \infty)$”.


2. Proof of Main Result. To prove the theorem we need the following lemma. 


LEMMA. If the continuous positive function x defined on $[-\tau, \infty)$ is a solution of the retarded
differential inequality

$$x'(t) + p x(t - \tau) \le 0, \qquad t \ge 0, \tag{3}$$

where p and $\tau$ are positive numbers, then

$$(p\tau/2)^{2}\, x(t - \tau) < x(t), \qquad t \ge 3\tau/2.$$

Proof. For given $s \ge \tau$, we integrate both sides of (3) from s to $s + \tau/2$; using the fact that x is
decreasing on $[0, \infty)$, we find that

$$-x(s) + (p\tau/2)x(s - \tau/2) < x(s + \tau/2) - x(s) + (p\tau/2)x(s - \tau/2) \le 0, \qquad s \ge \tau. \tag{4}$$

For given $t \ge 3\tau/2$ we apply (4) to $s = t - \tau/2$ and to $s = t$, and find

$$x(t - \tau/2) > (p\tau/2)x(t - \tau) \quad\text{and}\quad x(t) > (p\tau/2)x(t - \tau/2), \qquad t \ge 3\tau/2;$$

the assertion follows by combining these inequalities.


Proof of the Theorem. Assume first that (2) does not hold. We may then choose $\lambda_0 > 0$ such
that

$$-\lambda_0 + \sum_{i=1}^{n} p_i e^{\lambda_0 \tau_i} = 0.$$

But then x, defined on $[-\tau_n, \infty)$ by $x(t) = e^{-\lambda_0 t}$, is a nonoscillatory solution of (1).
Assume, conversely, that not all solutions of (1) oscillate. There exists, therefore, a solution that
is eventually positive; by (1), such a solution is eventually decreasing. Since every left-translate of
a solution of (1) is also a solution, we may choose a decreasing positive solution x of (1), and this
shall remain fixed throughout the proof.

When all $\tau_i$ are 0, (2) obviously does not hold. We shall therefore assume that
$\tau_n = \max\{\tau_1, \ldots, \tau_n\} > 0$. We define the set


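$$\Lambda = \{\lambda > 0 \mid x'(t) + \lambda x(t) \le 0 \text{ eventually}\}.$$

From (1) we have

$$x'(t) + p_n x(t - \tau_n) \le 0, \qquad t \ge 0. \tag{5}$$

On the one hand, since x is decreasing, it follows from (5) that

$$x'(t) + p_n x(t) \le 0, \qquad t \ge 0,$$

so that $p_n \in \Lambda$. On the other hand, it follows from (5) by the Lemma that

$$x(t - \tau_n) < \Big(\frac{2}{p_n \tau_n}\Big)^{2} x(t), \qquad t \ge 3\tau_n/2.$$

Since x is decreasing, this implies

$$0 = x'(t) + \sum_{i=1}^{n} p_i x(t - \tau_i) \le x'(t) + \sum_{i=1}^{n} p_i x(t - \tau_n) < x'(t) + \Big(\frac{2}{p_n \tau_n}\Big)^{2} \sum_{i=1}^{n} p_i\, x(t), \qquad t \ge 3\tau_n/2.$$

Therefore $(2/p_n\tau_n)^2 \sum_{i=1}^{n} p_i$ is an upper bound of $\Lambda$. Since $\Lambda$ is nonempty and bounded, we may
set $\lambda_0 = \sup \Lambda$.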
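Let $\lambda \in \Lambda$ be given, and define y on $[-\tau_n, \infty)$ by $y(t) = x(t)e^{\lambda t}$. Then there is a suitable
$T \in (0, \infty)$ such that

$$y'(t) = (x'(t) + \lambda x(t))e^{\lambda t} \le 0, \qquad t \ge T,$$

and therefore y is decreasing on $[T, \infty)$. It follows that

$$\begin{aligned}
0 = x'(t) + \sum_{i=1}^{n} p_i x(t - \tau_i) &= x'(t) + \sum_{i=1}^{n} p_i y(t - \tau_i) e^{-\lambda(t - \tau_i)} \\
&\ge x'(t) + \sum_{i=1}^{n} p_i y(t) e^{-\lambda(t - \tau_i)} = x'(t) + \sum_{i=1}^{n} p_i e^{\lambda \tau_i} x(t), \qquad t \ge T + \tau_n.
\end{aligned}$$

This shows that $\sum_{i=1}^{n} p_i e^{\lambda \tau_i} \in \Lambda$ and therefore $\sum_{i=1}^{n} p_i e^{\lambda \tau_i} \le \lambda_0$. Since $\lambda \in \Lambda$ was arbitrary, we
conclude that

$$\sum_{i=1}^{n} p_i e^{\lambda_0 \tau_i} \le \lambda_0$$

and therefore (2) does not hold.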
The proof of the theorem is complete. 
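
Condition (2) lends itself to a numerical test, since the function $f(\lambda) = -\lambda + \sum p_i e^{\lambda\tau_i}$ is convex and $f(0) = \sum p_i > 0$, so it suffices to examine the minimum of f on $\lambda \ge 0$. The following is a minimal sketch under that observation; the function names are hypothetical, the bisection is a generic numerical device rather than anything used in the paper, and results very close to the borderline case should not be trusted in floating point.

```python
import math


def char_value(lam, p, tau):
    """f(lam) = -lam + sum_i p_i * exp(lam * tau_i), the left side of (2)."""
    return -lam + sum(pi * math.exp(lam * ti) for pi, ti in zip(p, tau))


def all_solutions_oscillate(p, tau):
    """Numerically test condition (2): f(lam) > 0 for all lam > 0."""
    if max(tau) == 0:
        # f(lam) = -lam + sum(p) is negative for large lam, so (2) fails.
        return False

    def slope(lam):
        # Derivative of f; it is increasing in lam because f is convex.
        return -1.0 + sum(pi * ti * math.exp(lam * ti) for pi, ti in zip(p, tau))

    if slope(0.0) >= 0:
        # f is nondecreasing on [0, inf) and f(0) = sum(p) > 0.
        return True

    lo, hi = 0.0, 1.0
    while slope(hi) < 0:          # bracket the minimizer of f
        hi *= 2
    for _ in range(200):          # bisection on the sign of the derivative
        mid = (lo + hi) / 2
        if slope(mid) < 0:
            lo = mid
        else:
            hi = mid
    return char_value(hi, p, tau) > 0


print(all_solutions_oscillate([0.5], [1.0]))   # single delay, p*tau*e > 1
print(all_solutions_oscillate([0.3], [1.0]))   # single delay, p*tau*e < 1
```

For a single delay the minimum of f can be found by hand: it is attained at $\lambda = \tau^{-1}\log(1/(p\tau))$ and equals $(1 + \log(p\tau))/\tau$, so (2) reduces to $p\tau e > 1$, which is what the two sample calls probe.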


Note. After we had derived the above result (see [4]), we discovered that the same theorem was 
also proved by Tramov [8]. An equivalent version of the theorem has also been proved by Hunt 
and Yorke [1]. The three proofs are different. 
$\mathbf{R}^3$ IS THE UNION OF DISJOINT CIRCLES


ANDRZEJ SZULKIN
Department of Mathematics, University of Stockholm, 11385 Stockholm, Sweden 


In a problem seminar held at the Royal Institute of Technology in Stockholm, Professor H. S.
Shapiro asked for a solution of the problem of covering the three-dimensional Euclidean space $\mathbf{R}^3$
by disjoint Jordan curves. (It can be shown that the two-dimensional Euclidean space cannot be
covered in such a way.) In this note we shall construct a family of disjoint circles whose union is
$\mathbf{R}^3$. By a circle we mean a set isometric to $\{(x, y) \in \mathbf{R}^2 : x^2 + y^2 = \rho^2\}$, where $\rho$ is a positive real
number.

Given r > 0, let $S_r = \{(x, y, z) \in \mathbf{R}^3 : x^2 + y^2 + z^2 = r^2\}$. Then

$$\mathbf{R}^3 \setminus \{0\} = \bigcup_{r > 0} S_r.$$

Denote by C the union of the circles

$$\{(x, y, z) \in \mathbf{R}^3 : (x - 4k - 1)^2 + y^2 = 1,\ z = 0\}, \qquad k = 0, \pm 1, \pm 2, \ldots.$$


FIG. 1


One verifies readily that $C \cap S_r$ consists of two points for each r > 0 (see Fig. 1). Hence

$$\mathbf{R}^3 - C = \bigcup_{r > 0} T_r,$$

where

$$T_r = S_r - (C \cap S_r) = S_r - \{\text{two points}\}.$$


Now one can remove a great circle $C_r$ from each $T_r$, r > 0, in such a way that $T_r - C_r = T_r' \cup T_r''$,
where $T_r'$ and $T_r''$ are open hemispheres with one point missing. Thus it remains to cover all $T_r'$ and
$T_r''$ by disjoint circles. The covering can be obtained, e.g., by intersecting each $T_r'$ and $T_r''$ with a
family of planes as indicated by Fig. 2.
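
The claim that each sphere $S_r$ meets C in exactly two points can be checked by direct computation. The sketch below is an illustration with a hypothetical function name: it works in the plane z = 0, where $S_r$ cuts out the circle $x^2 + y^2 = r^2$, and counts its common points with each unit circle centred at (4k + 1, 0), using exact rational arithmetic so that tangency is detected reliably.

```python
from fractions import Fraction


def points_of_C_on_sphere(r):
    """Count the points of C lying on the sphere of radius r > 0 about the origin.

    Subtracting (x - c)^2 + y^2 = 1 from x^2 + y^2 = r^2, with c = 4k + 1,
    gives x = (r^2 + c^2 - 1) / (2c); then y^2 = r^2 - x^2 decides whether
    the two circles meet in 2 points, 1 point (tangency), or not at all.
    """
    r = Fraction(r)
    k_max = int(r) // 4 + 2          # circles further out cannot reach radius r
    total = 0
    for k in range(-k_max, k_max + 1):
        c = 4 * k + 1                # centre of the k-th circle; never zero
        x = (r * r + c * c - 1) / (2 * c)
        y_squared = r * r - x * x
        if y_squared > 0:
            total += 2               # two points, symmetric about the x-axis
        elif y_squared == 0:
            total += 1               # tangency: a single point on the x-axis
    return total


print([points_of_C_on_sphere(Fraction(n, 4)) for n in range(1, 41)])
```

At radii such as r = 2 or r = 4 the sphere is tangent to two neighbouring circles, one point each, which is why the count stays at two there as well.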


THE TEACHING OF MATHEMATICS 


EDITED BY MARY R. WARDROP AND ROBERT F. WARDROP 


Material for this department should be sent to Professor Robert F. Wardrop, Department of Mathematics, Central 
Michigan University, Mount Pleasant, MI 48859.


A WAY OF TEACHING ABSTRACT ALGEBRA 


HAYA FREEDMAN 
Department of Mathematics, LSE, Houghton Street, London, England WC2A 2AE 


“We are informed about everything. We know 
nothing.” 
(S. Bellow, To Jerusalem and Back.) 


The lecture method is the most commonly used in teaching mathematics, in spite of its many 
shortcomings which are widely recognized by mathematicians and educators. 

“A good lecture is usually systematic, complete, precise—and dull; it is a bad teaching 
instrument,” writes P. R. Halmos [1]. “It is simplistic to suppose that people remember what they 
are told and understand the things that are explained to them clearly,” states E. E. Moise [2]. 

This kind of criticism created the impetus to find a new way of teaching mathematics. During 
the last seven years, a method of teaching algebra, which does not depend solely on books and 
lectures, has been developed at this college. The essence of this method is to encourage students to 
play a more active part in their own education, in particular, to let them share responsibility with 
their teacher for selecting, preparing, and presenting the material taught in algebra. The purpose 
of this paper is to describe how this method works. 
The students are introduced to this way of teaching gradually, in three stages. The first year
students have, in addition to a traditional lecture in algebra, an informal seminar, which is done 
on a voluntary basis—no marks are awarded for it. Each participant has to read an extract from a 
paper or a book, or solve an interesting problem and lecture on it to the class. The listeners are 
encouraged to ask as many questions as they wish. 

The second year seminar is a much more formal affair, and a whole term is devoted to it. The 
amount of work required to complete a second year project is considerably greater than that spent 
on a first year seminar—in addition to a couple of lectures, each student has to submit a written 
report, and this time, unlike the first year, marks are awarded for the project. Material for the 
projects is distributed at the end of the first year, so that the students can do some preliminary 
work during their summer vacation, and then complete their projects during term time. Each 
student is given one or two original papers and a list of prerequisites with references to accessible 
literature. After a short introductory talk, the student is packed off to do the work on his own. He 
or she is very welcome to return for help with any problems that may arise, but no advice is ever 
offered unless explicitly requested by the student. Each speaker is allowed three or four hours to 
lecture on his or her project. The student has to decide how to arrange the material: which of the 
proofs to give in full, and which to outline, or omit altogether. The student is also required to 
provide suitable examples and counterexamples. In addition, any material new to the class, or 
even only to some of its members, which is needed as a prerequisite for understanding the lecture, 
has to be explained adequately by the speaker. Following the lecture, each student has a tête-à-tête
discussion with the instructor, in which both the lecture and the written report are scrutinized. 

The third year courses are designed with the following objectives in mind: First, the majority of
lectures are to be prepared and delivered by students. Second, the whole course is constructed
around a few focal points of interest. Third, students are instructed to use suitable original papers
rather than books and lecture notes.

To make these points clearer, a course on abelian groups, given in 1979 at this college, is
described in some detail. Four students took this course. Each student was given a chapter from a 
book specially compiled for this course. Such a chapter included up to four original papers dealing 
with a specific topic in abelian groups, explanatory notes to cover difficult points, and exercises 
designed to fill in gaps in proofs, or to widen the scope of the papers. In addition, it contained an 
introduction and a summary with references for further study. 

The first student was given the task of presenting criteria under which an abelian group is a 
direct sum of cyclic groups. His chapter included the following papers: 


1. A. Kertesz, On the decomposibility of abelian p-groups into the direct sum of cyclic groups, 
Acta Math. Acad. Sci. Hungar., 3 (1952) 121-125. 

2. R. Rado, A proof of the basis theorem for finitely generated abelian groups, J. London 
Math. Soc., 26 (1951) 74-75. 

3. T. Szele, On a theorem of Pontrajagin, Acta Math. Acad. Sci. Hungar., 2 (1951) 121-125. 

4. T. Szele, On direct sums of cyclic groups, Publ. Math. Debrecen, 2 (1952) 76-78. 


On his own initiative he added to his reading list extracts from papers by R. Baer, H. Prüfer
and L. Kulikov. 

The second student lectured on basic subgroups. His chapter included papers by L. Fuchs, P. 
Hill, M. L. Kaloujnine and T. Szele. The other two talks were on pure subgroups and Ulm’s 
theorem. The latter of these proved Ulm’s theorem for countable abelian groups and for totally 
projective groups. 

The four students worked as a group and had a great deal of discussion among themselves. 
Between them they gave about fifteen two-hour lectures and covered in their written projects 
about a dozen original papers. At the end of the academic year each student had to pass a written 
exam based on all of the four projects, excluding some of the more advanced material. For 
example, the syllabus for the written exam included Ulm’s theorem for countable abelian groups
but excluded Walker’s paper on Ulm’s theorem for totally projective groups. 

Each project, whether intended for a beginner or a more advanced student, should be selected 
with the utmost care. The following guidelines are used.


(a) The project must be related to what the student already knows, should know or desires to 
know. 

(b) The educational needs of each student must be taken into consideration. In particular, 
weaker students should be offered easy projects, whereas gifted students should be 
stimulated to develop their intellectual powers and aesthetic perceptions to their full 
capacity. 

(c) One must work with each student individually, and endeavour to answer cheerfully all the 
questions asked, whether it involves a lengthy explanation of a trivial matter, or requires 
the instructor to learn material new to him or her. 

(d) Students must be given freedom to develop their own way of collecting, analyzing and 
evaluating any information they might need for their work. 

(e) Since it is not always easy to forecast how well a student will take to the prescribed
material, each project has to be constructed in two or three sections. The first section
presents a mini project which grows to a more advanced project with the addition of
further sections. In this way it is possible for the student to gain a sense of achievement and
satisfaction even if only a part of the project is accomplished.


Our experience has shown that most students love this method of teaching and that they derive 
a great sense of satisfaction from their work. They tend to achieve better exam results than 
students who are taught in the traditional way. Furthermore, a higher percentage of graduates 
among those who have been exposed to seminars in algebra become interested in taking a higher
degree in mathematics (usually specializing in algebra). Finally, since students are enabled to 
study at different levels, according to their own talents and interests, they become more 
forthcoming, and it is easier for the instructor to assess the potential of each student and identify 
his or her weak points. 

The main disadvantage of this method is that it works effectively for small size classes only, say 
about a dozen students, although it may possibly be adaptable to larger classes. It probably means 
additional work for the teacher, although not as much as might seem, since one obtains a great 
deal of help from the students. For example, one of the first year students discovered, after giving 
his seminar, that he loved teaching, and he organized a special revision class to help some of his 
classmates. On another occasion, a Master of Philosophy student volunteered to supervise the 
project of a second year student. She was convinced that this was a good way of learning new 
material in which she happened to be interested at that time. At the end of the academic year 
students are asked for their comments on the teaching method. Here are a few extracts from their 
essays. 

“First and foremost, the seminar is taught by students; in short, it is a place where discussions 
and group participation are encouraged. Second, the seminar exposes the student to a dozen 
topics that could not possibly be covered in lectures. Whereas the lecture provides one with a
sound algebraic base, the seminar discovers areas in mathematics that are beyond the definitions 
and basic theorems. Third, the situation of students as professor is fruitful not only because the 
transformation of learner into teaching is rewarding (yet demanding) but also because, as teacher, 
the student will learn his or her subject matter like no other way.” (S. Fine, a general course 
student reading a first year algebra course.) 

“I feel that every student should be made to read a number of research papers. This exercise
gives a new perspective on how human knowledge grows, something a text book can rarely do. 
Finally, the most beneficial aspect of the seminar system is that it improves a person’s ability to 
communicate information. In my own case, I learned a great deal on how to present material—what 
items to stress, what details to omit, etc. This was a most rewarding experience, especially since it
increased my confidence in my own abilities.” (M. Khan, reading for a Master of Science Degree.) 

“One of the most important aspects for me is the sense of satisfaction experienced on 
completion of the seminar.” (P. A. Smith, third year.) 

Finally, two comments are in order. First, no special magic attaches to abstract algebra. It was 
chosen for this experiment purely for personal reasons, since it happened to be the author’s field 
of interest. The method could equally be applied to the teaching of other subjects in mathematics. 

Second, it is not necessary that a subject be taught for three years. Experience has shown that 
students who took only first and second year algebra achieved many of the same benefits as those 
who did abstract algebra for three years. 
EDITOR EMERITUS: EMORY P. STARKE. COLLABORATING EDITORS: VINCENT BRUNO, FRANK S. CATER,
GULBANK D. CHAKERIAN, A. M. DAWES, UNDERWOOD DUDLEY, RICHARD A. GIBBS, CLARK GIVENS,
RICHARD M. GRASSL, DOUGLAS A. HENSLEY, ISRAEL N. HERSTEIN, ROBERT H. JOHNSON, ELGIN H.
JOHNSTON, MURRAY S. KLAMKIN, DANIEL J. KLEITMAN, JOSEPH D. E. KONHAUSER, FREDERICK W.
LUTTMANN, MARVIN MARCUS, LOUISE E. MOSER, M. J. PELLING, C. M. REIS, J. O. SHALLIT,
B. L. R. SHAWYER, EDWARD T. H. WANG, AND ALBERT WILANSKY.


Send all proposed problems, typed and in duplicate if possible, to Professor G. L. Alexanderson, Department
of Mathematics, University of Santa Clara, Santa Clara, CA 95053. Please include solutions, relevant references,
etc.
An asterisk (*) indicates that neither the proposer nor the editors supplied a solution. 

Solutions should be sent to the addresses given at the head of each problem set. 

A publishable solution must, above all, be correct. Given correctness, elegance and conciseness are preferred. 
The answer to the problem should appear right at the beginning. If your method yields a more general result, so 
much the better. If you discover that a MONTHLY problem has already been solved in the literature, you should of 
course tell the editors; include a copy of the solution if you can. 


ELEMENTARY PROBLEMS 


Solutions of these Elementary Problems should be mailed in duplicate to Professor G. L. Alexanderson, 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by March 31, 1984. Please 
place the solver’s name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label 
(for acknowledgement).


E 3019. Proposed by Chen-Te Yen, Chung-Yuan Christian University, Taiwan.

Find all solutions to the equations (1) $1 + 3^a = 7^b + 3^c$ and (2) $1 + 5^a = 7^b + 5^c$ in integers a,
b, c.

E 3020. Proposed by Clark Kimberling, University of Evansville. 


Suppose ABC is a nonisosceles triangle. Find three hyperbolas concurrent in a point P such 
that triangles APB, APC, and BPC all have the same perimeter. (*) How does this common 
perimeter compare with that of ABC? 
E 3021. Proposed by Geng-Zhe Chang and Edward T. H. Wang, Wilfrid Laurier University.
Let 


D(x) = y (7ya +x)*(Q—x)"™“. 


Express p, (x) as an explicit function of 1 — x”. 


E 3022. Proposed by John Sadowsky, Columbia, MD. 


Show that, for any a > 0 and any positive integer N, 


yey (% — - (444 k+1 ) 


1+(k-l)a k + am 
E 3023. Proposed by J. G. Mauldon, Amherst College. 


You are given an accurate indicating spring scale (not a balance) and seven coins, each of 
which weighs either x or y, where x and y are unknown. In five weighings, determine the weight of 
each coin. 


E 3024. Proposed by Eugene M. Luks, Bucknell University. 


Let a, b be integers with b > 0. Prove that there are infinitely many positive integers n with the
property that if p is a prime divisor of $n^b + a$, then p is also a divisor of $k^b + a$ for some integer k
with $|k|^b < n$. (The case b = 2, a = 3 was problem B-3 on the 1981 William Lowell Putnam
Competition.)


SOLUTIONS OF ELEMENTARY PROBLEMS 


Primes, Ratios, and Bernoulli Numbers 


E 2890 [1981, 444]. Proposed by Barry J. Powell, Kirkland, Washington. 


Let p > 3 be a prime for which 2p − 1 = q is also prime. Let $B_{2m} = N_{2m}/D_{2m}$ represent the
2m-th Bernoulli number, where $N_{2m}$ is the numerator and $D_{2m}$ is the denominator (in usual
notation). Prove that $|N_{2p-2}| > |N_{2p}|$.

*This suggests that there are infinitely many positive integers m for which $|N_{2m-2}| > |N_{2m}|$.
Can this be proved?


Solution by J. S. Frame, Michigan State University. The primes p and q = 2p — 1 have the 
form p = 6n + 1,qg= 12n+ 1,n>0. Forn =1, 


Ni _ 691 > 
Fe = > 10a”. 


For n> 1, we show that |N,,_ >| > 8981|N,,|, and conjecture that the ratio |(N,,_ )/N2,| is 
unbounded. By the von Staudt-Clausen theorem, the denominator D,,, is the product of the 
distinct primes r such that (r — 1)|2m. Since p + 1 and 2p + 1 are divisible respectively by 2 and 
3, D,, = 6. Also 


D,,-2 = Di, = 2:3-°5-R-p-q=5Rpqd,, 


where 5R is the product of all primes r between 4 and 6n such that (r − 1)|12n. Next, from the
zeta function relation


$$\zeta(2m) = \sum_{n=1}^{\infty} n^{-2m} = \frac{|B_{2m}|(2\pi)^{2m}}{2(2m)!} > \zeta(2m + 2)$$


we conclude that $|B_{2p-2}/B_{2p}| > 2\pi^2/pq$, and hence $|N_{2p-2}/N_{2p}| > 10\pi^2 R$. Since 25 is not
prime, n ≠ 2. For n > 2, we note that (7 · 13)|R, so


$$|N_{2p-2}/N_{2p}| > 910\pi^2 > 8981$$


for p > 7, and 2p − 1 prime. The values of R/91 for the first ten prime pairs p, q corresponding
to n = 1, 3, 5, 6, 13, 16, 23, 26, 33, 35 are 1/91, 1, 11, 19, 53, 17, 47, 53 · 79, 19 · 23 · 37 · 67,
11 · 29 · 31 · 43 · 61 · 71. It appears probable that R and the ratio $|N_{2p-2}/N_{2p}|$ are unbounded.
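
The inequalities in this solution can be checked numerically for the first few prime pairs. The following is a minimal sketch, not part of the published solution: it computes Bernoulli numbers exactly from the standard recurrence and compares numerators. The function names `bernoulli_numbers` and `numerator_ratio` are illustrative choices.

```python
from fractions import Fraction
from math import comb

def bernoulli_numbers(limit):
    """Return [B_0, ..., B_limit] as exact Fractions (convention B_1 = -1/2)."""
    B = [Fraction(1)]
    for m in range(1, limit + 1):
        # Standard recurrence: sum_{k=0}^{m} C(m+1, k) * B_k = 0 for m >= 1.
        B.append(-sum(comb(m + 1, k) * B[k] for k in range(m)) / (m + 1))
    return B

def numerator_ratio(p, B):
    """Return |N_{2p-2}| / |N_{2p}| as an exact Fraction."""
    return Fraction(abs(B[2 * p - 2].numerator), abs(B[2 * p].numerator))

B = bernoulli_numbers(74)      # enough for p = 7, 19, 31, 37
for p in (7, 19, 31, 37):      # n = 1, 3, 5, 6 in the notation p = 6n + 1
    # Columns: p, denominator D_{2p}, ratio |N_{2p-2} / N_{2p}|
    print(p, B[2 * p].denominator, float(numerator_ratio(p, B)))
```

Fractions are stored in lowest terms, so `numerator` and `denominator` are the quantities N and D of the problem.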


Also solved by R. C. Smith and S. I. Smith and the proposer. 


Annular Families 


E 2892 [1981, 444]. Proposed by F. S. Cater, Portland State University. 


Let X be a nonvoid set; let k ≥ 0 be an integer. Characterize the families U of subsets (of X),
card U = $2^k$, satisfying (*) for every A ∈ U there are exactly k sets B ∈ U such that (A \ B) ∪
(B \ A) is a singleton. (Compare E 2792 [1979, 702].)


Solution by Robert Patenaude, California State University, Los Angeles. The families U are
precisely those of the form $U = \{Z : S \subseteq Z \subseteq T\}$ for some two subsets S, T of X with
card(T \ S) = |T \ S| = k. First of all it is clear that any U of the above form satisfies the conditions
of the problem.

Denote, as usual, the symmetric difference (A \ B) ∪ (B \ A) by $A \,\Delta\, B$. If U is any family of
subsets of X and $Y \subseteq X$, put


$$U^Y = \{C \,\Delta\, Y : C \in U\}.$$


It follows that $|U^Y| = |U|$ and $(U^Y)^Y = U$. Thus for any $Y \subseteq X$, $U^Y$ has $2^k$ sets and satisfies (*) if
and only if U does. Fix $Y \in U$ and put $U_i^Y = \{C \in U^Y : |C| = i\}$, so that $\emptyset = Y \,\Delta\, Y \in U^Y$ and
$|U_0^Y| = 1$. For i ≥ 1 each set in $U_i^Y$ contains at most i sets in $U_{i-1}^Y$ and thus it must be contained in
at least k − i sets of $U_{i+1}^Y$. It follows that


$$(k - i)\,|U_i^Y| \le (i + 1)\,|U_{i+1}^Y|$$


and the induction gives $|U_i^Y| \ge \binom{k}{i}$, so


$$|U^Y| \ge \sum_{i \ge 0} |U_i^Y| \ge \sum_{i \ge 0} \binom{k}{i} = 2^k.$$


Since $|U^Y| = 2^k$ this forces each set in $U_i^Y$ to contain exactly i sets in $U_{i-1}^Y$, so every subset of a set
in $U^Y$ is also in $U^Y$. Since $|U_k^Y| = 1$, say $U_k^Y = \{D\}$, we clearly have $U^Y = 2^D$. Now, retrieve


$$U = (U^Y)^Y = (2^D)^Y = \{Z : Y \setminus D \subseteq Z \subseteq Y \cup D\},$$


so $S = Y \setminus D$, $T = Y \cup D$.
This problem is closely connected with E 2795 [1979, 703; 1980, 757] where (in a different
formulation) it is shown that if U satisfies (*), then $|U| \ge 2^k$.
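
For a small ground set the characterization can be confirmed by exhaustive search. The sketch below is an illustration only, assuming subsets of an n-element set are encoded as bitmasks; the helper names are hypothetical and the search is feasible only for small n and k.

```python
from itertools import combinations

def popcount(x):
    return bin(x).count("1")

def satisfies_star(family, k):
    """(*): every A in the family has exactly k members B with |A sym-diff B| = 1."""
    return all(sum(popcount(a ^ b) == 1 for b in family) == k for a in family)

def interval_families(n, k):
    """All families {Z : S <= Z <= T} with |T minus S| = k inside an n-element set."""
    found = set()
    for s in range(1 << n):
        free = [i for i in range(n) if not (s >> i) & 1]
        for extra in combinations(free, k):
            # Z ranges over S together with any subset of the k extra points.
            members = {s | sum(1 << i for i in chosen)
                       for r in range(k + 1) for chosen in combinations(extra, r)}
            found.add(frozenset(members))
    return found

def star_families(n, k):
    """Brute force: all families of 2^k subsets of an n-element set satisfying (*)."""
    return {frozenset(f) for f in combinations(range(1 << n), 1 << k)
            if satisfies_star(f, k)}

print(star_families(3, 2) == interval_families(3, 2))
```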


Also solved by the proposer. 


A Totient Identity 


E 2916 [1981, 763]. Proposed by R. Sivaramakrishnan, University of Calicut, India. 


Let $\varphi_2(r)$ represent the number of integers a, $1 \le a \le r$, with (a, r) = (a + 1, r) = 1. Prove
that, for r, n ≥ 1, the relation


$$\sum \varphi(nr/d^2)\, d\,\mu(d) = \varphi(n/u)\,\varphi(r/u)\,\varphi_2(u)$$

holds. Summation extends over all d, d|(n, r). u is the greatest common square-free unitary
divisor of n and r. (A divisor d of c is called unitary if (d, c/d) = 1.) φ (μ) is the Euler totient
[Möbius function].


Solution by Chun-Nip Lee, student, Massachusetts Institute of Technology. The stated formula
contains a misprint. A correct formula is


$$\sum \varphi(nr/d^2)\, d\,\mu(d) = \varphi(n/u)\,\varphi(r/u)\,\varphi_2(u^2).$$


(Editor’s note. The greatest common square-free unitary divisor (g.c.s.f.u.d.) of p and $p^{\alpha}$,
α ≥ 2, is 1; this is not the same as the greatest square-free unitary divisor of the greatest common
divisor of two numbers.)


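Proof by induction on the number of distinct prime factors of (n, r). If (n, r) = 1, then each side
of the equation equals φ(n)φ(r) by multiplicativity of φ. Now suppose that the formula is true if
(n, r) contains at most k distinct prime factors, k ≥ 0, and that $(n, r) = p^{\gamma}e$, where e contains k
distinct prime factors, γ ≥ 1, and (p, e) = 1. Write $n = p^{\nu}N$ and $r = p^{\rho}R$, with (p, N) = (p, R)
= 1. Let U denote the g.c.s.f.u.d. of N and R and u the g.c.s.f.u.d. of n and r.

Let Σ denote the sum on the left side of the formula. We can express Σ as a sum over the
divisors of e plus a sum over the remaining divisors of pe, since μ(d) = 0 if d is divisible by the
square of a prime. We obtain by multiplicativity


$$\Sigma = \sum_{d \mid p^{\gamma}e} \varphi(p^{\nu+\rho}NR/d^2)\, d\,\mu(d)
= \bigl(\varphi(p^{\nu+\rho}) - p\,\varphi(p^{\nu+\rho-2})\bigr) \sum_{d \mid e} \varphi(NR/d^2)\, d\,\mu(d).$$

The induction hypothesis now gives

$$\Sigma = \bigl(\varphi(p^{\nu+\rho}) - p\,\varphi(p^{\nu+\rho-2})\bigr)\, \varphi\!\left(\frac{N}{U}\right) \varphi\!\left(\frac{R}{U}\right) \varphi_2(U^2).$$

If ν = ρ = 1, then u = pU and

$$\Sigma = (p^2 - 2p)\, \varphi\!\left(\frac{N}{U}\right) \varphi\!\left(\frac{R}{U}\right) \varphi_2(U^2)
= \varphi\!\left(\frac{n}{u}\right) \varphi\!\left(\frac{r}{u}\right) \varphi_2(u^2),$$


since $\varphi_2$ is multiplicative and $\varphi_2(p^2) = p^2 - 2p$.
If one of ν, ρ exceeds 1, then u = U and


$$\Sigma = (p^{\nu+\rho} - 2p^{\nu+\rho-1} + p^{\nu+\rho-2})\, \varphi\!\left(\frac{N}{U}\right) \varphi\!\left(\frac{R}{U}\right) \varphi_2(U^2)
= \varphi(p^{\nu})\,\varphi(p^{\rho})\, \varphi\!\left(\frac{N}{U}\right) \varphi\!\left(\frac{R}{U}\right) \varphi_2(u^2)
= \varphi\!\left(\frac{n}{u}\right) \varphi\!\left(\frac{r}{u}\right) \varphi_2(u^2).$$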

This completes the induction. 
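
The corrected identity lends itself to a brute-force check over small n and r. The sketch below is illustrative only: it uses trial-division factorization and a direct count for the two-consecutive-residues function, and the names `phi2`, `gcsfud`, and `both_sides` are hypothetical. The g.c.s.f.u.d. is taken, in line with the editor's note, as the product of the primes dividing both n and r exactly once.

```python
from math import gcd

def factorize(m):
    """Prime factorization of m >= 1 as {prime: exponent}, by trial division."""
    factors, p = {}, 2
    while p * p <= m:
        while m % p == 0:
            factors[p] = factors.get(p, 0) + 1
            m //= p
        p += 1
    if m > 1:
        factors[m] = factors.get(m, 0) + 1
    return factors

def phi(m):
    """Euler totient."""
    result = m
    for p in factorize(m):
        result -= result // p
    return result

def mobius(m):
    f = factorize(m)
    return 0 if any(e > 1 for e in f.values()) else (-1) ** len(f)

def phi2(m):
    """Number of a in [1, m] with gcd(a, m) = gcd(a + 1, m) = 1 (direct count)."""
    return sum(1 for a in range(1, m + 1) if gcd(a, m) == 1 and gcd(a + 1, m) == 1)

def gcsfud(n, r):
    """Product of the primes that divide both n and r exactly once."""
    fn, fr = factorize(n), factorize(r)
    u = 1
    for p in fn:
        if fn[p] == 1 and fr.get(p) == 1:
            u *= p
    return u

def both_sides(n, r):
    """Return (left side, right side) of the corrected formula."""
    g = gcd(n, r)
    lhs = sum(phi(n * r // (d * d)) * d * mobius(d)
              for d in range(1, g + 1) if g % d == 0)
    u = gcsfud(n, r)
    return lhs, phi(n // u) * phi(r // u) * phi2(u * u)

print(all(lhs == rhs for lhs, rhs in
          (both_sides(n, r) for n in range(1, 31) for r in range(1, 31))))
```

For n = r = 3 the left side is 3, while the formula as originally stated would give $\varphi_2(3) = 1$, which illustrates the misprint.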


ANSWERS TO PHOTOS ON PAGE 624 


Top: A. G. Kurosh (algebra, and in particular, group theory), taken in 1958. Bottom: 
L. Schwartz (analysis, and in particular, distributions) taken in 1969. 
ADVANCED PROBLEMS


Solutions of these Advanced Problems should be mailed in duplicate to Professor G. L. Alexanderson, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by March 31, 1984. The solver’s full 
post-office address should be on each sheet. 
6442. Proposed by Vladimir Naroditsky, San Jose State University. 
Show that if

$$\lim_{t \to \infty} \sum_{k=1}^{N} a_k \exp(i\lambda_k t)$$

exists, then either all λ’s are 0 or all a’s are 0.


6443. Proposed by C. Thron and B. Tomaszewski, University of Wisconsin. 


Let $a_1 \ge a_2 \ge \cdots \ge a_{2n+1} \ge 0$ be a decreasing sequence of positive real numbers and let
$\varepsilon_1, \varepsilon_2, \ldots, \varepsilon_{2n+1}$ be a Bernoulli sequence of independent random variables, i.e., $P(\varepsilon_i = 1) =
P(\varepsilon_i = -1) = \tfrac{1}{2}$. Prove that

2n+ 1 2n+1 


P 


» E;a; 


i=1 


< a, 


» E;a; 


i=] 


tae 


_ | 2n+ | 
0) >t ( n ) 


6444. Proposed by Fred Galvin, University of Kansas. 


A topological space X is functionally countable if every continuous real-valued function defined 
on X has a countable range. Prove or disprove: the product of two functionally countable spaces is 
functionally countable. 


SOLUTIONS OF ADVANCED PROBLEMS 
$\lim_{n \to \infty} |\sin n|^{1/n} = 1$


6379 [1982, 134]. Proposed by Robert Curry and James O. Friel, California State University, 
Fullerton. 


Find $\liminf_{n \to \infty} |\sin n|^{1/n}$.


Solution by Gunnar Berg, Uppsala Universitet, Matematiska Institutionen, Sweden. Let m, n be
positive integers such that m > 1 and $|n - m\pi| \le \pi/2$. Then m < n and, by a theorem of K.
Mahler [“On the approximation of π”, Indag. Math., 15 (1953) 30-42], $|n - m\pi| > m^{-41}$. Hence


$$1 \ge |\sin n| = |\sin(n - m\pi)| \ge \frac{2}{\pi}\,|n - m\pi| > \frac{2}{\pi}\, n^{-41}.$$


It follows that $\lim_{n \to \infty} |\sin n|^{1/n} = 1$.
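
As a rough numerical illustration (floating point only, not a proof), one can evaluate $|\sin n|^{1/n}$ at integers n where $|\sin n|$ is unusually small. The sketch below uses the numerators 22, 355, 103993 of the continued-fraction convergents 22/7, 355/113, 103993/33102 of π; the function name is an illustrative choice.

```python
import math

def root_of_sine(n):
    """Return |sin n|^(1/n), computed in floating point."""
    return abs(math.sin(n)) ** (1.0 / n)

# Numerators of convergents of pi: n is very close to a multiple of pi,
# so |sin n| is small, yet the n-th root is still pulled toward 1.
for n in (22, 355, 103993):
    print(n, abs(math.sin(n)), root_of_sine(n))
```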


Also solved by Michael Golomb, Robert B. Israel (Canada), Kee-Wai Lau (Hong Kong), O. P. Lossers (The 
Netherlands), T. K. Louton & C. C. Rousseau, John M. Masley & John G. Milcetich, Daniel A. Rawsthorne, Joseph 
J. Roseman, and Jose Felipe Voloch (Brazil). 

John M. Masley & John G. Milcetich based their solution on a result of Cijsouw (see p. 95 in Transcendence
Theory: Advances and Applications, edited by A. Baker and D. W. Masser) which states that there is an absolute
positive constant c such that if ξ is any algebraic number of degree N and height H, then


$$|\pi - \xi| > \exp(-cN(1 + N \log N + \log H)(1 + \log N)).$$


They observed that it follows easily from this result that $\lim_{n \to \infty} |\sin n\xi|^{1/n} = 1$ when ξ is any nonzero algebraic
number.
Functions Agreeing Infinitely Often


6380 [1982, 214]. Proposed by Juris Steprans, University of Toronto. 


Let I be a finite ordinal, and let ${}^{\omega}I$ denote the set of functions from ω to I, where ω is the first
infinite ordinal. Is the proposition P: If $F \subseteq {}^{\omega}I$ and $\mathrm{card}(F) < 2^{\aleph_0}$, then there exists $g \in {}^{\omega}I$ such
that for all $f \in F$ there are infinitely many m for which f(m) = g(m), true, false, or independent
of ZFC?


Solution by the proposer. The proposition is true. First note that it suffices to show that if $F \subseteq {}^{\omega}I$
and $\mathrm{card}(F) < 2^{\aleph_0}$, then there is $g \in {}^{\omega}I$ such that for each $f \in F$ there is at least one integer m
such that f(m) = g(m). To see this let $\{A_n : n \in \omega\}$ partition ω into disjoint infinite subsets. Let
$F_n = \{f \restriction A_n : f \in F\}$ and apply the proposition to find $g_n : A_n \to I$ such that for each $f \in F$, there
is at least one $m \in A_n$ such that $f(m) = g_n(m)$. Then let $g = \bigcup\{g_n : n \in \omega\}$. The proposition is
proved as follows. Let D(I) be the discrete topology on I and let $D(I)^{\mathbb{R}}$ have the usual product
topology. It is known (see page 111, General Topology, by R. Engelking, Warsaw, 1977) that
$D(I)^{\mathbb{R}}$ is separable. Hence, let $\{h_k : k \in \omega\}$ enumerate a dense subset of $D(I)^{\mathbb{R}}$.

For $r \in \mathbb{R}$ define $g_r : \omega \to I$ by $g_r(k) = h_k(r)$. Now suppose that F is a counterexample to the
proposition. Then for each $r \in \mathbb{R}$ there is $f \in F$ such that for each $m \in \omega$, $f(m) \ne g_r(m)$. Since
$\mathrm{card}(F) < 2^{\aleph_0}$ it follows from the pigeonhole principle that there are some $f \in F$ and some $X \subseteq \mathbb{R}$
with card(X) = I such that for each $r \in X$ and each $m \in \omega$, $f(m) \ne g_r(m)$. Now let $b : X \to I$ be a
bijection; then $[b] = \{h \in D(I)^{\mathbb{R}} : b \subseteq h\}$ is an open set in $D(I)^{\mathbb{R}}$ and hence there is $k \in \omega$ such
that $h_k \in [b]$ (i.e., $b \subseteq h_k$). Choose $t \in X$ such that b(t) = f(k). Then $g_t(k) = h_k(t) = b(t) =
f(k)$. This is a contradiction.

F. Galvin pointed out that the result has been obtained earlier by J. Baumgartner (unpublished).


Evaluation of an Integral 
6382 [1982, 214]. Proposed by Hing-kam Lam, The Chinese University of Hong Kong. 
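Evaluate $\int_0^1 \ln x \, \ln^2(1 - x)\, dx/x$.
Solution by Sidney Heller, Brookhaven National Laboratory, Upton, New York. Let


$$f(x) = \int_0^x \frac{\ln(1 - t)}{t}\, dt.$$


Then

$$f(x) = -\sum_{n=1}^{\infty} \frac{x^n}{n^2} \quad \text{for } 0 \le x \le 1,$$

and hence

$$\begin{aligned}
\int_0^1 \frac{\ln x \, \ln^2(1 - x)}{x}\, dx
&= -\int_0^1 f(x) \left( \frac{\ln(1 - x)}{x} - \frac{\ln x}{1 - x} \right) dx \\
&= -\int_0^1 f(x) f'(x)\, dx - \int_0^1 \frac{\ln x}{1 - x} \sum_{n=1}^{\infty} \frac{x^n}{n^2}\, dx \\
&= -\frac{1}{2} f^2(1) - \sum_{n=1}^{\infty} \frac{1}{n^2} \int_0^1 \frac{x^n}{1 - x} \ln x \, dx \\
&= -\frac{1}{2} \zeta^2(2) + \sum_{n=1}^{\infty} \frac{1}{n^2} \left( \frac{1}{(n+1)^2} + \frac{1}{(n+2)^2} + \cdots \right) \\
&= -\frac{1}{2} \zeta^2(2) + \sum_{1 \le m < n} \frac{1}{m^2 n^2} \\
&= -\frac{1}{2} \zeta^2(2) + \frac{1}{2} \bigl( \zeta^2(2) - \zeta(4) \bigr) = -\frac{1}{2} \zeta(4) = -\frac{\pi^4}{180}.
\end{aligned}$$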
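
The closed form can be compared against a direct numerical quadrature. The sketch below is only a sanity check, assuming a plain midpoint rule is adequate because the integrand tends to 0 at both endpoints; the step count and the function names are arbitrary illustrative choices.

```python
import math

def integrand(x):
    # ln(x) * ln(1 - x)^2 / x; log1p(-x) evaluates ln(1 - x) accurately near x = 0.
    return math.log(x) * math.log1p(-x) ** 2 / x

def midpoint_rule(f, a, b, steps):
    """Composite midpoint rule; never evaluates f at the endpoints a or b."""
    h = (b - a) / steps
    return h * sum(f(a + (i + 0.5) * h) for i in range(steps))

approx = midpoint_rule(integrand, 0.0, 1.0, 200_000)
exact = -math.pi ** 4 / 180
print(approx, exact)
```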
Also solved by Paul S. Bruckman, Donald Caccia, The Chico Problem Group, C. Georghiou (Greece), M. L.
Glasser, Nathaniel Grossman, A. P. Guinand (Canada), John R. Hatcher, L. Kuipers (Switzerland), O. P. Lossers 
(The Netherlands), T. K. Louton & C. C. Rousseau, L. E. Mattics, William A. Newcomb, Otto G. Ruehr, L. Van 
Hamme (Belgium), John S. White, Doug Wiedemann, Joseph Wiener, and the proposer. A partial solution was 
received from Murray R. Spiegel. 

A number of solvers pointed out that the integral and many like it are evaluated in Polylogarithms and Associated 
Functions (North-Holland, New York, 1981), by Leonard Lewin. 


A Limit Involving Euler’s Totient 


6383 [1982, 278]. Proposed by Eliot Jacobson, University of Arizona, Tucson. 
Let φ(x) denote the totient function, and define recursively $\varphi^k(x) = \varphi(\varphi^{k-1}(x))$. Let

$$A_m(n) = |\{x \le n : \varphi^k(x) = m \text{ for some } k\}|.$$


Show that $\lim_{n \to \infty} A_m(n)/n$ exists for all m > 0, and calculate its value.


Solution by the University of South Alabama Problem Group, Mobile, Alabama. If $m = 2^s$, the
limit is 1; otherwise it is zero. Indeed, if m is divisible by an odd prime and φ(k) = m, then k is
divisible by an odd prime and k has at most as many prime divisors (counting multiplicity) as m.
So, in this case, numbers x such that $\varphi^r(x) = m$ must have a bounded number of prime divisors
and the set of such numbers has natural density zero. (See Niven and Zuckerman, An Introduction
to the Theory of Numbers, 3rd ed., Theorem 11.8.) On the other hand, if $m = 2^s$ and if x has more
than s distinct prime divisors, then $\varphi^r(x) = m$ for some r and the set of such numbers x has
natural density 1.
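
The ratio $A_m(n)/n$ can be tabulated for modest n. The sketch below is illustrative only and makes one assumption not fixed by the problem statement: the exponent k = 0 is allowed, so x = m itself is counted. The function names are hypothetical, and convergence toward the limiting densities is slow, so small n gives only a rough picture.

```python
def totients(limit):
    """Sieve phi(0), ..., phi(limit)."""
    phi = list(range(limit + 1))
    for p in range(2, limit + 1):
        if phi[p] == p:                          # p is prime: phi[p] is still untouched
            for multiple in range(p, limit + 1, p):
                phi[multiple] -= phi[multiple] // p
    return phi

def hitting_fraction(m, n):
    """Fraction of x <= n whose orbit x, phi(x), phi(phi(x)), ... contains m."""
    phi = totients(n)
    hits = [False] * (n + 1)
    for x in range(1, n + 1):
        # phi(x) < x for x > 1, so hits[phi[x]] has already been computed.
        hits[x] = (x == m) or (x > 1 and hits[phi[x]])
    return sum(hits) / n

for m in (2, 3, 4, 6, 8):
    print(m, hitting_fraction(m, 10_000))
```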


Also solved by the proposer. A partial solution was received from Carl Pomerance. 


Numbers of the Form $m^p - n$


6384 [1982, 278]. Proposed by Barry Powell, Kirkland, Washington. 


(a) For any positive integers m and n for which m − n ≠ 1, prove that the set $\{m^p - n\}$ (where
p = 2, 3, 5, 7, ... ranges over the set of primes) contains infinitely many composites.

(b)* Give an infinite set of n such that $(n + 1)^p - n$ is composite infinitely often (p =
2, 3, 5, ...).


Solution by Robert B. Israel, University of British Columbia, Vancouver, B.C., Canada. We must
of course assume that m ≠ 1. Instead of excluding the case m − n = 1, we exclude only the case
m = 2, n = 1 (which yields the Mersenne numbers). Suppose therefore that m > 1, mn > 2 and
that q is a prime factor of mn − 1. If p = k(q − 1) − 1, where k is a positive integer, then
$m(m^p - n) \equiv 1 - mn \equiv 0 \pmod{q}$ and so $m^p - n$ is divisible by q. By Dirichlet’s theorem, there
are infinitely many primes p of this form. Hence $m^p - n$ is composite for infinitely many primes p
when m > 1, mn > 2. In particular, $(n + 1)^p - n$ is composite for infinitely many primes p
whenever n > 1. This deals with both parts of the problem.
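
The construction in this solution is easy to carry out for specific m and n. The sketch below is a minimal illustration with hypothetical function names: it takes q to be the smallest prime factor of mn − 1 (any prime factor would do) and lists the first few primes p with p + 1 divisible by q − 1. Divisibility by q shows compositeness once $m^p - n$ exceeds q.

```python
def is_prime(k):
    if k < 2:
        return False
    d = 2
    while d * d <= k:
        if k % d == 0:
            return False
        d += 1
    return True

def smallest_prime_factor(k):
    d = 2
    while d * d <= k:
        if k % d == 0:
            return d
        d += 1
    return k

def composite_witnesses(m, n, count):
    """Return (q, primes p of the form k(q - 1) - 1); requires m > 1 and m * n > 2."""
    q = smallest_prime_factor(m * n - 1)
    primes, p = [], 2
    while len(primes) < count:
        if is_prime(p) and (p + 1) % (q - 1) == 0:
            primes.append(p)
        p += 1
    return q, primes

q, primes = composite_witnesses(3, 2, 4)
for p in primes:
    # pow(m, p, q) computes m^p mod q without forming the large power.
    print(p, (pow(3, p, q) - 2) % q == 0)
```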


Also solved by Miroslav D. Asic (Yugoslavia), Mihaly Bencze (Romania), Robert Breusch, Hugh M. Edgar, F. 
Gobel (The Netherlands), E. Grosswald, Mark Kantrowitz, Keith A. Kearnes, O. P. Lossers (The Netherlands), 
Aaron Meyerowitz, Peter Schumer, Robert E. Shafer, H. Shank (Canada), Claudia Spiro, Richard Stong, University 
of South Alabama Problem Group, Keith Wayland (Puerto Rico), and the proposer. 


A number of the solvers failed to consider the case m —n = —1. 
A Simple Closed Curve Circumscribing No Rectangle


6385 [1982, 279]. Proposed by Mark D. Meyerson, U.S. Naval Academy, Annapolis, MD. 


Prove or disprove: Every simple closed curve in Euclidean space contains the vertices of a 
rectangle. (It is known to be true in the Euclidean plane.) 


Solution by Miroslav D. Ašić, University of Belgrade, Yugoslavia. The assertion is false. Let L be
the curve in $\mathbb{R}^3$ consisting of four line segments AB, BC, CD, DA where A = (0, 0, 0), B = (0, 0, 1),
C = (1, 0, 0), D = (1, 1, 0). Assume there is a rectangle PQRS with vertices on L. Clearly, no two
vertices of the rectangle can lie on the same (closed) line segment. This means that each (open)
line segment contains exactly one of the points P, Q, R, S. It is easily seen that the consecutive
vertices of the rectangle must lie on consecutive line segments of L; without loss of generality we 
can assume that P,Q, R,S are on AB, BC, CD, DA respectively. But then PQ and QR are 
parallel to AC and BD respectively, which is impossible since AC and BD are not orthogonal. 


Also solved by Benny N. Cheng, Robert B. Israel (Canada), P. B. Kronheimer (England), O. P. Lossers (The 
Netherlands), J. G. Mauldon, Marion W. Orlowski (South Africa), M. Pachter (South Africa), J. Schaer (Canada), 
Allen J. Schwenk (Canada), Paulo Ney Rocha de Souza (Brazil), and the proposer. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER, CARLETON COLLEGE 


Infinite Processes: Background to Analysis. By A. Gardiner. Springer-Verlag, New York, 1982. 
x + 306 pp. 


R. P. BOAS 
Department of Mathematics, Northwestern University, Evanston, IL 60201 


Once upon a time there were an Algebraist and an Analyst who were participating in Hermann 
Weyl’s current-literature seminar. The Algebraist undertook to present a celebrated paper on the 
distribution of the values of meromorphic functions. After a while, being unable to verify the 
transition from a certain sentence to the next one, the Algebraist appealed to the Analyst for help, 
and was provided with a six-page argument for that particular point. The Algebraist, aghast, 
complained, “But in Algebra we prove the theorems.” 

This story brings out the wide individual variation in what is accepted as “rigorous mathe- 
matics.” In this review, I am going to substitute the term “careful mathematics” (since standards 
of rigor are so variable). We should distinguish careful mathematics from heuristic mathematics, 
which is (mistakenly) sometimes considered not to be mathematics at all. All mathematicians use 
heuristic mathematics some of the time, but all serious mathematicians use careful mathematics a 
substantial part of the time. 

For students who hope to become serious mathematicians, there are courses in which they are 
introduced to careful proofs, often for the first time in their lives (now that careful Euclidean 
geometry is no longer a staple of the high school curriculum). As Gardiner remarks (p. 270), the 
actual writing out in detail of the formal proofs is tedious; “what is fun is discovering the 
answer.” This is true, but it does not alter the fact that today’s college students (in the USA, 
anyway) are not much accustomed to careful proofs. 

If there are courses, there will be textbooks. What should these textbooks be like? Gardiner’s 
answer is embodied in this book, which he suggests is useable either before, during, or after the
first course in careful mathematics. If you are going to have to persuade students that they need a 
more precise understanding of the mathematics they thought they knew, and if they have little 
experience with careful mathematics, then you must first persuade them that there is something 
wrong with what they thought they knew; for only then will the new ideas have a chance of 
becoming credible. Gardiner seems to believe that this persuasion is best performed by means of 
history, an approach that should appeal to those students who share the current preoccupation 
with “roots.” Advocates of the historical approach (but not Gardiner) often quote Santayana, 
“Those who cannot remember the past are condemned to repeat it.” [1] This rings true enough for 
statesmen, but the history of mathematics is largely the study of the mistakes of our predecessors, 
which we no more need to repeat than we need to study the abacus after having learned to use a 
computer. Most mathematicians find it more profitable to concentrate on what is correct (or 
presumed to be correct) right now. It is entertaining to learn what Leibniz and John Bernoulli 
thought about log(—1) (p. 262—without references!), but this does not particularly illuminate 
modern complex analysis. Biologists say that ontogeny recapitulates phylogeny, that is, the 
development of the individual parallels the development of the species; but no such principle 
seems to be valid for mathematics. 

In Gardiner’s words, “the whole book is essentially a study of why one must and how one can 
introduce precision into certain infinite processes...” (p. 4). Chapter I is introductory; Chapter II 
deals with numbers, in great detail, from the positive integers to the irrationals. Chapter III
analyzes the ideas of area and volume and also of the length of curves. Chapter III is considerably 
shorter than Chapter II, but the subject has a much shorter history. Both chapters are lucid and 
extremely detailed; so much so that the book gives the impression of “a missionary talking to 
cannibals” (Littlewood’s phrase [2]). American readers need to be alert for occasional Anglicisms, 
however; for example, the phonetic spelling of the name of the letter r as “tor,” final r being silent
in standard British pronunciation (p. 148). 

Chapter IV, on functions, traces the history of “function” from its beginnings down to the now
conventional informal formal definition. The remoter past is fully dealt with, but there is no 
mention of the many alternative ways of thinking about functions, for instance as operators, 
mappings, random variables, fields (in the physicists’ sense), or even as sets of ordered pairs. I 
would have thought that experience with other interpretations of the word would be more helpful 
than examples of the tentative gropings of the past. It does seem that many of the difficulties
associated with “function” could have been avoided if earlier generations had had “the vision to 
give it existence through definition.” [3] 

Now, I am well aware that some people never succeed in grasping the ideas involved in that 
first contact with careful mathematics; and that some others approach the subject with a wariness 
more appropriate to a subject like spherical trigonometry: they can do the problems but they can’t 
see the point. “What can rigorous proofs be used for?” they complain. Gardiner seems to think 
that all students are like these people. He writes (p. v), “I have consistently resisted the temptation 
to believe that the process of understanding these ideas can be magically accelerated by 
confronting the reader with ready-made abstractions.” 

There is, however, a class of students for whom abstractions can be liberating; and my own 
experience persuades me that this class is not empty. Through my sophomore year in college, the 
closest I came to knowing anything about careful mathematics was hearing J. L. Doob (then a 
graduate student) holding forth on the lack of rigor in Osgood’s Calculus. The next year I took 
Walsh’s course in complex analysis; this was quite a careful course (in the sense in which I am 
using “careful”), and sufficiently abstract so that my year’s lecture notes contain no pictures at all 
(my idea, not Walsh’s). This course was a revelation: to say that I fell in love with the subject 
would be quite an accurate description of my reaction. I can speak with assurance only of myself, 
but I have known a fair number of professional mathematicians who started out in much the same 
way. On the whole, we like abstractions, whether ready-made or not. Most of us, however, try to
present them (to students) with introductory examples and illustrations of their applicability. In
short, we try to make new ideas credible by embedding them in more familiar mathematical 
settings, rather than as the culmination of the ideas of the past. 
LETTERS TO THE EDITOR


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, Indiana University, Bloomington, 
IN 47405.


Editor: 


Nowadays, the usual derivation of canonical forms for n × n matrices A over a field F involves
such matters as invariant subspaces and cyclic vectors. Briefly, one constructs from a given matrix
A a certain F[x]-module V. Since F[x] is a principal ideal domain, there is a very good analogy
between F[x]-modules and abelian groups (Z-modules). In particular, the analog of V is a finite
abelian group. Just as a finite abelian group G is characterized by some integers (G is a direct sum 
of cyclic groups and we may take the orders of the summands), so the module V is characterized 
by some polynomials (the invariant factors of A). I like this proof; the analogy is quite beautiful, 
and one can understand what is going on. 

But current proofs have a defect; given a matrix A, they do not indicate how to compute its 
invariant factors. (Note that the minimal polynomial of A is one of them.) Thus, there are two 
articles in the January 1983 issue of the MONTHLY that seem to have overlooked an old theorem. 
(“An algorithmic derivation of the Jordan canonical form,” by Fletcher and Sorensen, and
“An algorithm for the minimal polynomial of a matrix,” by Gelbaum.) The theorem says that if B
is a matrix with (polynomial) entries in F[x], then one can put B in diagonal form
$\mathrm{diag}(g_1(x), g_2(x), \ldots, g_n(x))$, where $g_i(x) \mid g_{i+1}(x)$, using elementary row and column operations.
(In so doing, one needs the Euclidean algorithm for the gcd of two polynomials.) In particular,
this can be done for B = xI − A. The nonconstant $g_i(x)$ are the invariant factors of A, and $g_n(x)$
is the minimal polynomial of A. A purely matrix-theoretic proof of this result can be found, for 
example, in “Introduction to Algebraic Theories,” by A. A. Albert, University of Chicago Press, 
1941. 
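
The invariant factors of a small rational matrix can be computed directly. The sketch below is an illustration under stated assumptions and does not perform the elementary row and column operations described in the letter. It swaps in the equivalent determinantal-divisor characterization: if $d_k$ is the monic gcd of all k × k minors of xI − A (with $d_0 = 1$), then $g_k = d_k/d_{k-1}$. Polynomials are lists of Fraction coefficients, lowest degree first, and all function names are hypothetical. It is suitable only for small matrices, since it expands every minor.

```python
from fractions import Fraction
from itertools import combinations

def trim(p):
    """Drop trailing zero coefficients; the zero polynomial is []."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def add(p, q):
    n = max(len(p), len(q))
    return trim([(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0)
                 for i in range(n)])

def scale(p, c):
    return trim([c * a for a in p])

def mul(p, q):
    if not p or not q:
        return []
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return trim(out)

def divmod_poly(p, q):
    """Polynomial long division of p by a nonzero q: returns (quotient, remainder)."""
    p = trim(p)
    quo = [Fraction(0)] * max(len(p) - len(q) + 1, 0)
    while len(p) >= len(q):
        c, shift = p[-1] / q[-1], len(p) - len(q)
        quo[shift] = c
        p = add(p, scale([Fraction(0)] * shift + q, -c))   # cancel the leading term
    return trim(quo), p

def gcd_poly(a, b):
    """Monic gcd by the Euclidean algorithm; gcd(0, 0) is the zero polynomial []."""
    while b:
        a, b = b, divmod_poly(a, b)[1]
    return scale(a, 1 / a[-1]) if a else []

def det(M):
    """Determinant of a square matrix of polynomials, by expansion along the first row."""
    if len(M) == 1:
        return M[0][0]
    total = []
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total = add(total, scale(mul(entry, det(minor)), (-1) ** j))
    return total

def invariant_factors(A):
    """Return [g_1, ..., g_n] for xI - A, each dividing the next; g_n is the minimal polynomial."""
    n = len(A)
    B = [[trim([Fraction(-A[i][j]), Fraction(int(i == j))]) for j in range(n)]
         for i in range(n)]
    factors, prev = [], [Fraction(1)]
    for k in range(1, n + 1):
        d = []
        for rows in combinations(range(n), k):
            for cols in combinations(range(n), k):
                d = gcd_poly(d, det([[B[r][c] for c in cols] for r in rows]))
        factors.append(divmod_poly(d, prev)[0])
        prev = d
    return factors

print(invariant_factors([[2, 1], [0, 2]]))
print(invariant_factors([[2, 0], [0, 2]]))
```

For the Jordan block [[2, 1], [0, 2]] the only nonconstant factor is $(x - 2)^2$, while for the scalar matrix 2I both factors equal x − 2, so the minimal polynomial is x − 2.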


Joseph Rotman 
Department of Mathematics 
University of Illinois 
Urbana, Illinois 61801 


MISCELLANEA 


114. There exists a passion for understanding just as there exists a passion for music. This 
passion is more common in children, and is usually lost with age. Without this passion there 
would be neither mathematics nor natural science. This passion which has always existed in me 
has never lost its sparkle. 


—Albert Einstein: The Man and his Theories, Souvenir Press, 1961, p. 114. 
sequences and polynomials (e.g. Bernoulli)
RICHARD MCINTOSH 98 

Elementary number theory: Partitions JOHN A 
EWELL 635 

Geometry of numbers: Mean value theorems
MICHAEL E MAYS 677

Multiplicative theory: Asymptotic results on 
arithmetic functions JOHN F HUGHES AND JO 
SHALLIT 468 

Additive theory: Sums of squares JOHN A 
EWELL 635 

Sequences of integers: Special sequences (density, multiplicative, additive and other properties) RICHARD K GUY 35

Sequences of integers: A miscellany of Erdős problems RICHARD K GUY 118


ALGEBRAIC NUMBER THEORY, FIELD 
THEORY AND POLYNOMIALS 


Algebraic number theory: global fields: Analogues in number fields of elementary number theory JAMES T CROSS 518

Algebraic number theory: global fields: Polynomials (irreducibility, etc.) RICHARD K GUY 120


ALGEBRAIC GEOMETRY 


Advanced exposition (research surveys, mono- 
graphs, etc.): Algebra, geometry, and algebraic 
geometry: some interconnections KEITH 
KENDIG 161 
LINEAR AND MULTILINEAR ALGEBRA:
MATRIX THEORY (finite and infinite) 


Advanced exposition (research surveys, monographs, etc.): Cross products of vectors in higher dimensional Euclidean spaces WS MASSEY 697
Linear transformations, semilinear transformations: An algorithm for the minimal polynomial of a matrix BERNARD R GELBAUM 43
Matrix inversion, generalized inverses: A pentad of pointed problems RICHARD K GUY 120
Eigenvalues, singular values, and eigenvectors: 
Matrices with integer entries and integer eigen- 
values J-C RENAUD 202 

Canonical forms, reductions, classification: An 
algorithmic derivation of the Jordan canonical 
form R FLETCHER AND DC SORENSEN 12 
Canonical forms, reductions, classifications: 
Singular value analysis of cryptograms CLEVE 
MOLER AND DONALD MORRISON 78 

Matrices of integers: The slow continued frac- 
tion algorithm via2 x 2 integer matrices HARRY 
APPELGATE AND HIRONORI ONISHI 443 
Matrices of integers: Author vs referee: a case 
history for middle level mathematicians 
ROBERT C THOMPSON 661 

Tricks or treats with the Hilbert matrix 
MAN-DUEN CHOI 301 


NONASSOCIATIVE RINGS AND ALGEBRAS 


General nonassociative rings: Division algebras 
SC ALTHOEN AND LD KUGLER 625 


CATEGORY THEORY, HOMOLOGICAL 
ALGEBRA 


Elementary exposition, textbooks: The Riesz 
representation theorem revisited DONALD G 
HARTIG 277 


GROUP THEORY AND GENERALIZA- 
TIONS 


Foundations: Axiomatics and elementary prop- 
erties WELLS JOHNSON 132 

Permutation groups: Finite automorphism 
groups of algebraic, geometric, or combina- 
torial structures WILLIAM C WATERHOUSE 378 
Structure and classification of infinite or finite 
groups: Extensions, wreath products, and other 
compositions R HIRSHON 312 

Structure and classification of infinite or finite 
groups: An elementary proof of the isomor- 
phism C* = §! L RICHARD DuerFy 201 
Special aspects of infinite or finite groups: 
Generators, relations, and presentations R 
HIRSHON 312 
TOPOLOGICAL GROUPS, LIE GROUPS 


Elementary exposition, textbooks: Very basic 
Lie theory ROGER HOWE 600 


REAL FUNCTIONS 


Functions of one variable: One-variable calcu- 
lus HERBERT E KASUBE 210 BRUCE C BERNDT 
505 

Functions of one variable: Continuity and re- 
lated questions (modulus of continuity, semi- 
continuity, discontinuities, etc.) JA GUTHRIE 
126 

Functions of one variable: Iteration GENG-ZHE 
CHANG AND PHILIP J DAVIS 421 

Functions of one variable: Singular functions, 
Cantor functions, functions with other special 
properties EE POSEY AND JE VAUGHAN 281 
LIU WEN 283 WA COPPEL 456 

Functions of one variable: Fractional deriva- 
tives and integrals BERTRAM Ross 562 
Functions of one variable: Antidifferentiation 
HERBERT E KASUBE 210 H HARUKI AND S 
HARUKI 464 

Functions of one variable: Functions of bound- 
ed variation, generalizations PS BULLEN 560 
Polynomials, rational functions: Another look 
at the Chebyshev polynomials EL ORTIZ AND 
TJ RIVLIN 3 

Miscellaneous topics: Real-analytic functions 
MICHAEL J HOFFMAN AND RICHARD KATZ 557 
Miscellaneous topics: C^∞ functions, quasi-analytic 
functions RAY REDHEFFER 130 MICHAEL J 
HOFFMAN AND RICHARD KATZ 557 


MEASURE AND INTEGRATION 


Classical measure theory: Measurable and non- 
measurable functions, sequences of measurable 
functions, modes of convergence LEE A RUBEL 
AND ARISTOMENIS SISKAKIS 394 

Classical measure theory: Length, area, volume, 
other geometric measure theory WALTER RUDIN 
41 

Classical measure theory: Measure, category, 
and the sums of sets Z KOMINEK 561 
Measures on spaces with additional structure: 
Integration theory via linear functionals (Radon 
measures, Daniell integrals, etc.), represen- 
ting measures DONALD HARTIG 277 JURGEN 
KINDLER 396 

Measures on spaces with additional structure: 
Determining a fair border THEODORE P HILL 
438 


FUNCTIONS OF A COMPLEX VARIABLE 


Geometric function theory: Polynomials Mor- 
RIS MARDEN 267 
Geometric function theory: Conformal map- 
pings of special domains ELGIN JOHNSTON 701 
Geometric function theory: General theory of 
conformal mappings STEVEN G KRANTZ 555 
Miscellaneous topics of analysis in the complex 
domain: Approximation in the complex do- 
main CHARLES BLAIR AND LEE A RUBEL 331 
KENNETH R DAVIDSON 391 


POTENTIAL THEORY 


Two-dimensional theory: Connections with differential 
equations RICHARD K GUY 196 


SPECIAL FUNCTIONS 


Gamma and beta functions: Serendipity in 
mathematics BERTRAM Ross 562 

Elliptic functions and integrals: Of calculations 
past and present: the Archimedean algorithm 
GEORGE MIEL 17 

Orthogonal special functions and polynomials 
(Cebysev, Hermite, Jacobi, Laguerre, etc.) EL 
ORTIZ AND TJ RIVLIN 3 

Other special functions: Serendipity in 
mathematics BERTRAM Ross 562 


ORDINARY DIFFERENTIAL EQUATIONS 


General theory: Solutions in closed form, integration 
by quadratures, reduction of differential 
equations RICHARD K GUY 196 

General theory: Linear equations and systems 
LANCE L LITTLEJOHN 462 
Functional-differential and differential-dif- 
ference equations, with or without deviating (or 
retarded) arguments: Qualitative theory G 
LADAS, YG SFICAS, IP STAVROULAKIS 637 


SEQUENCES, SERIES, SUMMABILITY 


Advanced exposition (research surveys, mono- 
graphs, etc.): Serendipity in mathematics 
BERTRAM ROSS 562 

Convergence and divergence of infinite limiting 
processes: Convergence and divergence of series 
and sequences JOHN KLIPPERT 284 


APPROXIMATIONS AND EXPANSIONS 


Series expansions (e.g., Taylor, Lidstone series, 
but not Fourier series): From center of gravity 
to Bernstein’s theorem RAY REDHEFFER 130 


INTEGRAL TRANSFORMS, OPERATION- 
AL CALCULUS 


Advanced exposition (research surveys, mono- 
graphs, etc.): The quarterly reports of S. 
Ramanujan BRUCE C BERNDT 505 
Historical: The quarterly reports of S. 
Ramanujan BRUCE C BERNDT 505 
FUNCTIONAL ANALYSIS 


Linear function spaces and their duals: Spaces 
of measurable functions (L^p-spaces, Orlicz 
spaces, Kothe function spaces, Lorentz spaces, 
rearrangement invariant spaces, etc.) JUAN L 
ROMERO 203 

Categorical methods: Functors DONALD 
HARTIG 277 


OPERATOR THEORY 


Single linear operators: special classes of opera- 
tors: Tricks or treats with the Hilbert matrix 
MAN-DUEN CHOI 301 


CALCULUS OF VARIATIONS AND OPTI- 
MAL CONTROL: OPTIMIZATION 


Necessary conditions and sufficient conditions 
for optimality: Free problems in one independent 
variable J-B HIRIART-URRUTY 206 


GEOMETRY 


Iterative processes in elementary geometry 
GENG-ZHE CHANG AND PHILIP J DAVIS 421 
Linear incidence geometry: Incidence struc- 
tures imbeddable into projective geometries 
PETER BORWEIN AND MICHAEL EDELSTEIN 389 
Real and complex geometry: hyperbolic and 
elliptic geometries ROGER FENN 87 

Real and complex geometry: geometric con- 
structions ANDRZEJ SZULKIN 640 


CONVEX SETS AND RELATED GEOMET- 
RIC TOPICS 


Convex sets in n dimensions: Applications of a 
result of spherical integration to the theory of 
convex sets KENNETH J FALCONER 690 


DIFFERENTIAL GEOMETRY 


Surfaces in Euclidean space: A pentad of 
pointed problems RICHARD K GUY 120 
Global differential geometry: Submanifolds PJ 
GIBLIN AND JW BRUCE 529 


GENERAL TOPOLOGY 


Generalities: Topological spaces and generalizations 
(closure spaces, etc.) DENIS HIGGS 693 
Spaces with richer structures: Isometries, con- 
tractions, expansions THEMISTOCLES M RASSIAS 
200 


ALGEBRAIC TOPOLOGY 


Classical topics: Fixed points and coincidences 
JAMES W WALKER 466 

Classical topics: A homology version of the 
Borsuk-Ulam theorem JAMES W WALKER 466 
MANIFOLDS AND CELL COMPLEXES 


Low-dimensional topology: Knots and links in 
S^3 GEORGE K FRANCIS 589 


GLOBAL ANALYSIS, ANALYSIS ON 
MANIFOLDS 


General theory of differentiable manifolds: 
Differentiable manifolds, foundations RICHARD 
K GUY 196 

Calculus on manifolds; nonlinear operators: 
Real-valued functions WILLIAM C WATERHOUSE 
378 

Calculus on manifolds: nonlinear operators: 
Singularities of differentiable maps PJ GIBLIN 
AND JW BRUCE 529 

Partial differential equations on manifolds; dif- 
ferential operators: Spectral problems: spec- 
tral geometry: scattering theory RICHARD 
MILLMAN 553 


PROBABILITY THEORY AND STOCHAS- 
TIC PROCESSES 


Elementary exposition, textbooks: From loss of 
memory to Poisson BRUCE R JOHNSON 312 
Distribution theory: Distributions: general the- 
ory BRUCE R JOHNSON 332 

Probability theory on algebraic and topological 
structures: Determining a fair border 
THEODORE P HILL 438 

Geometric probability; stochastic geometry; 
random sets: A pentad of pointed problems 
RICHARD K GUY 120 


STATISTICS 


Elementary exposition, textbooks: A classroom 
note on the sample variance and the second 
moment JJ BIRCH AND TIM ROBERTSON 703 
Parametric inference: Point estimation JJ 
BIRCH AND TIM ROBERTSON 703 

Applications: A statistical analysis of casino 
blackjack MARTIN H MILLMAN 431 


NUMERICAL ANALYSIS 


Numerical approximation: Algorithms for 
functional approximation CHARLES W SCHELIN 
317 

Numerical linear algebra: Direct methods for 
linear systems and matrix inversion MAN-DUEN 
CHOI 301 

Numerical linear algebra: Eigenvalues, eigenvectors 
CLEVE MOLER AND DONALD MORRISON 
78 
68-XX COMPUTER SCIENCE (including AUTOMATA) 

B15 Software: Theory of data (filing, etc.) QUENTIN 
F STOUT AND PATRICIA A WOODWORTH 101 

C05 Metatheory: Algorithms GEORGE MIEL 17 

C25 Metatheory (excluding automata): Computational 
complexity and efficiency of algorithms 
A KEITH AUSTIN 398 JAN MYCIELSKI 129 


70-XX MECHANICS OF PARTICLES AND SYSTEMS 

F05 Dynamics of a system of particles, including 
celestial mechanics: Two-body problem JOHN 
MILNOR 353 


90-XX ECONOMICS, OPERATIONS RESEARCH, 
PROGRAMMING, GAMES 

-01 Elementary exposition, textbooks: The true 
growth rate and the inflation balancing principle 
ROBERT C THOMPSON 207 

A05 Mathematical economics: Decision theory PC 
FISHBURN AND HARRY O POLLAK 366 

A99 Mathematical economics: mathematical methods 
of economics JOEL FRANKLIN 229 

B05 Operations research and management science: 
Logistics, inventory, storage THOMAS JECH 
246 PC FISHBURN AND HARRY O POLLAK 366 

C05 Mathematical programming: Linear programming 
THOMAS JECH 246 

D05 Game theory: 2-person games THOMAS JECH 

D99 Game theory: A statistical analysis of casino 
blackjack MARTIN H MILLMAN 431 


98-XX MATHEMATICAL EDUCATION, COLLEGIATE 

A99 A mock symposium for your calculus class 
DENNIS WILDFOGEL 52 

A99 Mathematics appreciation courses: The report 
of a CUPM PANEL 44 

PROBLEMS AND SOLUTIONS 
PROBLEMS PROPOSED 
Adler Irving 335 Foster LL 566 Montgomery Bruce 213 
Asic MD 334 Galvin Fred 134 338 648 Nadel AM 567 
Askey Richard 709 Glasser ML 55 Naroditsky Vladimir 648 
Aziz Abdul 133 Goldberg M 212 Novinger Phil 334 
Barr Michael 402 Golomb SW 706 Oberlin Daniel 334 
Beck J 134 Graham RL 54 Odom George 482 
Bencze Mihaly 134 Graham SW 706 Pach J 134 
Benke George 60 Halmos PR 289 Popescu CP 483 
Boas RP 60 Hammer FD 483 Powell Barry 60 286 338 
Borwein J 402 Jantzen Chris 482 Rabinowitz Stanley 566 
Brenner JL 289 400 486 Kestelman H 485 706 Robinson Raphael M 60 
Brown Morton 569 Kimberling Clark 212 335 644 Rubel L 289 
Bruckman PS 709 Klamkin MS 54 569 Sadowsky John 645 
Butler Edmund 402 Knuth DE 54 Schelin CW 567 
Chang Geng-zhe 645 Krafft O 400 Schmidt FW 400 569 
Cran Minh Crung 706 Kusner Robert 286 Schwenk AJ 403 
Cuculiere Roger 482 Kuttler JR 401 Selfridge JL 483 
Diamond Harvey 213 Larsen Michael 287 Shafer RE 707 
Dixon Michael J 289 Levine Eugene 567 Shallit JD 335 
Dou Jordi 54 286 Locke SC 212 Sibley TQ 403 
Ehrhart E 54 Luks EM 645 Simion Rodica 400 569 
Erdos P 335 710 Maddux Roger 54 Slater M 400 
Eves H 212 Mauldon JG 645 Starc ZF 212 
Fickett James W 286 Miles J 289 Steele M 338 
Forrester PJ 55 Miller Sanford S 60 Szekely GJ 402 
Thron C 648 
Tomaszewski B 648 706 
Tomescu Ioan 566 
Tsintsifas George 133 


Alex LJ 569 

Asic MD 134 651 
Bager Anders 340 
Berg Gunnar 648 
Bloom DM 337 
Brandler JA 483 
Breusch Robert 213 483 
Browkin J 707 
Butler Edmund 401 
Cameron DE 291 
Cantor DG 61 
Castro BA 712 
Cater FS 339 571 
Cheng Benny 483 
Chernoff PR 488 
Cobb John 64 
Cremona John 489 
Debrunner H 403 
Deutsch Emeric 339 
Dodge CW 338 
Essick J 488 

Evans RJ 391 
Fecchini Alberto 489 
Foster LL 215 483 
Vervaat Wim 60 
Wall CR 400 
Wang ETH 645 
Wetzel John E 287 


PROBLEMS SOLVED 


Gaines FJ 292 
Gentile ER 409 
Gessel Ira 335 485 
Gilmer Robert 483 
Grossman JW 288 711 
Growney JAS 484 
Hanes Kit 570 
Heller Sidney 649 
Henle James 62 
Hickman MG 571 
Horne JG 289 

Hung DT 483 

Israel RB 650 
Jackson DM 290 
Jones Lenny 64 
Kestelman H 215 
Kurtz DC 707 
Lossers OP 490 708 714 
Machover M 135 138 
Maharam D 487 
Maly Jan 488 
Mattics LE 56 568 
Meir A 573 

Miller AW 408 


SOLUTIONS 
Yen Chen-Te 644 
Zempleni Andras 402 


Errata E2974 482 


Newcomb WA 567 
Pelling MJ 55 136 
Pinch RGE 341 
Pinsky MA 711 
Reingold EM 288 
Riddell James 64 
Ruehr OG 411 
Schwenk AJ 58 
Shan Chin-Chi 63 
Singh Sahib 483 
Stanley Richard 61 
Stenger Allen 483 
Steprans Juris 649 
Stone AH 487 
Straus EG 713 
Takacs Lajos 410 710 
Taylor KB 487 
University of South Alabama 
Problem Group 483 650 
Vlasek Zdenek 488 
Wagon Stanley 62 
Wells DM 287 
Williamson AG 213 
Wu PY 412 


Numbers in boldface type refer to problems; those in lightface, to pages. 


S-14 335 
E-2727 55 
E-2763 56 
E-2852 57 
E-2883 483 
E-2890 645 
E-2892 646 
E-2895 287 
E-2897 287 
E-2899 57 
E-2900 134 
E-2901 212 
E-2902 336 
E-2903 483 
E-2904 337 


E-2906 
E-2907 
E-2908 
E-2910 
E-2912 
E-2915 
E-2916 
E-2919 
E-2920 
E-2925 
E-2926 
3887 

5540 


338 6218 
485 6256 
485 6279 
567 6313 
568 6340 
569 6341 
646 6342 
707 6344 
707 6348 
401 6349 
708 6350 
486 6351 
135 6353 
403 6354 
135, 487 6356 


408 
487 
488 
289 
489 
61 
61 
138 
64 
64 
213 
64 
64 
214 
215 


6357 291 
6358 489 
6359 291 
6360 338 
6361 291 
6362 339 
6363 490 
6364 340 
6365 341 
6366 409 
6367 570 
6368 410 
6370 571 
6371 411 


6372 411 
6378 573 6383 650 6387 711 6391 714 
Names of authors are in ordinary type; those of reviewers, in capitals. 


Abelson Harold and diSessa Andrea Turtle Geometry. 
The Computer as a Medium for Exploring Mathe- 
matics GEORGE K FRANCIS 412-415 

Averbach Bonnie and Chein Orin Mathematics: Problem 
Solving through Recreational Mathematics MURRAY 
S KLAMKIN 216-218 

Brewer James W and Smith Martha K (Editors) Emmy 
Noether, A Tribute to Her Life and Work IRVING 
KAPLANSKY 717-718 

Childs Lindsay A Concrete Introduction to Higher Alge- 
bra MICHAEL ROSEN 575-576 

Cohn Harvey Conformal Mappings on Riemann Surfaces 
JAMES A JENKINS 142-144 

Dick Auguste Emmy Noether 1882-1935 IRVING 
KAPLANSKY 717-718 

Doneddu A Topologie. Fonctions reelles d’une variable 
reelle RP BOAS 65-66 

Edwards CH Jr and Penney David E Calculus and 
Analytic Geometry PETER ROSENTHAL 576-579 

Felsager Bjorn Geometry, Particles and Fields CLIFFORD 
HENRY TAUBES 293-294 

Garcia CB and Zangwill WI Pathways to Solutions, 
Fixed Points, and Equilibria WERNER C 
RHEINBOLDT 66-67 

Gardiner A Infinite Processes: Background to Analysis 
RP BOAS 651-653 


Goldstine Herman H A History of the Calculus of Varia- 
tions from the 17th through the 19th Century HELENA 
M PYCIOR 491-495 

Iooss Gerard and Joseph Daniel D Elementary Stability 
and Bifurcation Theory STEPHEN SCHECTER 498-501 

Manin Yu I Mathematics and Physics RO WELLS JR 
415-416 

Nachbin Leopoldo Introduction to Functional Analysis: 
Banach Spaces and Differential Calculus JOE 
DIESTEL 579-580 

Prugovecki Eduard Quantum Mechanics in Hilbert Space 
JOHN CHALLIFOUR 218-220 

Ribenboim Paulo 13 Lectures on Fermat’s Last Theorem 
DAVID R HAYES 341-343 

Richards Stephen P A Number for Your Thoughts: Facts 
and Speculations About Numbers from Euclid to the 
Latest Computers UNDERWOOD DUDLEY 715-717 

Roth Paul Computer Logic, Testing and Verification IR 
ARMSTRONG 494-497 

Seidenberg A (Editor) Studies in Algebraic Geometry 
(MAA Studies in Mathematics) KENNETH R MOUNT 
139-142 

Truesdell C The Tragicomical History of Thermody- 
namics STUART S ANTMAN 343-346 

Wieting Thomas W The Mathematical Theory of Chromatic 
Plane Ornaments MARJORIE SENECHAL 574-575 

Gardiner Cyril F A First Course in Group Theory RONALD 
SOLOMON 720-721 
LETTERS TO THE EDITOR 
Bollinger Richard C 295 King Amy C 502 
Brown David H 722 Niederreiter Harald 581 
Feeman George F 582 Rotman Joseph 653 
Gillman Leonard 417 Scarborough Charles T 417 
Heims SJ 502 Snyder WM 581 
Jamison William H 67 Turner Danny W 144 
MISCELLANEA 
102. Boas RP 296 103. Newcomb Simon 329 
100. Bobo Ray 276 117. Newman Edwin 722 
104. Bowden BV 346 98. Peirce Benjamin 220 
97. Buckley WF 220 110. Prather RE 481 
108. Crux Mathematicorum 436 116. Roberts Michael 675 
114. Einstein Albert 653 96. Russell Bertrand 220 
113. Good IJ 582 99. Sack John 244 
95. Hardy GH 211 106. Smullyan Raymond 390 
90. Harman Gilbert 53 93. Sylvester JJ 125 
107. Heinlein RA 416 109. The Committee 455 
111. Hermite to Stieltjes 501 92. Walker FA 99 
94. Hewitt Edwin 145 91. Wells HG 64 
105. Johnson Samuel 346 96. Whitehead AN 220 
112. Kac Mark 516 110. Wilson RJ 481 
101. Martino JR 280 115. Yang CN 668 
ERRATA AND ADDENDA 


Referring to his joint paper with Charles Blair, “A universal entire function,” which appeared in this MONTHLY, 
5 (1983) 331-332, Lee A. Rubel writes as follows: “Robert Burckel has kindly drawn our attention to the paper by 
Gerald MacLane, ‘Sequences of derivatives and normal families,’ J. Analyse Math., 2 (1952) 72-87 [MR 14, p. 74] 
[Zbl. 49, p. 57]. Among other things, MacLane constructed ‘a ubiquitous entire function’ by the same method we 
use, to have the same ‘universal’ property. However, he also gets estimates on the rate of growth of the function, so 
that his proof is somewhat more complicated than ours.” 
AUTHOR VS. REFEREE: 
A CASE HISTORY FOR MIDDLE LEVEL MATHEMATICIANS 


ROBERT C. THOMPSON 
Mathematics Department, University of California, Santa Barbara, CA 93106 


This note evolved from a referee’s rejection of a research paper that I wrote. The reasoning 
behind the rejection was perhaps unusual, and leads to a not altogether trivial question concerning 
the role of the referee in the professional development of a mathematician. The discussion will be 
more candid than is customary, and this may add spice to the article, since confession of failure, 
or even of sin, is always interesting. 

The first section describes the mathematical problem that was investigated, and the second 
outlines what was proved concerning it and what was not. A research paper was written, and the 
motivation to publish it is given in the third section. The resulting referee’s report is described in 
the fourth section, and in the fifth the somewhat psychological issues it leads to are examined. 
Those readers who submit papers to research journals may wish to reflect on these issues in the 
light of their own experience. The sixth section briefly discusses waiting times, and a few 
concluding remarks are in the seventh. 


1. The Mathematical Question. Consider $n \times n$ matrices having integer entries. If $A$ is such a 
matrix, it is known that there exist unimodular matrices $U$ and $V$ of integers such that 

\[ UAV = \operatorname{diag}(a_1, a_2, \ldots, a_n) \]

is a diagonal matrix with the diagonal elements forming a divisibility chain: 

\[ a_1 \mid a_2 \mid \cdots \mid a_n. \]

(The symbol $\mid$ signifies divides, with the convention that $0 \mid a$ means that $a = 0$. To say that $U$ is 
unimodular means that $\det U = \pm 1$.) This is the diagonal form published by H. J. S. Smith in 
1861 and now universally known as “the Smith canonical form.” The diagonal entries $a_1, \ldots, a_n$ 
are called the invariant factors of $A$; they are unique to within a plus or minus sign. In the sequel 
$a_1, \ldots, a_{n-1}$ will always be taken nonnegative and $a_n$ will be chosen so that 

\[ \det A = a_1 \cdots a_n. \]
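
For small matrices the invariant factors can be computed directly. The sketch below is not taken from the article: it assumes the standard description of invariant factors through determinantal divisors (the k-th invariant factor is d_k / d_{k-1}, where d_k is the greatest common divisor of all k × k minors), and the function names `det` and `invariant_factors` are illustrative only. The sign of the last factor follows the convention just stated, so that the product of the factors equals det A.

```python
from itertools import combinations
from math import gcd


def det(m):
    """Integer determinant by cofactor expansion (adequate for small matrices)."""
    n = len(m)
    if n == 0:
        return 1
    if n == 1:
        return m[0][0]
    total = 0
    for j in range(n):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(minor)
    return total


def invariant_factors(a):
    """Invariant factors a_1 | ... | a_n of a square integer matrix.

    Assumption (standard fact, not from the article): a_k = d_k / d_{k-1},
    where d_k is the gcd of all k x k minors and d_0 = 1.
    """
    n = len(a)
    divisors = [1]  # d_0
    for k in range(1, n + 1):
        g = 0
        for rows in combinations(range(n), k):
            for cols in combinations(range(n), k):
                sub = [[a[i][j] for j in cols] for i in rows]
                g = gcd(g, det(sub))
        divisors.append(g)
    factors = []
    for k in range(1, n + 1):
        prev = divisors[k - 1]
        factors.append(0 if prev == 0 else divisors[k] // prev)
    # Convention from the text: a_1..a_{n-1} nonnegative, a_n carries the sign of det A.
    if n > 0 and det(a) < 0:
        factors[-1] = -factors[-1]
    return factors


print(invariant_factors([[2, 4], [6, 8]]))  # [2, -4]
```

The brute-force enumeration of minors grows quickly with n; it is meant only to make the definition concrete for 2 × 2 and 3 × 3 examples.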


Let $B$ and $C$ also be $n \times n$ matrices of integers, with invariant factors 

\[ b_1 \mid \cdots \mid b_n \ \text{for } B, \qquad b_1 \cdots b_n = \det B, \]
\[ c_1 \mid \cdots \mid c_n \ \text{for } C, \qquad c_1 \cdots c_n = \det C. \]


The issue to be addressed is how invariant factors behave when matrices add. This question is 
significant because a great deal is known about the behavior of invariant factors under matrix 
multiplication, whereas essentially nothing is known about their behavior under addition. Write 


Robert C. Thompson: I received my B.A. and M.A. degrees from the University of British Columbia and a Ph.D. 
from the California Institute of Technology. The thesis supervisors were Marvin Marcus (for the M.A.) and Olga 
Taussky Todd (for the Ph.D.) Most of my postdoctorate academic life has been at the University of California, 
Santa Barbara. My research program is concentrated in linear algebra, usually involving spaces over fields but 
sometimes over rings. I greatly enjoy teaching anything mathematical, and especially if there is an opportunity to 
present the pure mathematician’s outlook to an applied audience. 
$C = A + B$; what does this imply about the $a_i$, $b_i$, $c_i$? 

Since $a_1$ is the greatest common divisor of the elements of $A$ (similarly for $b_1$ and $B$, $c_1$ and $C$), 
an easy argument shows that any common factor of $a_1$ and $b_1$ is also a factor of every $a_i$, $b_i$, $c_i$ 
and of each element of $A$, $B$, $C$. It may therefore be cancelled. Thus generality will not be lost by 
taking $a_1$, $b_1$, $c_1$ pairwise relatively prime. This simplifying assumption will henceforth hold. To 
avoid trivial cases, take a,, b,, c, nonzero. 

By Smith’s theorem, there always exist unimodular matrices $U_1$, $U_2$, $V_1$, $V_2$ such that 

\[ U_1 A V_1 = \operatorname{diag}(a_1, \ldots, a_n), \qquad U_2 B V_2 = \operatorname{diag}(b_1, \ldots, b_n). \]


A consequence of this is the following equation in 2n X 2n matrices: 
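
The displayed equation did not survive in this copy. One identity with the properties described in the surrounding text is sketched below; it is a reconstruction under the stated definitions of $U_1$, $U_2$, $V_1$, $V_2$, not necessarily the form that was printed, and the names $P$ and $Q$ are introduced here only for illustration.

```latex
\[
\begin{pmatrix} C & B \\ B & B \end{pmatrix}
  = P \,\operatorname{diag}(a_1,\ldots,a_n,\,b_1,\ldots,b_n)\, Q,
\]
\[
P = \begin{pmatrix} I & I \\ 0 & I \end{pmatrix}
    \begin{pmatrix} U_1^{-1} & 0 \\ 0 & U_2^{-1} \end{pmatrix},
\qquad
Q = \begin{pmatrix} V_1^{-1} & 0 \\ 0 & V_2^{-1} \end{pmatrix}
    \begin{pmatrix} I & 0 \\ I & I \end{pmatrix}.
\]
```

To check it, note that $\operatorname{diag}(U_1^{-1}, U_2^{-1})\,\operatorname{diag}(a_1,\ldots,b_n)\,\operatorname{diag}(V_1^{-1}, V_2^{-1})$ is the block matrix with $A$ and $B$ on the diagonal, and that multiplying this on the left and right by the two triangular block matrices gives $A + B = C$ in the upper left corner. Both $P$ and $Q$ are products of unimodular matrices and hence unimodular.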


Since the first and last matrices on the right are unimodular, this equation reveals that $C$ is an 
$n \times n$ submatrix of a $2n \times 2n$ matrix having the same invariant factors as the diagonal matrix 
$D = \operatorname{diag}(a_1, \ldots, a_n, b_1, \ldots, b_n)$. Two facts will now be brought to bear: 

(i) It is relatively easy, though not completely trivial, to prove that the invariant factors 
$d_1 \mid \cdots \mid d_{2n}$ of $D$ satisfy 

\[ (a_i, b_j) \mid d_{i+j-1}, \]

where $(\,,)$ signifies the greatest common divisor. The proof is in [6]. 
(ii) It is a somewhat standard fact, though not too well known, that the invariant factors 
$c_1 \mid \cdots \mid c_n$ of $C$ and those $d_1 \mid \cdots \mid d_{2n}$ of the larger matrix containing $C$ satisfy 

\[ d_i \mid c_i \quad \text{for } 1 \le i \le n. \]

These facts imply that 

\[ (a_i, b_j) \mid c_{i+j-1} \tag{1} \]

for all indices $i$, $j$ for which the subscripts lie in the range $1, \ldots, n$. 

The inequalities (= divisibility relations) visible in (1) are very much like a rather well-known 
family of inequalities for the eigenvalues of a sum or product of complex matrices. Example: If 
$A + B = C$, with $A$, $B$, $C$ Hermitian having eigenvalues $\alpha_1 \ge \cdots \ge \alpha_n$, $\beta_1 \ge \cdots \ge \beta_n$, 
$\gamma_1 \ge \cdots \ge \gamma_n$, respectively, then $\gamma_{i+j-1} \le \alpha_i + \beta_j$. This lovely inequality was found by H. Weyl in 
1912 [9]. Extensions of it involving scatterings of terms are now known, the simplest being [5] 

\[ \sum_{s=1}^{m} \gamma_{i_s + j_s - s} \le \sum_{s=1}^{m} \alpha_{i_s} + \sum_{s=1}^{m} \beta_{j_s} \quad \text{if } i_1 < \cdots < i_m,\ j_1 < \cdots < j_m. \tag{2} \]

The indices on the terms here belong to a family of sequences occurring in the representation 
theory of the symmetric group and first found by D. E. Littlewood and A. R. Richardson in 1934 
[2]. An immediate question is whether extensions of (1) along the lines of (2) are valid for the 
invariant factors of integer matrices. It turned out that most conjectured extensions of (1) were 
almost instantly seen to be wrong, and those that weren’t could always be deduced from (1). Thus 
it appears that no further conditions like (2) have to be considered when studying the invariant 
factors of a sum of integer matrices. 

But there is an obvious further condition of quite a different form. Modulo $a_1$, the matrix $A$ 
becomes zero, so that 

\[ \det B \equiv \det C \pmod{a_1}. \]


Observing now that everything so far said may be applied twice more by rewriting $A + B = C$ 
as $A + (-C) = -B$ and as $B + (-C) = -A$, we obtain the following theorem. 


THEOREM 1. The invariant factors of a sum $A + B = C$ of integral matrices satisfy 

\[ (a_i, b_j) \mid c_{i+j-1}, \qquad (a_i, c_j) \mid b_{i+j-1}, \qquad (b_i, c_j) \mid a_{i+j-1}, \]
\[ \det A \equiv (-1)^n \det B \pmod{c_1}, \qquad \det A \equiv \det C \pmod{b_1}, \qquad \det B \equiv \det C \pmod{a_1}. \]


Question. Are the conditions of Theorem 1 sufficient? 


That is, if integers $a_1 \mid \cdots \mid a_n$, $b_1 \mid \cdots \mid b_n$, $c_1 \mid \cdots \mid c_n$ are given satisfying the conditions of 
Theorem 1, can integer matrices $A$, $B$, $C = A + B$ be found having the $a_i$, $b_i$, $c_i$ as their respective 
invariant factors? This was the mathematical problem that I addressed in the research paper 
mentioned earlier. 


2. Partial Answers. If $a_i$, $b_i$, $c_i$ are integers satisfying the conditions of Theorem 1, can integer 
matrices $A + B = C$ be found with these integers as invariant factors? The matrices are to be 
$n \times n$; the following discussion treats each value of $n$ in turn. Recall the assumption made earlier 
that $a_1$, $b_1$, $c_1$ are pairwise relatively prime. 

For $n = 1$, the question has a negative answer: even though $a_1$, $b_1$, $c_1$ satisfy the conditions, 
they need not satisfy $a_1 + b_1 = c_1$. This discouraging result suggests that a reasonably clean 
theorem is unlikely. 
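
The failure for n = 1 is easy to see concretely. The following sketch, which is an illustration and not part of the original paper, encodes the conditions of Theorem 1 for given chains of invariant factors; the function names are hypothetical, and the lists are assumed to follow the sign convention of Section 1, so that the product of each list equals the corresponding determinant.

```python
from math import gcd, prod


def divides(d, x):
    """d | x, with the article's convention that 0 | x means x == 0."""
    return x == 0 if d == 0 else x % d == 0


def chain_condition(u, v, w):
    """Check (u_i, v_j) | w_{i+j-1} for all i, j with i + j - 1 <= n (1-based)."""
    n = len(u)
    # With 0-based i and j the target subscript i + j - 1 (1-based) is index i + j.
    return all(
        divides(gcd(u[i], v[j]), w[i + j])
        for i in range(n)
        for j in range(n)
        if i + j < n
    )


def satisfies_theorem_1(a, b, c):
    """True when the chains a, b, c meet every condition listed in Theorem 1."""
    n = len(a)
    det_a, det_b, det_c = prod(a), prod(b), prod(c)
    return (
        chain_condition(a, b, c)
        and chain_condition(a, c, b)
        and chain_condition(b, c, a)
        and divides(c[0], det_a - (-1) ** n * det_b)
        and divides(b[0], det_a - det_c)
        and divides(a[0], det_b - det_c)
    )


# n = 1: the conditions hold for a_1 = 1, b_1 = 2, c_1 = 1, yet 1 + 2 != 1,
# so no 1 x 1 matrices A + B = C have these invariant factors.
print(satisfies_theorem_1([1], [2], [1]), 1 + 2 == 1)
```

For 1 × 1 matrices the invariant factor is the single entry itself, so the requirement A + B = C is exactly a_1 + b_1 = c_1, which the example violates even though the checker accepts it.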

For $n = 2$, however, the answer is positive: If $a_1 \mid a_2$, $b_1 \mid b_2$, $c_1 \mid c_2$ satisfy the conditions, it is 
possible to construct $2 \times 2$ matrices $A$, $B$, $C = A + B$ of integers having the $a_i$, $b_i$, $c_i$ as invariant 
factors. This encouraging result suggests that a reasonably clean theorem can probably be found. 

The proof in the $2 \times 2$ case is not too hard, but not obvious either. Here are a few details. 
Since $a_1$, $b_1$ are relatively prime, there are integers $r$, $s$ such that $r a_1 + s b_1 = 1$. Write 

\[
A = a_1 \begin{pmatrix} r c_1 & b_1 \\ b_1 z_{21} & r c_2 + b_1 z_{22} \end{pmatrix}, \qquad
B = b_1 \begin{pmatrix} s c_1 & -a_1 \\ -a_1 z_{21} & s c_2 - a_1 z_{22} \end{pmatrix}, \qquad
C = \begin{pmatrix} c_1 & 0 \\ 0 & c_2 \end{pmatrix},
\]

where $z_{21}$ and $z_{22}$ are integers to be chosen so that $A$ has $a_1$, $a_2$ as its invariant factors and $B$ has 
$b_1$, $b_2$. It is almost immediate that only the determinantal constraints $\det A = a_1 a_2$ and $\det B = 
b_1 b_2$ have to be met. Using the conditions from Theorem 1, this can be shown to be possible, after 
a not too long but not altogether straightforward analysis. There are nine conditions, but only 
three are actually needed, since these three imply the remaining six. 
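
A small numerical check makes the shape of this construction visible. The sketch below assumes the 2 × 2 pattern matching the 3 × 3 matrices given later in the text, with free integers z_21 and z_22; the function name and the sample values are illustrative only. It confirms that A + B = C for any choice of the free integers and that the first invariant factors (the entry gcds) are a_1 and b_1. It does not solve the determinantal constraints, which are the hard part.

```python
from functools import reduce
from math import gcd


def two_by_two_family(a1, b1, r, s, c1, c2, z21, z22):
    """Matrices A, B, C of the 2 x 2 construction for given parameters.

    Requires r*a1 + s*b1 == 1; z21 and z22 are the free integers of the text.
    """
    if r * a1 + s * b1 != 1:
        raise ValueError("r and s must satisfy r*a1 + s*b1 == 1")
    A = [[a1 * r * c1, a1 * b1],
         [a1 * b1 * z21, a1 * (r * c2 + b1 * z22)]]
    B = [[b1 * s * c1, -b1 * a1],
         [-b1 * a1 * z21, b1 * (s * c2 - a1 * z22)]]
    C = [[c1, 0], [0, c2]]
    return A, B, C


def entry_gcd(m):
    """Greatest common divisor of all entries, i.e. the first invariant factor."""
    return reduce(gcd, (x for row in m for x in row), 0)


# Sample values (chosen here, not from the article): a1 = 2, b1 = 3, -1*2 + 1*3 = 1.
A, B, C = two_by_two_family(2, 3, -1, 1, 5, 10, 1, 1)
total = [[A[i][j] + B[i][j] for j in range(2)] for i in range(2)]
print(A, B, total == C, entry_gcd(A), entry_gcd(B))
```

The off-diagonal entries cancel because a_1 b_1 z − b_1 a_1 z = 0, and each diagonal entry collapses to (r a_1 + s b_1) c_k = c_k, which is why only the determinants remain to be arranged.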

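For $n = 3$, the answer is again positive, but the proof is much, much longer. There are eighteen 
conditions, and all eighteen are needed, some repeatedly. The technique is similar to that used in 
the $2 \times 2$ case. One takes $C = \operatorname{diag}(c_1, c_2, c_3)$ to be diagonal, 

\[
A = a_1 \begin{pmatrix}
r c_1 & b_1 & b_1 z_{13} \\
b_1 z_{21} & r c_2 + b_1 z_{22} & b_1 z_{23} \\
b_1 z_{31} & b_1 z_{32} & r c_3 + b_1 z_{33}
\end{pmatrix},
\]
\[
B = b_1 \begin{pmatrix}
s c_1 & -a_1 & -a_1 z_{13} \\
-a_1 z_{21} & s c_2 - a_1 z_{22} & -a_1 z_{23} \\
-a_1 z_{31} & -a_1 z_{32} & s c_3 - a_1 z_{33}
\end{pmatrix},
\]
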
where $z_{13}, z_{21}, z_{22}, z_{23}, z_{31}, z_{32}, z_{33}$ are unknown integers to be chosen so that $A$ has $a_1$, $a_2$, $a_3$ as 
invariant factors ($a_1$ is automatic) and $B$ has $b_1$, $b_2$, $b_3$. The task is a difficult diophantine 
problem, and it is not obvious how to proceed. Nevertheless, I found a way of proceeding, in a 
proof that runs for about eleven typed pages. I believe this proof to be a reasonably substantial 
achievement. 

I made some half-hearted attempts to settle the case n = 4, and I failed. I was, frankly, tired of 
the problem after the arduous work needed to settle the $3 \times 3$ case. So, sufficiency of the 
conditions of Theorem 1 is not established for $n \ge 4$. 

A comment is in order concerning the cases n = 2 and n = 3. The proofs just described failed 
to reveal any conceptual framework within which to set the problem, and so reveal few genuine 
clues showing how to proceed to higher dimensions. The arguments are clever, but perhaps only 
ad hoc. The lack of a conceptual framework is very distressing. What is the structural foundation? 
None is known to me. 


3. The Impetus Toward Publication. At this point, I came under review for a salary increase at 
my university, and to justify the increase it was of course necessary to exhibit a lot of new papers. 
(Unfortunately, quality is hard to establish, thus quantity counts in these matters.) I got busy and 
wrote up a dozen papers, selecting the dozen items easiest to write up from a store of unpublished 
results. The invariant factor problem was one of the dozen, and was written up as a manuscript 
entitled Sums of integral matrices. Because of the great difficulty I had experienced constructing 
the 3 × 3 sufficiency proof, I was absolutely confident that the paper would be accepted without 
hesitation. It was submitted to the Rocky Mountain Journal of Mathematics. 


4. The Referee’s Report. The referee’s report was short, and its contents were quite unex- 
pected. Here it is (with a journal reference deleted): 


I suggest not publishing the enclosed paper of R. C. Thompson in the Rocky Mountain Journal of 
Mathematics. 

The problem he is looking at is: To what extent do the invariant factors of integral matrices A and B 
determine those of A + B? The problem is moderately interesting. But he hasn’t solved it: He has only done 
the 2 by 2 and 3 by 3 cases. The computation that decides these rather special cases is a real tour-de-force, 
but doesn’t seem to uncover any interesting ideas. 

Thompson has done some very interesting work on integral matrices (e.g., his paper on interlacing 
inequalities for invariant factors). Basically, I feel someone as good as him should not try to publish such a 
partial, unilluminating result. 


Evidently the referee read the paper in detail, and understood it thoroughly. I interpret his 
report in this way: “‘Thompson is capable of solving the full problem, and he should get busy and 
do so!” 

The next section, which should be regarded as the heart of this note, will discuss the referee’s 
report from various points of view. Everything so far has merely been to set the background for 
the discussion. How do you respond to the issues about to be raised? 


5. Discussion. 


Disclaimer. Both the referee and I may well have missed the point: there may be a more or less 
easy solution for the invariant factor problem. The reader is challenged to find one. If one is 
found, the following discussion may lose its punch, but this note will still have merit since it 
provided the stimulus for the reader’s creative effort. 

To continue, however, assume that no easy solution is possible, so that the results in hand are a 
nontrivial partial solution to a difficult problem. 


Criteria for publishability: are there any generally accepted ones? They would appear to be 
enough depth or breadth, plus a write-up possessing sufficient clarity and polish. Originality is a 
key item, and great originality will offset a presentation lacking clarity or polish. 


Responsibilities of an author: what are they? They are to ensure that sufficient originality is 
present, that the manuscript is technically accurate, that it is carefully prepared, and that 
incomplete partial results are not offered when more can be achieved with a not unreasonable 
additional effort. 

A side comment: Every mathematician serving in an editorial capacity soon learns that many 
prospective authors fail to discharge these responsibilities adequately; some fail grossly. Premature 
submission of partial results is a common sin, and inadequate attention to the technical aspects is 
another. (Are the proofs correct? Has the manuscript been properly proofread? Is the grammar 
sound?) Unreadable manuscripts are all too common. Some writers seem to feel that referees are 
servants, to take care of details that were overlooked. No judgment could be more mistaken. 


Responsibilities of a referee: what are they? They are to certify that the generally accepted 
standards are met, to suggest improvements, and to act as a stimulus to cajole an author into a 
further creative effort. A referee should be able to recognize the worth of mathematical styles and 
attitudes differing from his own. No referee, however, has to be an author’s servant, correcting 
shortcomings caused by carelessness, immediate rejection being the proper action in such cases. If, 
however, the referee is overworked, it is sound procedure to rely on an author’s reputation (if any) 
for quality and accuracy. This does not mean that an established mathematician should expect 
publication of his second class results. And every referee should be willing to give a helping hand 
to a novice writer. 

(The harsh sentences in the two paragraphs above are caused by this author’s experience as a 
journal editor.) 

Consider now the referee’s reports in the light of the headings above. Because my manuscript 
was prepared with extreme care, the quality of the presentation and the technical aspects were 
probably adequate. I shall assume this to be the case since the referee made no comment to 
suggest otherwise. I now pose, and partially answer, five somewhat psychological questions. 


Question 1. Should a nontrivial partial solution of a difficult problem be published? 


Many mathematicians would say “yes,” but some probably would say “no.” What is your 
opinion? The referee seemed to feel that the answer is “no,” but perhaps he was meeting his duty 
of cajoling the author into a greater creative effort. In general, I feel that a partial solution should 
be published if lack of publication would risk loss of a nontrivial piece of work. The whole 
purpose of our professional existence is to get problems solved, and if a partial solution will 
contribute significantly to an ultimate complete solution that may happen many years later, then 
the permanent record created by publication is justified. 

(Rule out publishing simply to create a long publication list: that’s not unknown, and probably 
unprofessional, even though sometimes dictated by university promotion policies. I have at times 
been guilty of the sin. Are you also guilty?) 


Question 2. Should the lack of a structural basis affect the merit of the results so far obtained? 


The referee appeared to feel that this was the case. But mathematics is full of examples of 
difficult questions for which no structural framework has been found. Does this lack render the 
questions less interesting? Many mathematicians, those who are problem solvers rather than 
theory developers, would say that the lack of a structural setting adds to the appeal. The 
absolutely pure battle between mathematician and nature, without the corrupting influence of a 
lot of distracting structure, is surely the highest form of intellectual activity. So the referee appears 
to be on shaky ground with this aspect of his analysis. It is true that the proper conceptual 
framework might make the problem solvable, or at least accessible (as is the case for the question 
of how invariant factors behave when matrices multiply, a question leading to Young tableaux 
and Littlewood-Richardson sequences), but it might also reveal it to be difficult. What is your 
opinion on this issue? 


Question 3. Was the attempted publication premature since only low dimensional cases were 
solved? 


666 ROBERT C. THOMPSON [December 


The referee probably felt that it was, and it might have been. However, I was willing to publish 
a partial result and abandon the invariant factor problem because I was attracted to some other 
equally difficult problems—see below. 

Do you sometimes abandon a problem with only a partial solution because something else 
becomes more interesting, or because you feel unable to achieve more? If so, what do you do with 
your partial result? 

(One response, which I quite deliberately adopted in another case when three years of 
intermittent effort yielded only a partial result, is to submit a manuscript containing conjectures 
and those facts that can be proved, hoping not for an acceptance but for a referee’s report 
containing a significant idea. It is permissible to use the referee’s talents, provided his contribu- 
tions are acknowledged. If he cannot supply an idea, a case for publication may already be 
established. Can you guess how this gambit worked for me? Answer below.*) 


Question 4. A referee might suggest that the author solve a problem in full, but the author has 
other problems that he wishes to attack, problems that he finds more attractive. Which course of 
action should the author follow? 


This question could be rephrased as: “How much influence should a referee have in the 
professional development of a mathematician?” In the present instance, the dilemma is partly 
attributable to my formulation of the following conjecture, a conjecture that is probably deep and 
certainly very structural. 


CONJECTURE. If A and B are Hermitian matrices, there will always exist unitary matrices U and 
V so that 


(3) $e^{iA} e^{iB} = e^{i(UAU^{-1} + VBV^{-1})}$

where, of course, $i = \sqrt{-1}$.


The conjecture belongs to the interface between Lie groups and Lie algebras and can be 
formulated in Lie-theoretic terms. I have many partial results concerning it. The Campbell-Baker-Hausdorff 
formula $e^x e^y = e^z$ (where z is a formal series in commutators of x and y) of course 
plays a role. The conjecture can also be formulated for non-Hermitian matrices A, B, but then it 
can only hold for A, B in a neighborhood of 0. I have not published my partial results on this class 
of questions because I hope (perhaps unrealistically!) to produce a complete solution. But one 
thing is clear: if I work more on conjecture (3), I will have to commit all my resources to it, and 
cannot work on the invariant factor problem. It’s one or the other, not both, since I am 
(unfortunately) only an average mathematician, far, far below the leaders. Should I be influenced 
by the referee’s wishes that the invariant factor problem be solved in full? 

(It is possible that a reader of this article who is expert in Lie theory may resolve conjecture (3). 
If that happens, I shall return to the invariant factor problem, unless it too is settled by a reader!) 


Question 5. Will an author lose prestige in the eyes of a referee if a rejected paper is submitted 
to another journal without change? If so, should the author be influenced by this fact? 


That’s a tough issue, the possible loss of prestige being the only reason that I have not 
resubmitted Sums of integral matrices. In fact, I have an invitation from another journal to publish 
it there, in its present form, and if this were not the case, I could publish it in the journal I edit, 
Linear and Multilinear Algebra. But I have done nothing. What would you do in this situation? 


* Answer. The paper, [8], contained a conjecture having every appearance of being solvable in a few lines. The 
referee took 14 months to prepare a very accurate report, recommending acceptance, but not furnishing the hoped 
for idea. The journal’s editor, noting the referee’s observation that the conjecture ought to have a short solution, 
chose to overrule the recommendation and rejected the paper. It remains unpublished and the conjecture unsolved. 
Clearly, the editor may be a major factor in the author-referee configuration. 
The note you are presently reading, Author vs. referee, is substantially based on a lecture that 
I gave at the matrix conference run by S. Pierce at the University of Toronto, August, 1982. Some 
members of the audience felt that the paper should immediately be resubmitted, but others could 
understand my reluctance to do so. 

There may be other questions that arise naturally. Do any occur to you? Here is one: The 
world of mathematics is populated mainly by average mathematicians, since the very best of us 
are so few. Journals obviously cater to the very best. To what extent should they cater to the 
average? One may argue that civilized society needs mathematicians, and that the middle level 
mathematician undoubtedly is a better contributor to society if journals exist that will publish his 
middle level theorems. What is your opinion? 


6. Waiting Times. How long should an author wait before expecting to receive a referee’s 
report? Many authors expect a report within a few months, and some expect one sooner. A clearly 
written paper almost always gets a report more quickly than a badly written one, and to this 
extent an author can do a lot to help his own case. My experience with papers that I have 
submitted to journals is: minimum waiting time until an editorial decision is reached, a few days; 
maximum, four years; typically, a few months. The four-year wait led to an acceptance, probably 
attributable to the referee’s sense of guilt. To this extent, it may pay to hope for a referee who is 
not speedy. 

A fair rule of thumb seems to be: a referee is entitled to six months in which to prepare his 
report, but not longer. This means: an author should not start to complain until six months have 
elapsed. The six month value is somewhat arbitrarily chosen. What do you think the correct figure 
should be? 

Some mathematicians are very conscientious about meeting their obligations as referees. Others 
are extremely remiss. It really seems unfair that the latter cannot be somehow penalized. Is there 
an imaginative journal editor somewhere who can invent the appropriate punishment? 

How long did the referee of Sums of integral matrices take? About six weeks, an excellent 
performance in view of the difficult 3 × 3 proof. 


7. Some Comments. 


(i) The invariant factor theorems attributed to me in the referee’s report were also obtained by 
E. Marquis de Sa [3], and certain parts of them can be deduced from facts in Bourbaki. The 
referee should have known that! (Perhaps he did.) 

(ii) The question of the similarity invariant factors of a sum of matrices with entries in a field 
has been studied in [1] with further results in the doctoral thesis of E. Marquis de Sa. 

(iii) Did I get my salary increase? I did, and fortunately it was granted before the rejection was 
received, making it unnecessary to jeopardize the increase by having to remove an item from my 
publication list. (Are you as lucky?) 

(iv) Of the dozen papers mentioned above, how many were accepted? Obviously not all, and 
the exact number of acceptances will be left for the reader to guess. It is true, though, that some of 
the accepted papers were weaker than the rejected one discussed in this article. This seems to 
imply that not all referees adhere to the same standards. Should an author resubmit a rejected 
paper, hoping for an “easier” referee? The tactic is surely not unknown. Does it conform to 
established standards of professional conduct? Probably not. A much better ploy is to make the 
paper stronger, revising as necessary, before resubmitting. However, an author is sometimes (not 
always!) a better judge of the mathematical situation than the referee: If the contents of a rejected 
paper form a step in a well conceived campaign toward a substantial objective, resubmission 
without change may be justified. 

The lack of a universal standard for evaluating mathematical papers was visible in the opinions 
expressed by the four referees of the article before you, Author vs. referee. The report given me by 
the MONTHLY’s editor suggests that of the positions taken by the four about Sums of integral 
matrices, one supported me, one supported the referee, one could see both sides of the issue, and 
the position of the fourth was not disclosed. So much for unanimity! The four did not agree on the 
merit of Author vs. referee either. 

Sometimes a referee’s judgement is later seen to be wrong. Now and then I am sent 
manuscripts to appraise, and I do reject some. Two that I rejected were later published without 
change in other journals, and in retrospect I think both authors took the correct path. Mistakes do 
occur! 

(v) It is to be emphasized that I bear no grudge against the referee for rejecting my paper. 
After many years as a professional mathematician, I have learned to accept such twists of fate 
stoically. A major point is that the rejection has created uncertainty in my mind: should I 
continue with the invariant factor problem, or abandon it for pastures that I presently perceive to 
be more attractive? (Can you be similarly unmoved when one of your papers is rejected? The 
correct, professional response following a rejection is to examine objectively the reasons behind 
it.) 

(vi) No statistics seem to be available on journal rejection rates. An informed guess is that 
most journals reject roughly 40% of submitted manuscripts, and some (including the Monthly) 
much more. Do you think too many manuscripts are rejected? Or too few? 

(vii) I believe this article to be in good taste, even though rather candid, addressing an 
annoying fact of life with which most middle level mathematicians must contend. Do you agree? 
Or should it have been rejected? Readers may send comments to me. 


Summary. The interface between author and referee is an uneasy one. 
MISCELLANEA 
115. 


To see ourselves as others see us? 


“If you try to understand fibre bundles by reading mathematics, if you are a 
physicist, you would probably not succeed, because modern mathematics is 
extremely difficult to read, and I believe there exist only two kinds of modern 
mathematics books; one which you cannot read beyond the first page and one 
which you cannot read beyond the first sentence.” 


—C.N. Yang, Lectures on Frontiers in Physics, Seoul, Korea, 1980. 


LARGE NUMBERS AND UNPROVABLE THEOREMS 


JOEL SPENCER 
Department of Mathematics, State University of New York, Stony Brook, NY 11794 


“Yes, please,” said Milo. “Can you show me the biggest number there is?” 
“I’d be delighted,” [the Mathemagician] replied, opening one of the closet doors. “We keep it right here. 
It took four miners just to dig it out.” 


Inside was the biggest 3 Milo had ever seen. It was fully twice as high as the Mathemagician. 
—The Phantom Tollbooth 


Norton Juster 


1. Large Numbers. “Describe, on a 3 × 5 card, as large a positive integer as you can.” 

Many mathematicians have at some time played the game above, either solitaire or in 
competition. My solutions in the second, sixth, and twelfth grades, respectively, are shown in Figs. 
1, 2, 3. 


[Fig. 1 card: contents not recoverable from this copy] WARP 0 


FIG. 1. 


1,000,000,000,000,000, 
000,000,000,000,000 WARP 1 


FIG. 2. 


One Googolplexplexplexplex- 
plexplexplexplexplexplex- 
plexplexplexplexplexplex WARP 2 


FIG. 3. 


See this MONTHLY, 90 (1983)365, for the author’s biography. 
The last needs a word of explanation. Since googol is $10^{100}$ and googolplex is $10^{\text{googol}}$, let us define 
$N$plex as $10^N$. Actually, by twelfth grade I could write “One googolplexplexplex...with a googol 
plexes” and even some more elaborate variants. These were at best WARP 2.2. The next level is 
shown in Fig. 4. 


Let $f_1(x) = 2x$ 
and $f_{n+1}(x) = f_n^{(x)}(1)$ WARP 3 

$f_9(9)$ 


FIG. 4. 


Here $f^{(x)}$ represents the $x$th iterate of $f$. Iterated doubling is exponentiation, $f_2(x) = 2^x$. Iterated 
exponentiation is the tower function, 

$f_3(x) = 2^{2^{\cdot^{\cdot^{\cdot^{2}}}}}$ with $x$ 2’s. 

My WARP 2 solution is approximately $f_3(21)$, one for each plex and five to get to a googol. There 
is no word for $f_4$. $f_4(4) = f_3(f_3(f_3(f_3(1)))) = f_3(f_3(4)) = f_3(65536)$ is already WARP 2.1. 
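
The first few levels of this hierarchy are small enough to compute directly. The following is a minimal Python sketch of the rule on the Fig. 4 card; the helper names `iterate` and `f` are illustrative choices, not part of the original text, and only tiny arguments are feasible.

```python
def iterate(func, times, start):
    """Apply func to start the given number of times."""
    value = start
    for _ in range(times):
        value = func(value)
    return value


def f(n, x):
    """f_1(x) = 2x and f_{n+1}(x) = f_n^{(x)}(1), the x-th iterate of f_n at 1."""
    if n == 1:
        return 2 * x
    return iterate(lambda y: f(n - 1, y), x, 1)


print(f(2, 10))  # doubling iterated ten times, starting from 1
print(f(3, 4))   # a tower of four 2's
print(f(4, 3))   # f_3(f_3(f_3(1)))
```

Already `f(4, 4)` would require evaluating $f_3(65536)$, which is far out of reach for any direct computation.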

Three ideas help us create large numbers. First, we concentrate on constructing rapidly growing 
functions. The numbers will then be the value of the function f(x) for some reasonably small x. 
Second, we use iteration to build a larger function from a given one. Third, we introduce 
diagonalization. Having defined the functions $f_n$ above, we define a diagonal function, called $f_\omega$, by 

$f_\omega(n) = f_n(n)$. 


This is called the Ackermann function. (There are several similar formulations.) The Ackermann 
function does occasionally appear in “real” mathematics. For example, van der Waerden 
proved in 1927 that for all $n$ there exists $W(n)$ such that if the integers from 1 to $W(n)$ are divided 
into two classes, then there exists an arithmetic progression of length $n$ in one of the classes. His 
proof gave a $W(n)$ roughly equal to $f_\omega(n)$. (It is possible that far smaller $W(n)$, even of 
exponential order, will suffice and this remains an open problem.) 

Once $f_\omega(n)$ is defined, there is no reason to stop. We define a new function, let’s call it $f_{\omega+1}$, by 
$f_{\omega+1}(n) = f_\omega^{(n)}(1)$. Having defined $f_{\omega+1}$, we may define $f_{\omega+2}, f_{\omega+3}, \ldots$. When faced with ellipses 
we resort to diagonalization. We define a new function, called $f_{2\omega}$, by $f_{2\omega}(n) = f_{\omega+n}(n)$. (See Fig. 
5.) 


$f_{2\omega}(9)$ WARP 3.2 


FIG. 5. 


We are defining here a hierarchy of functions in which each function has an immediate 
successor and where limit functions are defined by diagonalization of an appropriate subsequence. 
The usual representation for ordinal numbers provides a perfect framework in which to do this. 
The ordinals $\alpha < \omega^\omega$ have a simple representation. Each such $\alpha$ may be uniquely written 

$\alpha = a_1\omega^{s_1} + a_2\omega^{s_2} + \cdots + a_r\omega^{s_r}$ $(\omega > s_1 > s_2 > \cdots > s_r \ge 0)$ 

where the $a_i$ are positive integers. (We write $a\omega^s$ instead of the more customary $\omega^s a$ for 
convenience of expression.) The limit ordinals are those $\alpha$ with $s_r > 0$. For these we define a 
specific “natural” sequence $\alpha(n)$ of ordinals approaching $\alpha$ by 

$\alpha(n) = a_1\omega^{s_1} + \cdots + a_{r-1}\omega^{s_{r-1}} + (a_r - 1)\omega^{s_r} + n\omega^{s_r - 1}$. 

For example, if $\alpha = 2\omega^4 + 3\omega^2$, then $\alpha(n) = 2\omega^4 + 2\omega^2 + n\omega$. We define the natural sequence 
approaching $\omega^\omega$ by 

$\omega^\omega(n) = \omega^n$. 

Now we define $f_\alpha(n)$ for each $\alpha \le \omega^\omega$ using transfinite induction by 

(+) $f_{\alpha+1}(n) = f_\alpha^{(n)}(1)$, 
(++) $f_\alpha(n) = f_{\alpha(n)}(n)$, 

where $\alpha$ is a limit ordinal and the initial value $f_1(n) = 2n$. (See Fig. 6.) 


$f_{\omega^\omega}(9)$ WARP 3.5 


FIG. 6. 


Let us emphasize that though we are using the language of infinite ordinals the functions $f_\alpha$ are 
recursive functions and the values $f_\alpha(t)$ are well-defined integers. The infinite ordinals are, in one 
sense, merely finite sequences of positive integers being manipulated in particular ways. A 
recursive program for computing $f_\alpha^{(t)}(n)$ could take the following form. 


FUNCTION F(α, N, T) 
BEGIN 
    IF T > 1 
        SET X = F(α, N, T − 1) 
        RETURN F(α, X, 1) 
    IF T = 1 AND α = 1 
        RETURN 2*N 
    IF T = 1 AND LIMITORDINAL(α) 
        RETURN F(α(N), N, 1) 
    IF T = 1 AND NOT LIMITORDINAL(α) 
        RETURN F(α − 1, 1, N) 
END 
The representation of α, the predicate LIMITORDINAL(α), and the functions α − 1 and α(N) 
need to be defined explicitly, though we do not do so here. 
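
One way to supply the missing pieces for ordinals below $\omega^\omega$ is sketched here in Python. The encoding is an assumption made for illustration and is not taken from the text: an ordinal $c_0 + c_1\omega + c_2\omega^2 + \cdots$ is stored as the tuple `(c_0, c_1, c_2, ...)`, and the helper names are hypothetical. The functions `f` and `F` follow rules (+) and (++) and the program above.

```python
def is_one(alpha):
    """True for the ordinal 1, the base case f_1(n) = 2n."""
    return alpha[0] == 1 and not any(alpha[1:])


def is_limit(alpha):
    """A nonzero ordinal below w^w is a limit exactly when its finite part c_0 is 0."""
    return alpha[0] == 0


def predecessor(alpha):
    """alpha - 1 for a successor ordinal."""
    return (alpha[0] - 1,) + tuple(alpha[1:])


def natural_sequence(alpha, n):
    """alpha(n): replace the smallest term a*w^s by (a-1)*w^s + n*w^(s-1)."""
    coeffs = list(alpha)
    s = next(i for i, c in enumerate(coeffs) if c > 0)
    coeffs[s] -= 1
    coeffs[s - 1] = n
    return tuple(coeffs)


def f(alpha, n):
    """f_alpha(n) for 1 <= alpha < w^w and n >= 1."""
    if is_one(alpha):
        return 2 * n
    if is_limit(alpha):
        return f(natural_sequence(alpha, n), n)   # rule (++)
    return F(predecessor(alpha), 1, n)            # rule (+)


def F(alpha, n, t):
    """The t-th iterate of f_alpha applied to n, as in FUNCTION F above."""
    value = n
    for _ in range(t):
        value = f(alpha, value)
    return value


omega = (0, 1)
print(f(omega, 3))       # f_w(3) = f_3(3)
print(f((0, 0, 1), 2))   # f at w^2, evaluated at 2
```

Only very small arguments terminate in practice: $f_\omega(4) = f_4(4)$ already requires $f_3(65536)$.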
We continue the ordinals a half-WARP further. Set 

$\omega_1 = \omega,\ \omega_2 = \omega^\omega,\ \ldots,\ \omega_{s+1} = \omega^{\omega_s},\ \ldots$ 

and set $\varepsilon_0$ equal the limit of the $\omega_s$. (We emphasize that $\omega_1$ is not the first uncountable ordinal. All 
ordinals in this paper are countable.) Each ordinal $\alpha < \omega_{s+1}$ is uniquely represented as 

$\alpha = a_1\omega^{\beta_1} + \cdots + a_r\omega^{\beta_r}$ $(\omega_s > \beta_1 > \beta_2 > \cdots > \beta_r \ge 0)$ 

with the $a_i$ positive integers. A “typical” ordinal is 


2w+ 1 


w+8 w 
Ta! + 1d go"? +5 
Now for limits. We say $n\omega^\beta$ is the natural sequence approaching $\omega^{\beta+1}$. If $\beta$ itself is a limit ordinal, 
then its limit sequence $\beta(n)$ has already been defined and we call $\omega^{\beta(n)}$ the natural sequence 
approaching $\omega^\beta$. For sums we keep all but the smallest term fixed and take a limit sequence 
approaching that smallest term. Thus 


2w+ 1 


wt+8 w wt+8 w n 
Tu)” 4 130°” +5w 4 aw” +40°%+w 


is the natural sequence for the ordinal above. Finally, $\varepsilon_0$ has the natural sequence $\varepsilon_0(n) = \omega_n$. 
Now the hierarchy $f_\alpha$ defined by (+), (++) may be extended to all $\alpha < \varepsilon_0 + \omega$. We have a big 
number. (See Fig. 7.) 


$f_{\varepsilon_0+9}(9)$ WARP 4 


FIG. 7. 


This should win the game against any nonlogician! 


2. The Connection. Let PA stand for Peano Arithmetic, that first order theory of numbers 
which includes the basically finitistic methods of number theory. The surprising truth is that 
WARP 4 lies beyond the scope of PA. The sense in which we use this was shown by G. Kreisel [4] 
in 1952. 

A statement $P(x_1,\ldots,x_s)$ is called provably recursive if there is an algorithm for deciding if 
$P(x_1,\ldots,x_s)$ is true and a proof, in PA, that the algorithm always terminates. Thus 
$P(x, y, z, t)$: $x^t + y^t = z^t$ is provably recursive (simply make the calculation) but 

$P(t)$: $(Ex)(Ey)(Ez)\ x^t + y^t = z^t$ 

is not known to be provably recursive. 

We say a function $f$ dominates a function $g$ if there exists $n$ such that $f(x) \ge g(x)$ for all 
$x > n$. 

Let $P(x, y)$ be a provably recursive statement in PA and suppose $(x)(Ey)P(x, y)$ is provable 
in PA. Set $f_P(x)$ equal the least $y$ such that $P(x, y)$ is true. Then, Kreisel showed, the function $f_P$ 
is dominated by $f_\alpha$ for some $\alpha < \varepsilon_0$. As $f_{\varepsilon_0}$ dominates all previous $f_\alpha$ we draw the following 
conclusion. 


Let $(x)(Ey)P(x, y)$ be a statement of PA which is true for the natural numbers and let 
$f_P(x)$ be the least $y$ for which $P(x, y)$ is true. Suppose $P(x, y)$ is provably recursive. If $f_P$ 
dominates $f_{\varepsilon_0}$, then the statement $(x)(Ey)P(x, y)$ is unprovable in PA. 


3. An Unprovable Theorem.* The epochal work of Kurt Gödel gave the existence of statements 
in PA which are true for the natural numbers but unprovable in PA. The statements 
constructed by Gödel suffered the defect of being unnatural and for the past half century a 
somewhat raggedy debate ensued concerning whether or not Gödel’s result applied to statements 
of real mathematical interest. In 1977 Jeff Paris and Leo Harrington [2] gave the first natural 
example of a statement that was true for the integers and unprovable in PA. (The term “natural” 
is here a matter of subjective opinion.) Their statement comes from Ramsey Theory, a subdiscipline 
of Combinatorial Analysis, and to give it one needs a moment’s introduction to that subject. 
(A detailed treatment is given in [1].) 


*The term “unprovable theorem” is abhorred by logicians. Theorems have proofs by definition. For our informal 
discussion, however, it seems appropriate to the subject matter to use this delightful oxymoron. 

By “an $r$-coloring of the $k$-sets of $S$” we mean a function $\chi$ with domain the family of 
$k$-element subsets of $S$ and range $[r]$. (Notation: $[a, b] = \{a, a+1, \ldots, b\}$, $[r] = [1, r]$, $[a, b) = 
[a, b-1]$.) Given such a coloring $\chi$ a set $B \subseteq S$ is called monochromatic if all of the $k$-element 
subsets of $B$ have the same color. 

We may state Ramsey’s Theorem in either a finite or an infinite form. 


RAMSEY’S THEOREM (Infinite Form). For all k,r given any r-coloring of the k-sets of N, there 
exists a monochromatic infinite set B. 


RAMSEY’S THEOREM (Finite Form). For all k, r, t there exists n so that given any r-coloring of the 
k-sets of [n], there exists a monochromatic t-set B. 
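
For very small parameters the finite form can be checked by exhaustive search. The Python sketch below does this for colorings of pairs ($k = 2$); the function names are illustrative, and the search is feasible only for tiny $n$ because the number of colorings grows as $r^{\binom{n}{2}}$.

```python
from itertools import combinations, product


def has_monochromatic_t_set(n, coloring, t):
    """coloring maps each pair (i, j) with i < j from [n] to a color."""
    for subset in combinations(range(1, n + 1), t):
        colors = {coloring[pair] for pair in combinations(subset, 2)}
        if len(colors) == 1:
            return True
    return False


def finite_ramsey_holds(n, r, t):
    """True if every r-coloring of the 2-sets of [n] has a monochromatic t-set."""
    pairs = list(combinations(range(1, n + 1), 2))
    for assignment in product(range(1, r + 1), repeat=len(pairs)):
        coloring = dict(zip(pairs, assignment))
        if not has_monochromatic_t_set(n, coloring, t):
            return False
    return True


print(finite_ramsey_holds(5, 2, 3))  # n = 5 is not enough for k = 2, r = 2, t = 3
print(finite_ramsey_holds(6, 2, 3))  # n = 6 suffices
```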


From the infinite form of Ramsey’s Theorem we deduce the finite form as follows. Suppose the 
finite form false and fix $k, r, t$ so that for all $n$ there exists an $r$-coloring of the $k$-sets of $[n]$ with 
no monochromatic $t$-set $B$. Any coloring for a larger $n$ also works for a smaller one. Hence, for 
any given n, there is some coloring which can be extended to arbitrarily large n. Construct 
colorings for successively larger values of n in turn, each of which extends to arbitrarily large n. 
The union is an r-coloring of the k-sets of N with no monochromatic t-set B. Thus the infinite 
form of Ramsey’s Theorem would be false. The reasoning above, often called a Compactness 
Argument, can be applied in many situations to reduce an “infinite form” to a “finite form,” see, 
e.g., [1]. 

Define a set $S$ of positive integers to be large if $|S| > \min(S)$. For example, $\{3, 4, 7, 9\}$ is large 
but $\{4, 63, 1281, 4504655\}$ is not. The statement of Paris and Harrington (in one version) requires a 
seemingly minor modification of Ramsey’s Theorem. 


(PH) For all $k, r$ there exists $n$ so that given any $r$-coloring of the $k$-sets of $[k + 1, n]$ there exists 
a large monochromatic $B \subseteq [k + 1, n]$. (The exclusion of $1, \ldots, k$ is purely technical, avoiding such 
trivial large sets as $\{2, 3, 4\}$.) 


If we allow infinitistic techniques, (PH) is relatively simple to prove. Suppose (PH) is false for a 
particular $k, r$. By the Compactness Argument there would exist an $r$-coloring $\chi$ of the $k$-sets of 
$[k + 1, \infty)$ with no large monochromatic finite $B$. However, given any such $\chi$ the infinite form of 
Ramsey’s Theorem guarantees the existence of an infinite monochromatic set $C$. The first 
$\min(C) + 1$ elements of $C$ then give a large monochromatic finite $B$. 

We have deduced both Ramsey’s Theorem (finite form) and (PH) from Ramsey’s Theorem 
(infinite form). Neither of these arguments is formalizable in PA since neither Ramsey’s Theorem 
(infinite form) nor the Compactness Argument can even be stated in PA. This, by itself, does not 
show that Ramsey’s Theorem (finite form) or (PH) are unprovable in PA, only that we have 
not proven them. In fact, Ramsey’s Theorem (finite form) can be proven in PA (though we do not 
prove it here) but (PH) cannot. 

Paris and Harrington, in their original work, showed by model-theoretic arguments that (PH) 
was unprovable in PA. Robert Solovay, hearing of their result but not of their proof, discovered a 
more combinatorial argument. Let PH(k, r) be the least n such that for every r-coloring of the 
k-element subsets of [k + 1, n] there exists a monochromatic large B. Solovay showed that PH 
grows too fast for PA. 

A full discussion of Solovay’s argument is somewhat beyond the bounds of this expository 
discussion (though not by too much, see [1] or the original [3]), but we can quite easily 
demonstrate that PH(2, r) grows quite rapidly. (A similar exposition was given by Smorynski [5].) 
To find a lower bound for PH(2, r) we give explicit r-colorings of the 2-sets of [3, n]. 

Split $[3, \infty)$ into consecutive intervals of the form $[x, 2x)$, i.e., $[3, 6), [6, 12), [12, 24), [24, 48), \ldots$. 
We give the pair $\{i, j\}$ color 1 if $i$ and $j$ lie in a common interval. If all pairs in a set 
$A = \{a_1, \ldots, a_s\}$ have color 1, then $A \subseteq [x, 2x)$, so $|A| \le x$ and $\min(A) \ge x$, hence $A$ is not large. 
Set $g_1(x) = 2x$. Now we define $g_2(x) = g_1^{(x)}(x) = x2^x$ and split $[3, \infty)$ into consecutive intervals 
of the form $[x, g_2(x))$, that is, 

$[3, 24),\ [24, 24\cdot 2^{24}),\ [24\cdot 2^{24}, 24\cdot 2^{24}\cdot 2^{24\cdot 2^{24}}),\ \ldots$ 

We give a pair $\{i, j\}$ color 2 if $i$ and $j$ lie in a common interval and the pair does not have color 1. 
If all pairs in a set $A = \{a_1, \ldots, a_s\}$ have color 2, then $A \subseteq [x, g_2(x))$, which is split into $x$ 
subintervals. Each element of $A$ lies in a separate subinterval (since no pair has color 1) so $|A| \le x$ 
and $A$ is not large. On $[3, g_2(3))$ all 2-sets have either color 1 or 2 and there are no monochromatic 
large sets. Thus 

$PH(2,2) \ge g_2(3) = 24$. 
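
The two-color construction is small enough to verify by machine. The Python sketch below builds the coloring of pairs from the interval partitions described above and searches the integers 3 through 23 for a large monochromatic set; all function names are illustrative assumptions, and the backtracking search is a convenience rather than part of the original argument.

```python
from functools import lru_cache


def g(s, x):
    """g_1(x) = 2x and g_{s+1}(x) = g_s^{(x)}(x)."""
    if s == 1:
        return 2 * x
    value = x
    for _ in range(x):
        value = g(s - 1, value)
    return value


def interval_start(s, v):
    """Left endpoint of the interval [x, g_s(x)) containing v, counting from 3."""
    x = 3
    while g(s, x) <= v:
        x = g(s, x)
    return x


@lru_cache(maxsize=None)
def color(i, j, r):
    """Smallest s <= r for which i and j share an interval [x, g_s(x)), else None."""
    for s in range(1, r + 1):
        if interval_start(s, i) == interval_start(s, j):
            return s
    return None


def is_large(members):
    return len(members) > min(members)


def find_large_monochromatic(lo, hi, r):
    """Return a large monochromatic subset of [lo, hi], or None if there is none."""
    def extend(current, c, candidates):
        if len(current) >= 2 and is_large(current):
            return current
        for idx, v in enumerate(candidates):
            if all(color(u, v, r) == c for u in current):
                found = extend(current + [v], c, candidates[idx + 1:])
                if found:
                    return found
        return None

    for c in range(1, r + 1):
        for start in range(lo, hi + 1):
            found = extend([start], c, list(range(start + 1, hi + 1)))
            if found:
                return found
    return None


print(g(2, 3))                             # right endpoint of the first color-2 interval
print(find_large_monochromatic(3, 23, 2))  # None: no large monochromatic set exists
```

The same code defines $g_3$, but $g_3(3)$ is far too large to enumerate, so the three-color bound has to rest on the argument in the text.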


We continue in this manner, defining $g_{s+1}(x) = g_s^{(x)}(x)$, partitioning $[3, \infty)$ into consecutive 
intervals of the form $[x, g_{s+1}(x))$, and giving a pair $\{i, j\}$ color $s + 1$ if $i$ and $j$ lie in a common 
interval and the pair has not been given a smaller color. Then, quite explicitly, we have shown 

$PH(2,3) \ge g_3(3) = 24\cdot 2^{24}\cdot 2^{24\cdot 2^{24}}$ 

and, in general, $PH(2, r) \ge g_r(3)$. The function $g_r(3)$ has order roughly $f_\omega(r)$. 

The colorings of $k$-sets are equally explicit but require a greater technical effort. Solovay 
showed that $PH(3, r)$ is bounded from below by $f_\alpha(r)$ where $\alpha = \omega_2$, $PH(4, r)$ by $f_\alpha(r)$ where 
$\alpha = \omega_3$, etc., and that $PH(r, r)$ was bounded from below by $f_{\varepsilon_0}(r)$. (Though we do not require it 
here, Ketonen found upper bounds on these functions of roughly the same order.) 

Let P(k, n) be the statement “Given any k-coloring of the k-sets of [k + 1, n] there exists a 
large monochromatic B.” P(k, n) is surely provably recursive as one may check all k-colorings of 
the k-sets of [k + 1, n]. Applying Kreisel’s fundamental result the statement 


(k)( En) P(k,n) 


is unprovable in PA. 


4. Reflections. WARP 4 takes us to the tradeoff between largeness and definiteness. We have 
described an algorithm for computing $f_\alpha(t)$, but how do we know that the algorithm will work 
(i.e., terminate)? One way is by transfinite induction: the determination of $f_\alpha(t)$ requires $t$ calls of 
the algorithm to calculate $f_{\alpha-1}$ or, if $\alpha$ is a limit ordinal, the algorithm for $f_{\alpha(t)}$. In either case these 
are smaller ordinals, by induction the algorithm works for them, hence the $f_\alpha$ algorithm works. 
However, transfinite induction is a basically infinitistic tool and we can ask for a proof in PA that 
the algorithm for $f_\alpha$ will work. Here there is a very nice result. For $\alpha < \varepsilon_0$ there is such a proof in 
PA. However, for $\alpha = \varepsilon_0$ there is no proof in PA that the algorithm will always work. (This gives 
another statement which is true but unprovable, but one that would hardly be termed natural.) 

We would agree that the number described in Fig. 8 is not legitimate as it stands. It gives, in 
fact, Berry’s paradox, one of the classic Russell-type paradoxes. Since this number has been 
described in less than 50 words, it must be greater than itself. The problem lies in the notion of 
describable. Let us say that a number m is describable (modulo PA) in length n if there is a 
statement A(x) in PA such that 


(i) $(E_1 x)A(x)$ has a proof in PA of length at most n. ($E_1$ = “there exists a unique.”) 
(ii) There is an algorithm for deciding A(x) and a proof in PA of length at most n that the 
algorithm always terminates. 

(iii) A(m). 
We construct a legitimate alternative (see Fig. 9). This number cannot be described by a book the 
size of the known universe, with electrons for characters, in the language of PA. The function $g$ 
lies beyond PA, but just barely. It is of order $f_{\varepsilon_0}$ and the above card is still WARP 4. 

To go beyond WARP 4 we strengthen PA. Let GAM be a formalization of Generally Accepted 
Mathematics. (See Fig. 10.) 
The largest number describable 


in at most fifty words, plus one. WARPED 


FIG. 8. 


Let $g(n)$ equal the largest $m$ 
describable (modulo PA) in 
length $\le n$. 

$g(10^{100}) + 1$ WARP 4 


FIG. 9. 


Let $g_{GAM}(n)$ equal the largest $m$ 
describable (modulo GAM) in 
length $\le n$. 

$g_{GAM}(10^{100}) + 1$ WARP 5(?) 


FIG. 10. 


Travel beyond WARP 4 now depends on what one allows in GAM. There is always the danger 
that if too much is allowed, the system will become inconsistent and the 3 × 5 card will no longer 
define an integer. The game of describing the largest integer, when played by experts, lapses into 
hopeless argument over legitimacy. 
MISCELLANEA 
116. 


The condensation of metaphor involves no denial of logic: it is simply an extension of the 
implications of grammar, the development of a notation which, being less cumbersome, enables us 
to think more easily. It may be compared to the invention of a new notation, say that of Leibniz 
or Hamilton, in mathematics: the new is defined in terms of the old, it is a shorthand which must 
be learned by patient effort, but, once learnt, it makes possible the solution of problems which 
were too complicated to attack before. The human head can only carry a certain amount of 
notation at any one moment and poetry takes up less space than prose. 


—Michael Roberts, The Faber Book of Modern 
Verse, London, Faber and Faber, 1937, p. 20. 


His uncle is better known to most educated people, but for topologists he is the important one. 
(See p. 705.) 
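squared error as a measure of quality. It is well known and easy to verify that 

$\mathrm{MSE}\left(\frac{n}{n+1}\, S^2(\bar X)\right) \le \mathrm{MSE}\left(an S^2(\bar X)\right)$ 

and 

$\mathrm{MSE}\left(\frac{n}{n+2}\, S^2(\mu)\right) \le \mathrm{MSE}\left(an S^2(\mu)\right)$ 

for every $a > 0$. Thus if mean squared error is our criterion, we should use 

$\frac{1}{n+2}\sum_{i=1}^{n}(X_i - \mu)^2 = \frac{n}{n+2}\, S^2(\mu)$ 

when $\mu$ is known (an admissible estimator) and 

$\frac{1}{n+1}\sum_{i=1}^{n}(X_i - \bar X)^2 = \frac{n}{n+1}\, S^2(\bar X)$ 

when $\mu$ is unknown (an inadmissible estimator; cf. Stein (1964)). Moreover, our intuition is then 
confirmed as 

$\mathrm{MSE}\left(\frac{n}{n+2}\, S^2(\mu)\right) = \frac{2\sigma^4}{n+2} < \frac{2\sigma^4}{n+1} = \mathrm{MSE}\left(\frac{n}{n+1}\, S^2(\bar X)\right)$. 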


Thus the statistic using the known mean yp is better, in the sense of mean squared error, than the 
one ignoring this information. It should be noted, however, that these two estimators of o7 are 
almost never considered in basic textbooks on statistics. 
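
The mean squared errors quoted above can be checked with exact rational arithmetic. The Python sketch below assumes, as this passage appears to, a normal sample of size $n$, so that $\sum (X_i - \mu)^2$ and $\sum (X_i - \bar X)^2$ are $\sigma^2$ times chi-square variables with $n$ and $n - 1$ degrees of freedom; the function names are illustrative.

```python
from fractions import Fraction


def mse_scaled_chi_square(c, k):
    """MSE, in units of sigma^4, of c * sigma^2 * chi2_k as an estimator of sigma^2.

    The mean is c*k and the variance is 2*c^2*k, so
    MSE = bias^2 + variance = (c*k - 1)^2 + 2*c^2*k.
    """
    c = Fraction(c)
    return (c * k - 1) ** 2 + 2 * c * c * k


def mse_known_mean(n):
    """Estimator sum (X_i - mu)^2 / (n + 2)."""
    return mse_scaled_chi_square(Fraction(1, n + 2), n)


def mse_unknown_mean(n):
    """Estimator sum (X_i - Xbar)^2 / (n + 1)."""
    return mse_scaled_chi_square(Fraction(1, n + 1), n - 1)


for n in (2, 5, 10):
    print(n, mse_known_mean(n), mse_unknown_mean(n))
```

Minimizing $(ck - 1)^2 + 2c^2 k$ over $c$ gives $c = 1/(k + 2)$, which is where the divisors $n + 2$ and $n + 1$ come from.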

On the other hand, one might not be so much interested in $\sigma^2$ as in $\sigma$. Thus we might compare
the mean squared errors of $S(\bar X)$ and $S(\mu)$ as estimators of $\sigma$. The distributions of these statistics
are not so tractable as those of $S^2(\bar X)$ and $S^2(\mu)$. However, we calculated their mean squared
errors for sample sizes of 1, 2, ..., 30, 40, 50, ..., 100, and in each of these instances the mean
squared error of $S(\mu)$ was less than that of $S(\bar X)$. Thus it seems that for the estimators $S(\bar X)$ and
$S(\mu)$ of the standard deviation $\sigma$ our intuition holds while, for the estimators $S^2(\bar X)$ and $S^2(\mu)$ of
$\sigma^2$, it does not hold.
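The calculation described above can be repeated along the following lines. This is only a sketch, under the assumptions that the sample is normal and that both statistics are square roots of a sum of squares divided by n; the helper `mse_root` is a hypothetical name, and the mean of a chi variable is taken from the standard gamma-function formula.

```python
import math

def mse_root(k, n):
    """MSE / sigma^2 of sqrt(sigma^2 * chi2_k / n) as an estimator of sigma."""
    if k == 0:
        return 1.0  # the estimator is identically 0, so the error is sigma
    # E[chi_k] = sqrt(2) * Gamma((k + 1) / 2) / Gamma(k / 2); lgamma avoids overflow
    mean_chi = math.sqrt(2.0) * math.exp(math.lgamma((k + 1) / 2) - math.lgamma(k / 2))
    return k / n - 2.0 * mean_chi / math.sqrt(n) + 1.0

sizes = list(range(1, 31)) + list(range(40, 101, 10))
for n in sizes:
    known = mse_root(n, n)        # mean known: n degrees of freedom
    unknown = mse_root(n - 1, n)  # mean estimated: n - 1 degrees of freedom
    print(n, round(known, 6), round(unknown, 6), known < unknown)
```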

Finally, mean squared error is not the only possible criterion for measuring the quality of an 
estimator. Among other measures of the “closeness” of an estimator to its parameter is one 
proposed by Pitman. It can be shown that 


$$P\left[\,|S^2(\bar X) - \sigma^2| > |S^2(\mu) - \sigma^2|\,\right] > \tfrac{1}{2},$$


so that, in the Pitman sense, $S^2(\mu)$ is “closer” to $\sigma^2$ than $S^2(\bar X)$ is. Here again our intuition is
upheld.


FUNCTIONS WHICH PARAMETRIZE MEANS


MICHAEL E. MAYS 
Department of Mathematics, West Virginia University, Morgantown, WV 26506 


1. Introduction. This paper is about certain functions of two variables, means, and several 
natural ways that have arisen to associate them with functions of a single variable. 

We define a mean m(a, b) to be a function of two positive variables satisfying the following 
properties: 


(1) (Intermediacy) $\min(a,b) \le m(a,b) \le \max(a,b)$
(2) (Symmetry) $m(a,b) = m(b,a)$.
Several means well-enough known to have names are listed below:
Arithmetic          $A(a,b) = (a + b)/2$
Geometric           $G(a,b) = \sqrt{ab}$
Harmonic            $H(a,b) = 2ab/(a + b)$
Logarithmic         $L(a,b) = (b - a)/(\ln b - \ln a)$
Root Mean Square    $RMS(a,b) = [(a^2 + b^2)/2]^{1/2}$
Contraharmonic      $C(a,b) = (a^2 + b^2)/(a + b)$
Heronian            $He(a,b) = (a + \sqrt{ab} + b)/3$
Identric            $I(a,b) = \frac{1}{e}\left(\frac{a^a}{b^b}\right)^{1/(a-b)}$
Maximum             $Max(a,b)$
Minimum             $Min(a,b)$.


The logarithmic mean has been studied by Lin [12] and Carlson [3]. It was generalized by 
Stolarsky [10, 11], and studied by him and Leach and Sholander [7]. A special case of the 
generalization is the identric mean—it also arises as the value of a certain limit on the 1979 
Putnam Examination. 

There is a technique called compounding which builds new means from old. Carlson [2] 
develops this iterative procedure whereby (possibly nonsymmetric) functions of two variables are 
combined to yield a new mean and generalizes Gauss’s algorithm for the arithmetic-geometric 
mean and Borchardt’s algorithm for the calculation of the inverse sine. Lehmer [8] discusses the 
compounding of means from certain families by associating the families with series expansions. 
Gould and Mays [5] derive recurrence relations and explicit formulas for the coefficients in the 
series expansions of the families Lehmer studied and others. 

All of the named means satisfy a third property as well. 


(3) (Homogeneity) $m(a,b) = a\,m(1, b/a)$


for all a and b. We will speak of a homogeneous mean if we wish (3) to hold. One goal of this 
paper is to determine conditions on a function so that the mean associated with that function is 
homogeneous. Another aim is to find functions that are associated with the means mentioned at 
the beginning. The techniques we consider for associating means with functions of a single 
variable include a graphical technique of Moskovitz and an application of the Mean Value 
Theorem for integrals which is related to the approach of Stolarsky [10, 11] and the classical work 
of Hardy, Littlewood, and Polya [6]. 


2. The alignment chart of Moskovitz. Let f be any function from $(0,\infty)$ into $(0,\infty)$, and define


Michael E. Mays: I received my Ph.D. from Penn State in 1977 as a student of Raymond Ayoub. Since then I
have been at West Virginia University. Besides collecting means, my research interests include estimating the rate of 
growth of counting functions associated with properties of finite groups. 
$M_f(a,b)$ to be the x intercept of the line connecting $(a, f(a))$ and $(b, -f(b))$. It is easy to see
that intermediacy and symmetry both hold for $M_f$. To find the mean associated with a given f,
calculate the slope of the line through $(a, f(a))$, $(M_f(a,b), 0)$, and $(b, -f(b))$ in two ways to
obtain (see Fig. 1)

(4) $$\frac{f(a)}{a - M_f(a,b)} = \frac{f(b)}{M_f(a,b) - b}.$$

FIG. 1. [The line through $(a, f(a))$ and $(b, -f(b))$, with x intercept $(M_f(a,b), 0)$.]

This can be written as

(5) $$M_f(a,b) = \frac{a f(b) + b f(a)}{f(a) + f(b)}.$$


Different functions can determine the same mean.
THEOREM 1. $M_f = M_g$ if and only if $g = kf$ for some $k > 0$.


Proof. If $g = kf$, k cancels in the right-hand side of (5) to yield $M_f = M_g$. If $g \ne kf$ for any k,
pick a, b, and k so that $g(a) = kf(a)$ but $g(b) \ne kf(b)$. Then if $M_f(a,b) = M_g(a,b)$,

(6) $$\frac{a f(b) + b f(a)}{f(a) + f(b)} = \frac{a g(b) + b k f(a)}{k f(a) + g(b)}.$$

Cross-multiplying and regrouping yields
$$a f(a)[k f(b) - g(b)] = b f(a)[k f(b) - g(b)],$$
and since $k f(b) - g(b) \ne 0$ and $f(a) \ne 0$, we must have a = b, a contradiction.


COROLLARY. When associating $M_f$ with a given function f, we may assume without loss of
generality that $f(1) = 1$.


Any function f determines a mean $M_f$ by formula (5), but not every mean is determined by a
function. There are obvious difficulties with Max and Min, and less obvious difficulties with some
other homogeneous means.


THEOREM 2. Let m(a, b) be a homogeneous mean, and define f(x) by

(7) $$f(x) = \begin{cases} \dfrac{x - m(1,x)}{m(1,x) - 1}, & x \ne 1, \\[4pt] 1, & x = 1. \end{cases}$$

If f(x) is multiplicative, then $m = M_f$. If f(x) is not multiplicative, then $m \ne M_g$ for any g.
Proof. If $m = M_f$, f is determined by (5) with a = 1, b = x, and f(1) = 1 to be (7). Since m is
homogeneous,
$$m(x, xy) = x\,m(1, y),$$
so
$$\frac{f(xy) + y f(x)}{f(x) + f(xy)} = \frac{f(y) + y}{1 + f(y)},$$
and cross-multiplying again gives an equation which simplifies to
$$f(xy) = f(x) f(y).$$
This says that the only possible functions yielding homogeneous $M_f$ are constant multiples of
powers of x. It is to be emphasized that as long as $m(1,x) \ne 1$ the function f(x) given by (7) may
be used to calculate correctly the mean of 1 and x. It is only when neither argument of the mean is
1 that the trouble arises.
For example, (7) associates with the Heronian mean the function

$$f(x) = \frac{x - (x + \sqrt{x} + 1)/3}{(x + \sqrt{x} + 1)/3 - 1} = \frac{1 + 2\sqrt{x}}{2 + \sqrt{x}},$$

which is not multiplicative.
$He(3,7) = (3 + \sqrt{21} + 7)/3 = 4.861$, but
$$\frac{3f(7) + 7f(3)}{f(3) + f(7)} = 4.876.$$
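The Heronian example can be reproduced numerically. The sketch below codes formula (5) and formula (7) directly; the names `moskovitz_mean` and `f_from_mean` are illustrative and not from the paper.

```python
import math

def moskovitz_mean(f, a, b):
    """Formula (5): x intercept of the line through (a, f(a)) and (b, -f(b))."""
    return (a * f(b) + b * f(a)) / (f(a) + f(b))

def heronian(a, b):
    return (a + math.sqrt(a * b) + b) / 3.0

def f_from_mean(m):
    """Formula (7): the function a homogeneous mean m would have to come from."""
    def f(x):
        if x == 1:
            return 1.0
        return (x - m(1.0, x)) / (m(1.0, x) - 1.0)
    return f

f_he = f_from_mean(heronian)
print(heronian(3, 7))                 # the true Heronian mean of 3 and 7
print(moskovitz_mean(f_he, 3, 7))     # what formula (5) gives with f from (7)
print(f_he(3) * f_he(7), f_he(21))    # f is not multiplicative
```

When one argument is 1 the two computations agree, as the text points out; the discrepancy appears only when neither argument is 1.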


Of the means considered by Moskovitz, only the following have multiplicative functions 
associated with them by (7): 


      $M_f$      f
      A          1
      G          $\sqrt{x}$
      H          x
      C          1/x.
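The four entries of this table can be confirmed by substituting each f into formula (5). The following is a small numerical sketch at the arbitrarily chosen point (a, b) = (3, 7); the function name is illustrative.

```python
import math

def moskovitz_mean(f, a, b):
    """Formula (5): (a f(b) + b f(a)) / (f(a) + f(b))."""
    return (a * f(b) + b * f(a)) / (f(a) + f(b))

a, b = 3.0, 7.0
table = {
    "A": (lambda x: 1.0,     (a + b) / 2),
    "G": (math.sqrt,         math.sqrt(a * b)),
    "H": (lambda x: x,       2 * a * b / (a + b)),
    "C": (lambda x: 1.0 / x, (a * a + b * b) / (a + b)),
}
for name, (f, expected) in table.items():
    print(name, moskovitz_mean(f, a, b), expected)
```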


The technique of Moskovitz provides a set of homogeneous means parametrized by a single
variable, given by

(8) $$M_s(a,b) = M_{x^s}(a,b) = \frac{a b^s + b a^s}{a^s + b^s}.$$

Either a geometric or an analytic argument gives that, for fixed a and b, $M_s$ is a decreasing
function of s; $\lim_{s\to\infty} M_s = \mathrm{Min}$ and $\lim_{s\to-\infty} M_s = \mathrm{Max}$. The property of being monotone in the
parameter is shared by several other sets of means that have occurred in the literature, among
them the power means

(9) $$P_s(a,b) = [(a^s + b^s)/2]^{1/s},$$

a mean of Gini [4], also studied by Beckenbach [1] and Lehmer [8],

(10) $$G_s(a,b) = \frac{a^s + b^s}{a^{s-1} + b^{s-1}},$$

or Stolarsky's generalized logarithmic mean

(11) $$U_s(a,b) = \left[\frac{a^s - b^s}{s(a - b)}\right]^{1/(s-1)}.$$

That $M_s(a,b)$ and $G_s(a,b)$ have similar properties is explained by the identity
$$M_s(a,b) = G_{1-s}(a,b).$$
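The identity and the monotonicity claim are easy to spot-check numerically. The sketch below codes (8) and (10) as written; the sample point (2, 5) and the grid of parameter values are arbitrary choices.

```python
import math

def M(s, a, b):
    """Formula (8): the Moskovitz mean belonging to f(x) = x**s."""
    return (a * b**s + b * a**s) / (a**s + b**s)

def G(s, a, b):
    """Formula (10): the Gini mean with parameter s."""
    return (a**s + b**s) / (a**(s - 1) + b**(s - 1))

a, b = 2.0, 5.0
grid = [-3.0, -1.0, 0.0, 0.5, 1.0, 2.0, 4.0]
for s in grid:
    print(s, M(s, a, b), G(1 - s, a, b))
```

Multiplying numerator and denominator of $G_{1-s}$ by $a^s b^s$ gives (8) directly, which is what the printed columns reflect.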


Two parameter families also exist. Several of Gini’s means of several variables reduce in the
case of two variables to

(12) $$G(r,s;a,b) = \left[\frac{a^s + b^s}{a^r + b^r}\right]^{1/(s-r)},$$

which generalizes (10), and Leach and Sholander worked with

(13) $$E(r,s;a,b) = \left[\frac{r}{s}\cdot\frac{a^s - b^s}{a^r - b^r}\right]^{1/(s-r)},$$

which generalizes (11). (9) is a special case of both (12) and (13). The two parameter means are
monotone in each parameter if a and b and the other parameter are fixed.


3. Means and Mean Value Theorems. Formula (11) arises as a special case of a general
method for associating a mean with a function in [10], just as (8) arises from the technique in [9].
Let g be (strictly) monotone, differentiable, and convex on $(0,\infty)$. Then the Mean Value Theorem
for derivatives guarantees that for each pair of numbers a and b there exists a unique c in (a, b)
satisfying
(14) $$g'(c) = (g(b) - g(a))/(b - a).$$

Write $c = U_g(a,b)$. The c chosen is clearly between a and b, and symmetry is also obvious (Fig.
2).

The Mean Value Theorem for integrals generates mean values, too. We need only assume that f
is (strictly) monotone and continuous on $(0,\infty)$ to be sure that a unique c in (a, b) exists
satisfying

(15) $$f(c)(b - a) = \int_a^b f(x)\,dx.$$

We define the mean $V_f$ by

(16) $$V_f(a,b) = f^{-1}\left[\int_a^b f(x)\,dx \Big/ (b - a)\right].$$

The next theorem shows that the sets of means

$$\{U_g \mid g \text{ is monotone, differentiable, and convex}\}$$

and

$$\{V_f \mid f \text{ is monotone and continuous}\}$$

are the same, but we will use the $V_f$ form exclusively thereafter because a broader class of
functions are allowed to generate means (at least superficially), there is a more elegant
characterization of those functions which generate the same mean, the integral has a clearer link to the work
of Hardy, Littlewood, and Polya, and all the “nice” means are associated with powers of x (Fig.
3).
THEOREM 3. If f is monotone and continuous, then
$$V_f = U_{\int f(x)\,dx}.$$
If g is monotone, differentiable, and convex, then
$$U_g = V_{g'}.$$


The proof is an application of properties of real functions and the Fundamental Theorem of 
Calculus. 

Hardy, Littlewood, and Polya associate a mean of several numbers with a continuous,
monotone function. For the n nonnegative numbers $a_1 \le a_2 \le \cdots \le a_n$ and weights $q_1, q_2, \ldots, q_n$
which sum to 1, $\mathfrak{M}_f(a_1, a_2, \ldots, a_n)$ is defined by

$$\mathfrak{M}_f(a_1, a_2, \ldots, a_n) = f^{-1}\left[\sum_{i=1}^{n} q_i f(a_i)\right].$$

Take $a_1 = a$, $a_2 = a + (b - a)/n$, ..., and $a_n = a + (b - a)(n - 1)/n$, and $q_i = 1/n$ for each i
to get

$$\sum_{i=1}^{n} q_i f(a_i),$$

a Riemann sum for $\int_a^b f(x)\,dx/(b - a)$. It is an upper sum if f is decreasing and a lower sum if f is
increasing. Fancier choices for the $q_i$ yield a Stieltjes integral in the limit. In a sense, we have
constructed a mean of two variables as a generalization of a mean of n variables. Properties of $\mathfrak{M}_f$
developed in [6] yield properties of $V_f$. Theorem 16 in [6] implies that, for $V_{x^s}$ abbreviated as $V_s$, $V_s$
is an increasing function of s for fixed a and b. We list some means arising as a $V_s$ for some s.


      s                 $V_s$
      $\to \infty$      Max
      1                 A
      $\to 0$           I
      $-1/2$            $(A + G)/2$
      $-1$              L
      $-2$              G
      $-3$              $(HG^2)^{1/3}$
      $\to -\infty$     Min
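The entries of this list can be checked from the closed form that (16) takes for $f(x) = x^s$, namely $V_s(a,b) = [(b^{s+1} - a^{s+1})/((s+1)(b-a))]^{1/s}$, with the values at s = 0 and s = -1 obtained as limits. The sketch below is a numerical spot check at the arbitrary point (2, 8); the function name `V` is illustrative.

```python
import math

def V(s, a, b):
    """V_s(a, b) for f(x) = x**s, with the limiting cases s = 0 and s = -1."""
    if a == b:
        return float(a)
    if s == 0:    # limit s -> 0: the identric mean
        return math.exp((b * math.log(b) - a * math.log(a)) / (b - a) - 1.0)
    if s == -1:   # s = -1: the logarithmic mean
        return (b - a) / (math.log(b) - math.log(a))
    avg = (b ** (s + 1) - a ** (s + 1)) / ((s + 1) * (b - a))
    return avg ** (1.0 / s)

a, b = 2.0, 8.0
A, G, H = (a + b) / 2, math.sqrt(a * b), 2 * a * b / (a + b)
L = (b - a) / (math.log(b) - math.log(a))
I = math.exp(-1.0) * (a ** a / b ** b) ** (1.0 / (a - b))
expected = {1: A, 0: I, -0.5: (A + G) / 2, -1: L, -2: G, -3: (H * G * G) ** (1 / 3)}
for s, value in expected.items():
    print(s, V(s, a, b), value)
```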
Conspicuous in its absence from this list is H, although the mean arising for s = —3 is enough 


to exhibit that H < G. No matter what f is taken, H never arises. To see why, carry the parallel 
between this construction and that of Hardy, Littlewood, and Polya a few theorems further. 
Theorem 83 of [6] implies the following result: 


THEOREM 4. $V_f(a,b) = V_g(a,b)$ for all a and b if and only if there exist constants $c \ne 0$ and k
such that f = cg + k.


A consequence of this theorem is that we may assume f is increasing without loss of generality. 
In fact we may take f(1) = 1 and, if f is differentiable at 1, f′(1) = 1 by an appropriate choice of c
and k. 

The analogue of Theorem 84 of [6] provides a criterion for homogeneity. 


THEOREM 5. Suppose f is continuous and monotone on $(0,\infty)$, and that $V_f$ is homogeneous. Then
$V_f = V_s$ for some real number s.


THEOREM 6. $H \ne V_f$ for any f.


Proof. By Theorem 5, since H is homogeneous we need only look among the means $V_s$. If
$V_s(1,2) = H(1,2) = 4/3$, solving

$$[(2^{s+1} - 1)/(s + 1)]^{1/s} = 4/3$$
yields s = −5.14, but if $V_s(1,3) = H(1,3) = 3/2$,

$$[(3^{s+1} - 1)/(2s + 2)]^{1/s} = 3/2$$
has solution s = −5.36.
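The two values of s in this proof can be recovered by bisection, since $V_s$ is increasing in s. The following is a rough sketch; the bracketing interval [-10, -2] and the helper names are choices made here for illustration, not taken from the paper.

```python
def V_s(s, a, b):
    """V_s(a, b) = [(b^(s+1) - a^(s+1)) / ((s+1)(b-a))]^(1/s), for s not in {0, -1}."""
    avg = (b ** (s + 1) - a ** (s + 1)) / ((s + 1) * (b - a))
    return avg ** (1.0 / s)

def solve_s(target, a, b, lo=-10.0, hi=-2.0, tol=1e-10):
    """Bisection for V_s(a, b) = target, using that V_s increases with s."""
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if V_s(mid, a, b) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

s_12 = solve_s(4.0 / 3.0, 1.0, 2.0)   # H(1, 2) = 4/3
s_13 = solve_s(3.0 / 2.0, 1.0, 3.0)   # H(1, 3) = 3/2
print(s_12, s_13)
```

Because the two roots differ, no single s gives $V_s = H$ at both pairs, which is the point of the proof.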
MONTHLY UNSOLVED PROBLEMS 1969-1983


RICHARD K. GUY 
Department of Mathematics & Statistics, The University of Calgary, Calgary, Alberta, Canada, T2N 1N4


Regular readers will know that in December of odd-numbered years we pool the information 
that correspondents send in concerning problems that have appeared in this section. References in 
brackets are to years and pages of this MONTHLY, earlier updating articles being [1971, 1113; 
1973, 1120; 1975, 995; 1977, 807; 1979, 847 and 1981, 775]. In several of these there are 
statements of policy, including that of not normally publishing solutions. A parenthesis with a 
year after a name, such as Guy (1981), relates to the list of references at the end, as do (tbp) and 
(wrc), which respectively mean “to be published” and “written communication.” 


To accommodate a wider variety of unsolved problems, we have experimented with different 
formats, and sometimes, when there’s a dearth of suitable problems, with no format at all! 


There would not be enough space to pass on all the information if we restated all the problems 
in detail, so that many are given only a brief mention, without even defining the main ideas 
involved. This might betoken a regrettable lack of leisureliness. On the other hand, if we are 
encouraged to turn to our back issues, we are reminded of the wealth of good material that they 
contain. 
Falconer (1983) has settled a problem of Klee [1969, 54; correction 1971, 1114] by showing
that, except for certain unlikely possibilities, any curve with two equireciprocal points must have
the same equireciprocal constant at each point. A point P is an equireciprocal point of a curve C if
C is starshaped at P and if every chord XY of C through P satisfies 1/|XP| + 1/|PY| = a for
some (equireciprocal) constant a. Further, any twice differentiable curve with two equireciprocal 
points must be an ellipse. On the other hand, there are nonelliptical convex curves with two 
equireciprocal points. Hallstrom (1974) had obtained some partial results. 


If $\phi(x)$ is Euler’s totient function, then Carmichael’s conjecture is that for no value of n does
the equation $\phi(x) = n$ have a unique solution: see Klee [1969, 288]. Carmichael showed that such
an n is greater than $10^{37}$ and Klee in 1944 improved this to $10^{400}$. Now Masai and Valette (1982)
have a lower bound of $10^{10000}$. Valette (1981) has also published an expository paper.


The quest for gracefulness [1969, 1128] has abated somewhat, we are glad to say. All seekers 
are urged to read the paper of Huang, Kotzig and Rosa (1982). Ayel and Faveron (1982) have 
shown that helms (wheels with an additional edge at each rim-vertex) are graceful (Fig. 1). 
Golomb (1981) rides a number of hobby-horses, including polyominoes and graceful numbering, 
but there is no progress with the main Ringel-Kotzig conjecture. 


FIG. 1. Two graceful helms.


Peter Walker (1983) has the following result for Ogilvy’s problem [1970, 388] on complex 
iterated radicals. The sequence $z_{n+1} = \sqrt{z_n + c}$ converges in all cases with c < −1/4, or with
Imc > 0, provided all square roots have positive imaginary part. For Imc <0 there are 
corresponding results when all square roots have negative imaginary part. 


T. C. Brown [1971, 886] asked if there is a sequence on four symbols in which no two adjacent 
segments are permutations of one another, and discussed other aspects of Thue (Morse-Hedlund) 
sequences (Guy (1981), Problem E21). Some recent papers in this area are by Bean, Ehrenfeucht 
& McNulty (1979) and by Shelton (& Rao) (1981, 1982). 


If S, C are noncoincident planar convex sets whose interiors intersect, define a(S, C) to be the 
number of connected components in the intersection of the boundaries of S and C, then Peterson 
[1972, 505] made the following three conjectures: (a) ¿ if a(S, S′) is even or infinite for every S′
congruent to S, then S has constant width? (b) ¿ if a(S, C) is even or infinite for every circular
disc C of diameter w, then S has constant width w? (c) ¿ if C is a convex set of constant width w
and a(S, C′) is even or infinite for every C′ congruent to C, then S has constant width w?




Goodey & Woodcock (1978) verified (b) and now Goodey (tbp) verifies all three conjectures by 
proving the following theorem: if S and C are planar convex sets such that a(S, C’) is even or 
infinite for all translations C’ of C, then S and C have equal width in every direction. 


Detlef Seese (wrc) has shown that the crossing number and the rectilinear crossing number of a 
graph [1973, 52] are computable. 


Earlier updating articles should have given more information on Lehmer’s problem (Alter 
[1973, 192]): does $\phi(n)$, Euler’s totient function, ever properly divide n − 1? I quote from Guy
(1981, Problem B37): Such an n must be a Carmichael number. Lehmer (1932) showed that n 
would have to be the product of at least seven distinct primes. Lieuwens (1970) proved the 
following theorems: if 3 divides n, then n is the product of more than 212 primes and 
$n > 5.5 \times 10^{570}$; if the smallest prime factor of n is 5, n contains at least 11 prime factors; if the
smallest prime factor of n is at least 7, then n is the product of at least 13 primes. This supersedes 
and corrects the work of Schuh (1944). Masao Kishore (1977) has shown that at least 13 primes 
are needed in any case and Cohen and Hagis (1980) have improved this to 14. 


Klee [1977, 284] asked for a good algorithm to compute the d-measure of the union of n given 
ranges (hyperrectangles) in d-space. When d ≥ 3, van Leeuwen & Wood (1981) improve Bentley’s
$O(n^{d-1} \ln n)$ algorithm (see Bentley & Wood (1980)) to one with $O(n^{d-1})$ steps.


For an excellent survey of the recent “dynamically increasing research activity in the theory of 
permanents,” including the present status of Wang’s conjectures [1978, 188; 1979, 119], see Minc 
(1983). Indeed, the present status is given for 20 conjectures and 10 problems, with a list of 11 
additional conjectures and 3 further problems. There is a bibliography of 77 items published since 
Minc (1978) and 30 addenda to that earlier survey. 


In answer to McCarty’s question [1978, 578], Herzberg & Garner (1981) construct latin queen
squares of any prime order, p > 11. For example, with (i, j)th entry 4i + 2j − 5 (mod p). John
van Rees (1981) extends this to: latin queen squares of order n exist if gcd (n, 210) = 1, and (wrc)
an analogous result for latin queen hypercubes in any number of dimensions. His two conjectures,
(a) ¿ there are no toroidal latin queen squares of order n if gcd (n, 210) > 1? (b) ¿ there are no
latin queen squares of order n if gcd (n, 210) > 1? are still open. He proves that toroidal latin
queen squares of order n do not exist if gcd (n,6) > 1, and mentions the problems: find a latin 
queen square of order 12, 14 or 15; find a toroidal latin queen square of order 25, 35 or 49.


In a remarkable paper, Tunnell (1983) has shown that the numbers less than a thousand and
≡ 1, 2 or 3 (mod 8) whose congruent or noncongruent status was unknown to Alter [1980, 43] and
are missing from rows a ≡ 1, 2 or 3 (mod 8) in Table 1 [1981, 759] or Table 7 of Guy (1982,
Problem D27), viz.


a ≡ 1 (mod 8)    569, 577, 593, 809, 857, 881, 897, 953
a ≡ 2 (mod 8)    282, 482, 706, 802, 898, 938
a ≡ 3 (mod 8)    627, 939


are all noncongruent. So only nine numbers less than a thousand have unknown status:


a ≡ 5 (mod 8)    573, 597, 677, 893, 917, 933, 965
a ≡ 7 (mod 8)    543, 623


Tunnell writes “It is a classical Diophantine problem to determine which integers are the area of
some right triangle with rational sides. It is well known that D is the area of such a triangle if and
only if the elliptic curve $y^2 = x^3 - D^2 x$ has infinitely many rational points. Using results from
the theory of modular forms of half-integral weight due to Shimura and Waldspurger, and the
theorem of Coates-Wiles on elliptic curves with complex multiplication, we construct modular
forms $\sum a(n) q^n$ of weight 3/2 such that a(n) nonzero implies that n is not the area of any rational
right triangle. The coefficients a(n) are computed in terms of representations of n by a ternary
quadratic form, and are conjecturally related to the order of the Tate-Shafarevitch group of the
curve above.”


For another paper on the postage stamp problem [1980, 206], see Mossige (1981). 


In [1981, 760] I mentioned that a conjecture of Hamilton & Mullen [1980, 392] had been settled 
by a succession of correspondents. Jurai Bosak writes that the result was already in his book 
(1976): see also Laywine (1981). 


Klee (1982) has settled the questions raised by Zowe [1980, 475]. In particular Zowe showed, 
for mathematical programming in infinite-dimensional vector spaces, that the Slater constraint 
qualification and a formally weaker one of Kurcyusz were equivalent in barrelled spaces, and 
asked if barrelledness is necessary. Klee has shown equivalence in a topological vector space E just
if every barrel in E is a neighborhood of the origin. Thus, if E is locally convex, the two constraint
qualifications are equivalent just if E is barrelled.


In his Zbl. review (444.05038) of Haggard’s paper [1980, 654] A. T. White observes that 
Edmonds answers Question 1 (given a connected graph G and nonnegative integer n, is there a 
2-cell imbedding of G whose geometric dual has n loops) in the affirmative when n is the number 
of edges in G, not for n = 0 as stated by Haggard. 


Jacob Beard [1980, 744], and see Beard, Doyle & Mandelberg (1980), called a prime
3-square-separable if each pair of cubic residues which are not quadratic residues is separated by a
quadratic residue. For example, 43 is 3-s.-s. because (5 is a primitive root and) $5^3 \equiv 39$, $5^9 \equiv 22$,
$5^{15} \equiv 8$, $5^{21} \equiv 42$, $5^{27} \equiv 27$, $5^{33} \equiv 2$, $5^{39} \equiv 32$ are separated by the quadratic residues (in
parentheses) in


...(1)2 (4) 8 (9) 22 (25) 27 (31) 32 (36) 39 (40) 42... 


I asked if 7 is the only 3-s.-s. prime, so the answer is “no”! However, it seems likely that 7, 19, 37 
and 43 are the only 3-s.-s. primes. Only a finite amount of calculation is needed to check this since 
the Lehmers (1962) with Mills (1963), Selfridge (1962) and Brillhart (1964) (and see Bierstedt & 
Mills (1963)) have results of the kind: “except for a finite number of primes, two consecutive 
cubic residues no larger than 77, 78 must occur.” This needs extension to “cubic residues which 
are not quadratic residues,” but presumably this is straightforward. See the Lehmers & Shanks 
(1970) and compare the argument: for a prime greater than 5, at least one of 2, 5 and 10 is a
quadratic residue, so at least one of the pairs 1, 2 or 4, 5 or 9, 10 are quadratic residues. Note also 
the Lehmer-Mills-Selfridge result that, except for a finite number of primes, three consecutive 
cubic residues no larger than 23532, 23533, 23534 must occur, and see Sahib Singh (1977). 
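The definition can be tested by brute force for small primes. The sketch below assumes that "separated" means that between any two consecutive members (in the order 1, 2, ..., p − 1) of the set of cubic residues that are not quadratic residues there lies at least one quadratic residue; it is restricted to primes p ≡ 1 (mod 3), where not every residue is a cube. The function names are illustrative.

```python
def is_3_square_separable(p):
    """Brute-force test of the 3-s.-s. property for a prime p, as read above."""
    squares = {x * x % p for x in range(1, p)}
    cubes = {pow(x, 3, p) for x in range(1, p)}
    targets = sorted(cubes - squares)  # cubic residues that are not quadratic residues
    return all(
        any(q in squares for q in range(u + 1, v))
        for u, v in zip(targets, targets[1:])
    )

def is_prime(n):
    return n > 1 and all(n % d for d in range(2, int(n ** 0.5) + 1))

found = [p for p in range(5, 500)
         if is_prime(p) and p % 3 == 1 and is_3_square_separable(p)]
print(found)
```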


The corresponding situation for quintic residues seems to be that only 11, 31, 41, 61, 71 and 101 
are 5-s.-s., but that needs more effort to check. The Lehmer-Mills bound for consecutive quintic 
residues is 7888, 7889. It seems reasonable to conjecture that there are only finitely many primes 
which are s-square-separable for each odd s. The Brillhart-Lehmer bound for consecutive septic 
(7) residues is 1649375, 1649376. The first few s-square-separable primes for 7 ≤ s ≤ 17 are:


s = 7     29, 43, 113, 127                     s = 9     19, 37, 73, 109, 181
s = 11    23, 67, 89, 199                      s = 13    53, 79, 157
s = 15    31, 61, 151, 181, 211, 241, 271      s = 17    103, 137, 307


In his review (Zbl. 463.10004), D. Wolke notes that some of the problems mentioned by
Kenneth H. Rosen [1981, 276] were treated by E. Wirsing (1980). See also a further paper of 
Elliott (1976). 


Suppose that $S = \{s_1, s_2, \ldots, s_n\}$ is a sequence of symbols, not necessarily distinct. A rind of S
is a permutation $T = \{s_{\pi(1)}, s_{\pi(2)}, \ldots, s_{\pi(n)}\}$ of a special kind in which $\pi(i+1)$ is either the Least
or the gReatest of the set $\{1, 2, \ldots, n\} \setminus \{\pi(1), \pi(2), \ldots, \pi(i)\}$, i.e., T is formed by peeling off the
Leftmost or Rightmost symbol remaining in the original sequence S. Let p(S) be the number of
distinct rinds of S. It is easy to see that $p(S) \le 2^{n-1}$ with equality just if the $s_i$ are distinct. Göbel
[1982, 113] conjectured that p(S) is a maximum when the symbols in S are grouped, i.e., when like
symbols occur contiguously. G. W. Peck (wrc) and P. J. Slater (wrc) notice that the conjecture is
false; for example p(AABBCCC) = 46, whereas p(ABBACCC) = 49. In fact, if n = 4k and each
of the symbols 1, 2, ..., 2k occurs twice, Peck observes that {1, 2, ..., k, 1, 2, ..., k, k + 1, k + 2,
..., 2k, k + 1, k + 2, ..., 2k} has more rinds than the corresponding grouped sequence. However,
this is still not best possible: p(12123434) = 104, whereas p(12213443) = 110. It seems
surprisingly difficult to say anything that’s both true and nontrivial.
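Counts of this kind are easy to explore by machine. The following is a minimal sketch that enumerates all sequences of Left/Right peelings and counts the distinct results; it assumes the reading of "rind" given above, and the function name `rinds` is illustrative.

```python
def rinds(seq):
    """Number of distinct sequences obtained by repeatedly peeling off
    the leftmost or the rightmost remaining symbol of seq."""
    results = set()

    def peel(lo, hi, taken):
        if lo > hi:
            results.add(tuple(taken))
            return
        peel(lo + 1, hi, taken + [seq[lo]])   # take the leftmost symbol
        peel(lo, hi - 1, taken + [seq[hi]])   # take the rightmost symbol

    peel(0, len(seq) - 1, [])
    return len(results)

for s in ("AABBCCC", "ABBACCC", "12123434", "12213443"):
    print(s, rinds(s))
```

The enumeration visits at most $2^n$ peelings, so it is only practical for short sequences such as those quoted in the text.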


The exhortation [1983, 35] not to try to solve certain problems may have been effective, though 
more earlier attempts have come to light. In connexion with Problem 1, on the distinctness of the 
Markov numbers, Schönheim sends a yellowing offprint of his 1956 paper.


I believe that it may be important to distinguish between Problems 2 and 3, i.e., between 
Problems E16 and E17 in Guy (1981). In the Collatz, ‘3x + 1’ type of problem there is no 
(unique) inverse function, and the logical status may be different from the ‘permutation se- 
quences’ which I attribute to Conway. I have given a brief partial history of the Collatz sequence 
[1983, 39] and Lagarias (tbp) is preparing the definitive article on the subject. Masai Yamada 
(1980) claimed to show that the sequence always collapsed to 1, but see Lagarias’s review. Enrico 
Federighi sends the sequence $a_{n+1} = a_n/2$ ($a_n$ even), $= a_n/3$ ($a_n$ an odd multiple of 3),
$= 7a_n + 1$ (otherwise). Does the 7 make it easier to prove anything? Are the cycles containing 1
and 19 the only ones? 
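Federighi's rule is simple to iterate. The sketch below follows the orbit of a starting value until it returns to that value or a step limit is reached; the step limit of 1000 and the function names are arbitrary choices made here.

```python
def step(a):
    """One step of Federighi's sequence."""
    if a % 2 == 0:
        return a // 2
    if a % 3 == 0:      # a is an odd multiple of 3
        return a // 3
    return 7 * a + 1

def cycle_through(start, max_steps=1000):
    """Return the cycle through start as a list, or None if the orbit
    does not come back to start within max_steps steps."""
    orbit = [start]
    a = step(start)
    while a != start and len(orbit) < max_steps:
        orbit.append(a)
        a = step(a)
    return orbit if a == start else None

print(cycle_through(1))
print(cycle_through(19))
```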


As stated [1983, 40], Conway has shown that the permutation sequences belong to a class of
problems which contains undecidable questions; but that is not to say you may not be able to 
decide a particular case. Such sequences appeared in problems of Klamkin (1963) and (a more 
tractable example; see Bergman (1964)) of Klamkin & Tritter (1963). Such problems were 
considered by John Conway, Alan Tritter and Mike Guy during Mike’s undergraduate years 
(1960-63). See also comments by Shanks (1965) and Atkin (1966). 


That the answer to Hartwig’s question [1983, 120] about finding an invertible linear combination
of matrices was “no for n > 3” was discovered in rapid succession by Stephen Boyack
(83:02:10), Don Coppersmith, Victor Miller & Gadiel Seroussi (83:02:17 and 18), John H.
Smith (83:02:22), Walter Sizer (83:02:23), Man-Duen Choi (83:03:09), Michael Atkinson
(83:03:10), Jean-Marie Monier (83:03:21) and presumably many others. One of the simplest
examples is a block matrix with entries $Q_1$, $Q_2$, $Q_3$, where

$$Q_1 = \begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad
Q_2 = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & -1 & 0 \end{pmatrix}, \quad
Q_3 = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ -1 & 0 & 0 \end{pmatrix}.$$

Hartwig replied to Coppersmith, Miller & Seroussi:
“The skew symmetric trick is very pretty and was found by several authors. It won’t work for a
division ring, however, as

$$U = \begin{pmatrix} 0 & \alpha & \beta \\ -\alpha & 0 & \gamma \\ -\beta & -\gamma & 0 \end{pmatrix}$$

can be invertible for $\beta\alpha^{-1}\gamma \ne \gamma\alpha^{-1}\beta$. Then Ux = 0 implies x = 0. You are right, my question
should have been: when do such scalars $a_i$ exist for a field, a division ring, etc? The real problem is
the splitting problem, which does hold for a field, but should be false for the integers.”
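The "skew symmetric trick" rests on the fact that a 3 × 3 skew-symmetric matrix over a commutative ring has determinant 0, so no linear combination of the three elementary skew-symmetric matrices is invertible. The following sketch checks this over the integers; the three matrices are written out here as an assumption about the blocks in the example, and the helper names are illustrative.

```python
from itertools import product

# The three elementary 3 x 3 skew-symmetric matrices (assumed form of the blocks).
Q1 = [[0, 1, 0], [-1, 0, 0], [0, 0, 0]]
Q2 = [[0, 0, 0], [0, 0, 1], [0, -1, 0]]
Q3 = [[0, 0, 1], [0, 0, 0], [-1, 0, 0]]

def combination(alpha, beta, gamma):
    """alpha*Q1 + beta*Q3 + gamma*Q2, i.e. the matrix U of Hartwig's reply."""
    return [[alpha * Q1[i][j] + beta * Q3[i][j] + gamma * Q2[i][j]
             for j in range(3)] for i in range(3)]

def det3(m):
    """Determinant of a 3 x 3 matrix by cofactor expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

singular = all(det3(combination(a, b, c)) == 0
               for a, b, c in product(range(-3, 4), repeat=3))
print(singular)
```

Expanding the determinant symbolically gives $-\alpha\beta\gamma + \beta\alpha\gamma$, which vanishes when the scalars commute; this is why the trick can fail over a division ring.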

Engel’s “Celtic Art” problem [1983, 122] stimulated John P. Robertson to use a computer to
find the 42 essentially different 3 × 4 matchings which lead to Hamilton cycles. There are two
different 2 × 3 matchings, each of which leads to a pattern with 4-fold rotational symmetry.
These were depicted earlier [1983, 125]. Of the eight 2 × 5 matchings which generate Hamilton
cycles, all but one have such symmetry. But none of the 42 (3 × 4)-generated patterns is
symmetrical. Engel had earlier noted that this rotational symmetry only seems to occur if mn is
singly even, not when mn is a multiple of 4. Is this a theorem, or haven’t we searched far enough?
[Stop press! Robertson writes that he has a proof.] Robertson has found symmetrical patterns
generated by 3 × 10, 5 × 6 and 6 × 7 matchings, but none of the seven 4 × 5 matchings that he
found leads to a symmetrical Hamilton circuit. Figure 2(a) is a 5 × 6 matching which gives the
symmetrical pattern shown in Figure 2(b). A reference omitted from the original article is Engel
(1980).

FIG. 2. The matching (a) generates the Hamilton circuit (b).


Emeritus Professor Lawrence Ringenberg reminds us that snowplow problems [1983, 199] go 
back at least 32 years; see Klamkin (1951) and Ringenberg (1952). 


My indebtedness to numerous correspondents is obvious. So too, I hope, is the usefulness of a 
clearing-house for information on problems which are not always quite as unsolved as we thought. 
APPLICATIONS OF A RESULT ON SPHERICAL INTEGRATION TO THE THEORY
OF CONVEX SETS 


K. J. FALCONER 
School of Mathematics, University of Bristol, University Walk, Bristol, England BS8 1TW


This note relates five results on the geometry of convex bodies by demonstrating that they are
almost immediate consequences of a theorem on the integration of functions over $\Omega$, the surface of
the unit sphere in n-dimensional Euclidean space. This basic theorem is a corollary to the classical
Funk-Hecke theorem concerning n-dimensional spherical harmonic functions. The article by
Seeley [4] provides a readable account of spherical harmonics, and more detailed theory of such
functions is given in Erdélyi [2]. Essentially the spherical harmonic functions $S_m(\theta)$ may be
thought of as the higher dimensional analogues of cos mt and sin mt in Fourier series. If $f(\theta)$ is a
reasonably behaved function on $\Omega$, then we may write $f(\theta) = \sum_{m=0}^{\infty} c_m S_m(\theta)$ where $S_m(\theta)$ is some
spherical harmonic of degree m.

If $\theta$, $\psi$ are two vectors in the (n − 1)-sphere, let $\theta\cdot\psi$ denote their dot product. The
Funk-Hecke theorem states: If g(x) is an $L^1$ function defined on the real interval [−1, 1], then


(1) $$\int_\Omega g(\theta\cdot\psi)\, S_m(\theta)\, d\theta = \lambda_m S_m(\psi)$$
for all $\psi \in \Omega$, where

(2) $$\lambda_m = a_m \int_{-1}^{1} g(x)\, \frac{d^m}{dx^m}(1 - x^2)^{m + (n-3)/2}\, dx$$


and $a_m$ is a nonzero number.


Suppose $f(\theta)$ is a continuous function such that for all $\psi \in \Omega$, $\int_\Omega g(\theta\cdot\psi) f(\theta)\, d\theta = 0$. We may
obtain a form which is more useful for our purposes by multiplying equation (1) by $f(\psi)$ and
integrating over $\Omega$ with respect to $\psi$. We get, after changing the order of integration,


$$\lambda_m \int_\Omega f(\psi)\, S_m(\psi)\, d\psi = 0$$
for every mth degree spherical harmonic $S_m(\psi)$. Thus, provided that $\lambda_m \ne 0$,


$$\int_\Omega f(\psi)\, S_m(\psi)\, d\psi = 0.$$


Then using the completeness of the spherical harmonics and the fact that mth degree spherical
harmonics are odd or even according as to whether m is odd or even we obtain the following
form:


Let g(x) be an $L^1$ function on [−1, 1] and let $\lambda_m$ be as in (2). Suppose that $f(\theta)$ is a continuous
function on $\Omega$ such that for all $\psi \in \Omega$


$$\int_\Omega g(\theta\cdot\psi)\, f(\theta)\, d\theta = 0.$$
Then


(A) if $\lambda_m \ne 0$ for all m, $f(\theta) \equiv 0$;
(B) if $\lambda_m \ne 0$ for all odd m, $f(\theta)$ is an even function;
(C) if $\lambda_m \ne 0$ for all even m, $f(\theta)$ is an odd function.


Finally for applications to convex sets we use the following special cases of the preceding form of 
the Funk-Hecke theorem. 


(i) Let $f(\theta)$ be a continuous function on $\Omega$ such that $\int_{\theta\cdot\psi>0} f(\theta)\,|\theta\cdot\psi|^k\, d\theta = 0$ for all $\psi \in \Omega$ for
some nonnegative integer k. Then if k is odd, $f(\theta)$ is an odd function, and if k is even, $f(\theta)$ is an
even function.


(ii) Let $f(\theta)$ be a continuous function on $\Omega$ such that $\int_{\theta\cdot\psi=0} f(\theta)\, d\theta = 0$ for all $\psi \in \Omega$ (integration
being with respect to the obvious invariant measure on the set $\{\theta : \theta \in \Omega$ and $\theta\cdot\psi = 0\}$). Then
$f(\theta)$ is an odd function.


(i) may be obtained by putting $g(x) = x^k$ for $0 < x < 1$ and $g(x) = 0$ otherwise. We may use
repeated integration by parts (and Leibniz's theorem) to see that we have the situation of (B)
above if $k$ is even and of (C) above if $k$ is odd. For case (ii) we take $g(x)$ to be a "delta" function
(using the usual method of approximate identities) giving

$\lambda_m = a_m \left.\frac{d^m}{dx^m}(1 - x^2)^{m+\frac{1}{2}n-\frac{3}{2}}\right|_{x=0}$

and as this is nonzero for even $m$, (C) above yields the result.
This concludes our preparatory work with the Funk-Hecke theorem. 
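
The claim used for case (ii), that the $m$th derivative of $(1 - x^2)^{m+\frac{1}{2}n-\frac{3}{2}}$ at $x = 0$ vanishes for odd $m$ and is nonzero for even $m$, can be checked exactly for small $m$ and $n$. The following is only an illustrative sketch: the function names are hypothetical, the nonzero factor $a_m$ is omitted, and the computation assumes the binomial series of $(1 - x^2)^p$, whose coefficient of $x^{2j}$ is $(-1)^j \binom{p}{j}$.

```python
from fractions import Fraction
from math import factorial


def gen_binomial(p, j):
    """Generalized binomial coefficient C(p, j) for rational p and integer j >= 0."""
    result = Fraction(1)
    for i in range(j):
        result *= (p - i)
        result /= (i + 1)
    return result


def mth_derivative_at_zero(m, n):
    """m-th derivative of (1 - x^2)^(m + n/2 - 3/2) at x = 0 (factor a_m omitted)."""
    p = Fraction(2 * m + n - 3, 2)  # the exponent m + n/2 - 3/2
    if m % 2 == 1:
        # Only even powers of x occur in the series, so odd derivatives vanish at 0.
        return Fraction(0)
    j = m // 2
    # Coefficient of x^m is (-1)^j * C(p, j); differentiating m times multiplies by m!.
    return factorial(m) * (-1) ** j * gen_binomial(p, j)


if __name__ == "__main__":
    for n in (2, 3, 4):
        print(n, [mth_derivative_at_zero(m, n) for m in range(6)])
```

Exact rational arithmetic is used so that "nonzero" is not blurred by rounding when $n$ is even and the exponent is a half-integer.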


We now deduce various results on convex bodies; all convex sets to be considered are assumed 
closed and bounded and not contained in any hyperplane. The first result gives a sufficient 
condition for a convex body to be centrally symmetric; such results are important when defining 
measures of symmetry of convex bodies. (A measure of symmetry is a function that takes value 1
on centrally symmetric sets and values strictly less than 1 on other convex sets, see Grünbaum [3].)


692 K. J. FALCONER [December 


THEOREM 1. Let X be a convex body in $R^n$, and let P be a point such that every hyperplane
through P divides X into two parts of equal n-dimensional volume. Then X is centrally symmetric with 
P as center. 


Proof. Certainly $P$ is an interior point of $X$. Let $f(\theta)$ be the defining function of $X$ with respect
to $P$, so that $(\theta, f(\theta))$ gives the boundary of $X$ in polar coordinates with $P$ as origin. Considering
the portion of $X$ cut off by the hyperplane through $P$ and perpendicular to the unit vector $\psi$, we
must have that

$\frac{1}{n}\int_{\theta\cdot\psi>0} f(\theta)^n\,d\theta = \frac{1}{2}V$

for all $\psi$, where $V$ is the volume of $X$. Thus

$\int_{\theta\cdot\psi>0} [f(\theta)^n - f(-\theta)^n]\,d\theta = 0$

for all $\psi$. By (i) above $f(\theta)^n - f(-\theta)^n$ is an even function, so as it is certainly an odd function as
well, $f(\theta) = f(-\theta)$ for all $\theta$.


Next we show that a centrally symmetric convex set is determined by the areas of its sections 
through the center of symmetry. 


THEOREM 2. Let X and Y be centrally symmetric convex bodies in R" such that the (n — 1)- 
dimensional areas of parallel sections through the centers of the bodies are equal. Then X and Y are 
translates of each other. 


Proof. Let $f(\theta)$ and $g(\theta)$ be the defining functions of the bodies with respect to their centers.
Thus

$\frac{1}{n-1}\int_{\theta\cdot\psi=0} f(\theta)^{n-1}\,d\theta = \frac{1}{n-1}\int_{\theta\cdot\psi=0} g(\theta)^{n-1}\,d\theta$

for all $\psi$. (ii) implies that $f(\theta)^{n-1} - g(\theta)^{n-1}$ is an odd function, so that as it is also an even
function (from the central symmetry of $X$ and $Y$), $f(\theta) = g(\theta)$ for all $\theta$.


Ulam [5] p. 38 mentions the following problem, as yet unsolved: Let $X$ be a convex body in $R^3$
made of a material of uniform density $\rho$ with $0 < \rho < 1$. Suppose that $X$ will float in equilibrium
in any orientation in a liquid of density 1. Must $X$ be spherical? We show that this is true in a
special case:


THEOREM 3. Let X be a centrally symmetric convex body in $R^3$ of uniform density $\frac{1}{2}$ that will float
in equilibrium in any orientation in a liquid of density 1. Then X is spherical.


Proof. Let $X$ be defined by the function $f(\theta)$, taking the center of symmetry as origin. When $X$
floats with the vector $\psi$ pointing downwards, the parts of the surface of $X$ with $\theta\cdot\psi > 0$ are
submerged. By the energy principle, if equilibrium is to be possible in all orientations, the center
of gravity of the part of $X$ below the surface level of the liquid must be at constant depth. That is

$\int_{\theta\cdot\psi>0} f(\theta)^4\,|\theta\cdot\psi|\,d\theta = c$

for all $\psi$, where $c$ is constant. Hence for some number $k$

$\int_{\theta\cdot\psi>0} [f(\theta)^4 - k]\,\theta\cdot\psi\,d\theta = 0$

for all $\psi$. By (i) this implies $f(\theta)^4 - k = 0$ for all $\theta$, making $X$ spherical.


Next we show that a symmetric convex body is determined by the perimeter of its shadow in 
every direction. 

THEOREM 4. Let X and Y be centrally symmetric convex bodies in $R^n$ ($n \ge 3$) such that for every
hyperplane H the $(n-2)$-dimensional perimeter areas of the projections of X and Y onto H are
equal. Then X and Y are translates of each other.


Proof. Let $X$ and $Y$ have support functions $h(\theta)$ and $k(\theta)$ with respect to their centers. Using
Cauchy's surface area formula (see for example Eggleston [1] p. 89) we get, considering projections
onto the hyperplane perpendicular to $\psi$,

$\int_{\theta\cdot\psi=0} h(\theta)\,d\theta = \int_{\theta\cdot\psi=0} k(\theta)\,d\theta$

for all $\psi$. We deduce from (ii) that $h(\theta) = k(\theta)$ for all $\theta$ so that $X$ and $Y$ are translates.


Finally, if $X$ is a body of constant width in $R^3$, it follows from the plane version of Cauchy's
surface area formula that all plane projections of $X$ (which must also be sets of constant width)
have equal perimeters. Our last result shows that the converse of this is also true.


THEOREM 5. Let X be a convex body in $R^3$ such that every plane projection of X has perimeter of
length k. Then X is of constant width $k/\pi$.


Proof. Let $X$ have width $w(\theta)$ in direction $\theta$. From Cauchy's surface area formula

$\int_{\theta\cdot\psi=0} w(\theta)\,d\theta = k$

for all $\psi$, giving that

$\int_{\theta\cdot\psi=0} [w(\theta) - k/\pi]\,d\theta = 0$

for all $\psi$. By (ii) we have $w(\theta) = k/\pi$ for all $\theta$, as required.


References 


1. H. G. Eggleston, Convexity, Cambridge University Press, 1969. 

2. A. Erdelyi, et al., Higher transcendental functions, vol. 2, Bateman manuscript project, New York, 1953. 

3. B. Grünbaum, Measures of symmetry for convex sets, Convexity, Proceedings of symposia in Pure
Mathematics, vol. 7, American Mathematical Society, Providence, 1963.

4. R. T. Seeley, Spherical harmonics, this MONTHLY, 73 (1966), Slaught memorial supplement, 115-121. 

5. S. M. Ulam, A collection of mathematical problems, Interscience, New York, 1960. 


ITERATING THE DERIVED SET FUNCTION 


DENIS HIGGS 
Department of Pure Mathematics, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1 


Cantor, in investigating the convergence of Fourier series, was led around 1870 to define the 
derived set (the set of all limit points) d(A) of a subset A of the real line R. He was also led to 
consider the iteration of the derived set function $d$, thereby coming to define ordinal numbers $\alpha$ in
order to define the $\alpha$th iterate $d^\alpha$ of $d$ (see [1] and [2, III]). An interesting fact, which does not
seem to have been noticed hitherto, is that $d^\alpha$ is itself the derived set function for some topology
on $R$. Indeed, for very many topological spaces $S$, it turns out that $d^\alpha$ is the derived set function
for a new topology on $S$.

This note first characterizes the topological spaces for which all the $d^\alpha$ functions are derived set
functions. Then the unfortunate extent to which iteration fails to relate well to various standard
topological notions is noted, and the "ultimate iterate" of $d$ (already considered by Cantor, for the
case $S = R$) is defined, along with a related concept.

The derived set $d(A)$ of a subset $A$ of a topological space $S$ is the set of all limit points in $S$ of
$A$, so that a point $x$ is in $d(A)$ if and only if every open set containing $x$ meets $A$ in a point $y \neq x$.
Let $S$ be any set and let $\mathcal{P}(S)$ denote the power set of $S$. A function $d: \mathcal{P}(S) \to \mathcal{P}(S)$ which is
the derived set function for some topology on $S$ will be called a derived set function on $S$. As is
well known, a topology on $S$ can always be recovered from the associated derived set function $d$
since the closure of a subset $A$ of $S$ is just $A \cup d(A)$. For this reason we sometimes refer to a
topological space as a pair $(S, d)$ where $d$ is a derived set function on $S$. Derived set functions
may be characterized as follows (Dugundji [3, p. 73]; similar characterizations are given by Harvey
[4], Thron [7, p. 53], and others):


PROPOSITION 1. A function $d: \mathcal{P}(S) \to \mathcal{P}(S)$ is a derived set function on S if and only if


(i) $d(\emptyset) = \emptyset$ and $d(A \cup B) = d(A) \cup d(B)$ for all $A, B \in \mathcal{P}(S)$,
(ii) $x \notin d(x)$ for all $x \in S$,
(iii) $d^2(A) \subseteq A \cup d(A)$ for all $A \in \mathcal{P}(S)$.


(Here $d(x)$ should really be $d(\{x\})$, and $d^2(A)$ means $d(d(A))$.)
The $\alpha$th iterate $d^\alpha: \mathcal{P}(S) \to \mathcal{P}(S)$ of a derived set function $d$ on $S$ is defined by transfinite
recursion on $\alpha$ as follows:


$d^1(A) = d(A), \qquad d^{\alpha+1}(A) = d(d^\alpha(A)), \qquad d^\lambda(A) = \bigcap_{\alpha<\lambda} d^\alpha(A),$


where $A$ is any member of $\mathcal{P}(S)$, $\alpha$ denotes a nonzero ordinal, and $\lambda$ denotes a limit ordinal.

In order to determine for which derived set functions $d$ every $d^\alpha$ is also a derived set function,
consider first the case $\alpha = 2$. In view of condition (ii) above, a necessary condition for $d^2$ to be a
derived set function is that $x \notin d^2(x)$ for all $x \in S$. Now $x \notin d^2(x)$ if and only if $d^2(x) \subseteq d(x)$,
that is, if and only if $d(x)$ is closed. So our necessary condition is that $d(x)$ is closed for all $x \in S$
and this is just the $T_D$ separation axiom of Aull and Thron (see [7, p. 92]; $T_D$ is not strong, being
intermediate to $T_0$ and $T_1$). From [7, p. 93] we have:


PROPOSITION 2. A function $d: \mathcal{P}(S) \to \mathcal{P}(S)$ is a $T_D$ derived set function (that is, the derived set
function for a $T_D$ topology) if and only if (i) and (ii) above hold and also


(iii)$_D$ $d^2(A) \subseteq d(A)$ for all $A \in \mathcal{P}(S)$.


We now use this result to show that if $d$ is a $T_D$ derived set function, then so is every $d^\alpha$. For
functions $f: \mathcal{P}(S) \to \mathcal{P}(S)$ the notation $f_1 \le f_2$ will be used to mean that $f_1(A) \subseteq f_2(A)$ for all
$A \in \mathcal{P}(S)$.


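THEOREM. Let d be a $T_D$ derived set function on a set S. Then $d^\alpha$ is a $T_D$ derived set function on S for
all nonzero ordinals $\alpha$. Moreover, $d^\beta \le d^\alpha$ for $\alpha \le \beta$, and $d^{\alpha+\beta} = d^\beta d^\alpha$, $d^{\alpha\beta} = (d^\alpha)^\beta$ for all $\alpha$
and $\beta$.


Proof. Let $P(\alpha)$ be the assertion that $d^{\alpha+1} \le d^\alpha$. Then $P(1)$ holds since $d$ is $T_D$, $P(\alpha)$ clearly
implies $P(\alpha + 1)$, and if $P(\alpha)$ holds for all $\alpha < \lambda$, then for any $A \in \mathcal{P}(S)$ we have


$d^{\lambda+1}(A) = d(d^\lambda(A)) = d\bigl(\bigcap_{\beta<\lambda} d^\beta(A)\bigr) \subseteq d(d^\alpha(A)) = d^{\alpha+1}(A) \subseteq d^\alpha(A)$


for all $\alpha < \lambda$, from which $P(\lambda)$ follows. Therefore $P(\alpha)$ holds for all $\alpha$ by transfinite induction on
$\alpha$. An even simpler induction on $\beta$ now shows that $d^\beta \le d^\alpha$ for $\alpha \le \beta$.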

Let $Q(\alpha)$ state that $d^\alpha$ satisfies (i) above. $Q(1)$ is certainly true and $Q(\alpha)$ clearly implies
$Q(\alpha + 1)$. Suppose $Q(\alpha)$ holds for all $\alpha < \lambda$. Then $Q(\lambda)$ states that


$\bigcap_{\alpha<\lambda} \bigl(d^\alpha(A) \cup d^\alpha(B)\bigr) = \bigcap_{\alpha<\lambda} d^\alpha(A) \,\cup\, \bigcap_{\alpha<\lambda} d^\alpha(B)$


and it is clear that the right-hand side is contained in the left. To show that the other containment
also holds, let $x$ be outside the right-hand side, so that $x \notin d^\alpha(A)$ and $x \notin d^\beta(B)$ for some $\alpha, \beta < \lambda$ where
$\alpha \le \beta$, say. Then $x \notin d^\beta(A) \cup d^\beta(B)$ (since $d^\beta \le d^\alpha$) and hence $x$ is outside the left-hand side. So $Q(\lambda)$ holds, and we
therefore have $Q(\alpha)$ for all $\alpha$, that is, $d^\alpha$ satisfies condition (i) for all $\alpha$.

Since $d^\alpha(x) \subseteq d(x)$, $d^\alpha$ satisfies (ii) for all $\alpha$.

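Let $R(\alpha)$ state that $d^\alpha$ satisfies (iii)$_D$. Then $R(1)$ holds. If $R(\alpha)$ holds, then for any $A \in \mathcal{P}(S)$
we have


$(d^{\alpha+1})^2(A) = d(d^\alpha(d^{\alpha+1}(A))) \subseteq d(d^\alpha(d^\alpha(A))) \subseteq d(d^\alpha(A)) = d^{\alpha+1}(A)$


so that $R(\alpha + 1)$ holds (the first inclusion here uses $P(\alpha)$ and the fact that $A \subseteq B$ implies
$d^\alpha(A) \subseteq d^\alpha(B)$ which we have on account of $Q(\alpha)$). If $R(\alpha)$ holds for all $\alpha < \lambda$ then


$(d^\lambda)^2(A) = \bigcap_{\alpha<\lambda} d^\alpha(d^\lambda(A)) \subseteq \bigcap_{\alpha<\lambda} d^\alpha(d^\alpha(A)) \subseteq \bigcap_{\alpha<\lambda} d^\alpha(A) = d^\lambda(A)$


for all $A \in \mathcal{P}(S)$ so that $R(\lambda)$ holds. Hence $d^\alpha$ satisfies condition (iii)$_D$ for all $\alpha$.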

It only remains to show the two identities involving ordinal addition and multiplication and we 
omit the routine arguments. 

An iterate $d^\alpha$ with $\alpha > 2$ of a derived set function $d$ can itself be a derived set function without
$d$ being $T_D$. Indeed, the two-point indiscrete space provides a simple instance. It can be shown
that the following three conditions are equivalent for any derived set function d: d° is a derived 
set function; x € d3(x) for all x; and d?(A) € d(A) for all A. I do not know if this generalizes. 
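
For a finite space the conditions of Proposition 1 can be checked by brute force, which makes the two-point indiscrete example concrete. The sketch below is illustrative only: a topology is represented (by assumption) as a list of open sets given as frozensets, and the helper names `derived_set`, `iterate`, `is_derived_set_function` and `is_T_D` are hypothetical.

```python
from itertools import combinations


def powerset(points):
    """All subsets of a finite set, as frozensets."""
    pts = sorted(points)
    return [frozenset(c) for r in range(len(pts) + 1) for c in combinations(pts, r)]


def derived_set(A, opens):
    """Limit points of A: those x for which every open set containing x meets A - {x}."""
    space = frozenset().union(*opens)
    return frozenset(
        x for x in space
        if all(U & (A - {x}) for U in opens if x in U)
    )


def iterate(A, opens, k):
    """Apply the derived set function k times (finite iterates only)."""
    for _ in range(k):
        A = derived_set(A, opens)
    return A


def is_derived_set_function(opens, k):
    """Check conditions (i)-(iii) of Proposition 1 for the k-th iterate."""
    space = frozenset().union(*opens)
    subsets = powerset(space)
    dk = {A: iterate(A, opens, k) for A in subsets}
    cond_i = dk[frozenset()] == frozenset() and all(
        dk[A | B] == dk[A] | dk[B] for A in subsets for B in subsets
    )
    cond_ii = all(x not in dk[frozenset({x})] for x in space)
    cond_iii = all(dk[dk[A]] <= A | dk[A] for A in subsets)
    return cond_i and cond_ii and cond_iii


def is_T_D(opens):
    """T_D: d({x}) is closed for every point x, i.e. d(d({x})) is contained in d({x})."""
    space = frozenset().union(*opens)
    return all(
        derived_set(derived_set(frozenset({x}), opens), opens)
        <= derived_set(frozenset({x}), opens)
        for x in space
    )


S = frozenset({0, 1})
indiscrete = [frozenset(), S]                    # two-point indiscrete space
sierpinski = [frozenset(), frozenset({1}), S]    # a small T_D space for comparison

if __name__ == "__main__":
    for k in (1, 2, 3):
        print(k, is_derived_set_function(indiscrete, k))
    print(is_T_D(indiscrete), is_T_D(sierpinski))
```

In the indiscrete space $d$ swaps the two singletons, so $d^2$ violates condition (ii) while $d^3 = d$ satisfies all three conditions even though $d$ is not $T_D$.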

Now we turn to the bad behaviour of iteration. To observe this it is already sufficient in most
cases to look at $d^2$ where $d$ is the usual derived set function on $R$. First note that for $x \in R$ and
$A \in \mathcal{P}(R)$ we have $x \in d^2(A)$ if and only if $A \cap \{y : 1/(m + 1) < |y - x| < 1/m\}$ is infinite for
infinitely many $m \in N = \{1, 2, 3, \ldots\}$. It follows that a basis of $d^2$-neighbourhoods of $x$ is
obtained by taking the sets of the form $[x - (1/m), x + (1/m)] \setminus \{x_n : n \in N\}$ where $m \in N$ and
$x_n$, $n \in N$, is a sequence of points such that $x_n \neq x$ for each $n$ and $\lim_{n\to\infty} x_n = x$. What is
relevant here is that $R$ is a locally compact metric space; thus a similar description of $d^2$ and
$d^2$-neighbourhoods, and many of the remarks here, are valid for any such space.

How do various topological properties of $(R, d)$ fare in passing to $(R, d^\alpha)$? It is easy to see
from the above description of $d^2$-neighbourhoods that $(R, d^2)$ is neither regular, nor locally
compact, nor locally connected. $(R, d^\alpha)$ is Hausdorff for all $\alpha$, this simply being a consequence of
the fact that Hausdorffness is expansion-invariant, that is, is preserved by enlarging the topology
(since $d^\alpha \le d$, the $d^\alpha$-topology is an expansion of the $d$-topology). Since nonconnectedness is
expansion-invariant, it is more surprising that $(R, d^\alpha)$ is connected for all $\alpha$, a fact which can
perhaps best be seen to hold by showing, with the aid of the Cantor-Bendixson Theorem (see [2]),
that $(R, d^\infty)$ is connected, where $d^\infty$ is the ultimate iterate of $d$ defined below. However, this
fact is somewhat special to $(R, d)$ and in general connectedness is lost in passing from $(S, d)$ to
$(S, d^2)$.

Iteration does not restrict nicely to subspaces. If $T$ is a subset of a space $(S, d)$, let $d_T$ denote
the derived set function on $T$ with the subspace topology. Then for most $T_D$ spaces, it is not the
case that $(d_T)^\alpha = (d^\alpha)_T$ for all $T$ and $\alpha$, a counterexample being given by taking the usual $d$ on $R$
with


T={—+—:m,nen} U {0}; 
m n 


then $0 \in (d^2)_T(T)$ but $0 \notin (d_T)^2(T)$. It is easily verified that we always have $(d_T)^\alpha \le (d^\alpha)_T$,
with equality holding if $T$ is either open or closed. The $T_D$ spaces in which $(d_T)^\alpha = (d^\alpha)_T$ for all $T$
and $\alpha$ turn out to coincide with those for which $d$ is "hereditarily idempotent" (see the remarks on
the ultimate iterate below; such spaces are discussed in [6]).

Iteration does not behave well on products. Let $d_1 \times d_2$ denote the derived set function on a
product space $(S_1, d_1) \times (S_2, d_2)$. Then in general $(d_1 \times d_2)^\alpha \neq d_1^\alpha \times d_2^\alpha$: if we take the usual $d$
on $R$ and let $A = \{(1/m, 1/n) : m, n \in N\}$, then $(d \times d)^2(A) = \{(0, 0)\}$ but $(d^2 \times d^2)(A) = \emptyset$.

Iteration does not behave well with respect to continuous functions. Let $f: R \to R$ be the
function whose graph is shown in Fig. 1. Then $f$ is continuous with respect to the usual $d$ but not
with respect to $d^2$, as can be seen from the fact that if

l l 
A -uUlx. 2n — | 

then $0 \in d^2(A)$ but $f(0) \notin f(A) \cup d^2(f(A))$. Something can be salvaged here nevertheless: say
that a function $f: S_1 \to S_2$ is strictly continuous with respect to derived set functions $d_1$, $d_2$ on
$S_1$, $S_2$ respectively if $x \in d_1(A)$ implies $f(x) \in d_2(f(A))$ for all $x \in S_1$ and $A \in \mathcal{P}(S_1)$. It is easy
to see that if $f$ is strictly continuous with respect to $d_1$, $d_2$ and $d_1$, $d_2$ are $T_D$, then $f$ is strictly
continuous with respect to $d_1^\alpha$, $d_2^\alpha$ for all $\alpha$.


FIG. 1


Although $(R, +)$ is a topological group for the (usual) $d$-topology, this is no longer true for the
$d^2$-topology:

$U = [-1, 1] \setminus \{1/n : n \in N\}$

is a $d^2$-neighbourhood of 0 but there is no $d^2$-neighbourhood $V$ of 0 such that $V + V \subseteq U$, this
being because $V + V \subseteq U$ implies that $V \times V$ is disjoint from each of the diagonal lines
$x + y = 1/n$, an impossibility.

We close by mentioning the ultimate iterate $d^\infty$ of a $T_D$ derived set function $d$ on a set $S$. Let
$|\mathcal{P}(S)^{\mathcal{P}(S)}| = \kappa$; then since $d^\alpha$ decreases with increasing $\alpha$ and has at most $\kappa$ distinct values, there
must be some ordinal $\rho$ no larger in cardinality than $\kappa$ such that $d^\alpha = d^\rho$ for all $\alpha \ge \rho$. Let us call
the smallest such $\rho$ the rank $\rho(d)$ of $d$. The fact that every nonzero ordinal occurs as the rank of
some $d$ is a consequence of the following result, due essentially to Hausdorff [5, p. 280].


PROPOSITION 3. If an ordinal $\sigma > 1$ is given the order topology, then the rank of the associated d
is the smallest ordinal $\rho$ such that $\omega^\rho \ge \sigma$.


In order to describe $d^\infty$ without reference to iteration (and also to describe a related concept),
we need the following definitions. Let $(S, d)$ be any topological space. Then $d$ is idempotent if
$d^2 = d$ and hereditarily idempotent if $(d_T)^2 = d_T$ for all $T \subseteq S$. For $A \in \mathcal{P}(S)$, let $c(A)$ denote
the closure of $A$ and let $p(A)$ denote the largest dense-in-itself subset of $A$ (a subset $B$ of $S$ is
dense-in-itself if $d(B) \supseteq B$). For $(S, d)$ a $T_D$ space, $d^\infty$ is clearly the largest idempotent derived
set function $\le d$. Also, as was essentially known to Cantor and as is in any case easily verified, we
have $d^\infty = pc$ (where $pc$ means "$c$ first, then $p$").

In a certain sense, $d^\infty$ is not quite ultimate: although idempotent, it is not hereditarily
idempotent in general (for instance, it is not in the case of the usual $d$ on $R$). All the same, there
does exist a largest hereditarily idempotent derived set function $\le d$, namely $cp$. In fact, $cp$ is a
hereditarily idempotent derived set function on any 7, space S. One way to see this is as follows. 
First note that the scattered subsets of $S$ form an ideal $\mathcal{S}$ ($A \subseteq S$ is scattered if and only if it
contains no nonempty dense-in-itself set). Hence $\mathcal{S}$ may be minimally adjoined to the stock of
closed subsets of $S$ by means of the "localization" construction described by Vaidyanathaswamy
[8, p. 171 et seq.]. Now, as Vaidyanathaswamy proves, the derived set function of the augmented
topology is just $cp$ [8, p. 183]. Moreover, since $\mathcal{S}$ is an adherence-ideal in the terminology of [8]
(see pp. 177 and 183), it follows from results of [6] that $cp$ is hereditarily idempotent and is
furthermore the largest hereditarily idempotent derived set function $\le d$.


CROSS PRODUCTS OF VECTORS IN HIGHER DIMENSIONAL EUCLIDEAN SPACES


W. S. MASSEY 
Department of Mathematics, Yale University, New Haven, CT 06520 


Several times I have had students in undergraduate mathematics courses ask me the following 
question: Can one define a cross product of vectors in Euclidean n-space for n > 3 so that it will 
have properties similar to the usual cross product of vectors in 3-space? Of course the answer to 
this question will probably depend on which properties of the usual cross product one requires to 
hold in n-space; it is conceivable that there will be many different answers, depending on which 
properties are required to hold. 

Fortunately the situation is not quite as chaotic as the foregoing sentences might suggest. If one
requires only three basic properties of the cross product, properties which are explained in
practically all undergraduate textbooks that discuss vector analysis, it turns out that a cross
product of vectors exists only in 3-dimensional and 7-dimensional Euclidean space. To the best of
the author's knowledge, the only textbook which contains a discussion of this fact is Hilton [8].

The purpose of this note is to give an explanation of this result which will be accessible to the
average reader of this MONTHLY. We will actually give two theorems on this subject: the first uses
purely algebraic techniques, and is based on a famous theorem proved by A. Hurwitz in 1898. The
second theorem gives a stronger result; it depends on a deep theorem proved by J. F. Adams in
1958. At the end of the paper we discuss some other results in this area.

Our notation is standard: $R^n$ denotes the real vector space consisting of $n$-tuples of real
numbers,

$x \cdot y = \sum_{i=1}^{n} x_i y_i$

is the dot product of $x = (x_1, \ldots, x_n)$ and $y = (y_1, \ldots, y_n)$, and

$|x| = (x \cdot x)^{1/2}$

denotes the norm or length of the vector $x$.


THEOREM I. Assume $n \ge 3$ and a cross product is defined which assigns to any two vectors
$v, w \in R^n$ a vector $v \times w \in R^n$ such that the following three properties hold:


(a) $v \times w$ is a bilinear function of $v$ and $w$.
(b) The vector $v \times w$ is perpendicular to both $v$ and $w$, i.e., $(v \times w)\cdot v = (v \times w)\cdot w = 0$.
(c) $|v \times w|^2 = |v|^2 |w|^2 - (v \cdot w)^2$.


Then n = 3 or 7.


REMARK. Note that condition (c) is the usual condition that the length of $v \times w$ shall be equal
to the area of the parallelogram spanned by $v$ and $w$.


Proof. The proof consists in showing that a cross product defined on $R^n$ and having the three
properties listed above implies the existence of a bilinear multiplication on $R^{n+1}$ which has very
special properties. We will consider $R^{n+1}$ as an orthogonal direct sum:

$R^{n+1} = R^1 \oplus R^n.$

Thus an element of $R^{n+1}$ consists of an ordered pair $(a, v)$, where $a$ is a real number and $v \in R^n$.
The required product is defined by the following formula:

(1) $(a, v)(b, w) = (ab - v \cdot w,\; aw + bv + v \times w).$

This multiplication is obviously bilinear, and $(1, 0)$ is a 2-sided unit. An easy computation using
properties (b) and (c) shows that the norm of the product of two elements of $R^{n+1}$ is given by the
following formula:

(2) $|(a, v)(b, w)|^2 = |(a, v)|^2\,|(b, w)|^2.$
Now this is exactly the situation considered by A. Hurwitz [4] in 1898. Hurwitz proved that if we
have a bilinear multiplication with a unit defined on $R^q$ such that the norm of the product of two
vectors is the product of the norms (condition (2) above), then $q$ must be 1, 2, 4, or 8, and the
multiplication is isomorphic to that of the real numbers, the complex numbers, the quaternions, or
the octonions of Cayley and Graves. For a lucid exposition of a modern version of this theorem of
Hurwitz, see Jacobson [5, pp. 417-427].

Note that the uniqueness assertion of Hurwitz's theorem shows that conditions (a), (b), and (c) of
Theorem I characterize the cross products on $R^3$ and $R^7$ uniquely up to isomorphism.

The interested reader is referred to pp. 408-409 of a paper by E. Calabi [2] for a list of
additional properties of the cross product in $R^7$ and an actual multiplication table for this cross
product in terms of an orthonormal basis of $R^7$.
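
Properties (b) and (c) and the norm identity (2) can be checked numerically for one concrete cross product on $R^7$. The sketch below is an illustration, not the table from Calabi's paper: it assumes the common convention $e_i \times e_{i+1} = e_{i+3}$ (indices mod 7, together with cyclic permutations of each triple and antisymmetry), and the names `cross7`, `dot` and `hurwitz_product` are hypothetical.

```python
import random

# Oriented triples (i, i+1, i+3) mod 7, 0-indexed: e_i x e_{i+1} = e_{i+3}.
TRIPLES = [(i, (i + 1) % 7, (i + 3) % 7) for i in range(7)]


def build_table():
    """Map each ordered pair (i, j), i != j, to (k, sign) with e_i x e_j = sign * e_k."""
    table = {}
    for a, b, c in TRIPLES:
        for i, j, k in ((a, b, c), (b, c, a), (c, a, b)):
            table[(i, j)] = (k, 1)
            table[(j, i)] = (k, -1)
    return table


TABLE = build_table()


def dot(v, w):
    return sum(a * b for a, b in zip(v, w))


def cross7(v, w):
    """Bilinear extension of the basis table to vectors in R^7."""
    out = [0] * 7
    for (i, j), (k, sign) in TABLE.items():
        out[k] += sign * v[i] * w[j]
    return out


def hurwitz_product(x, y):
    """Formula (1): (a, v)(b, w) = (ab - v.w, aw + bv + v x w) on R^8."""
    a, v = x[0], x[1:]
    b, w = y[0], y[1:]
    c = cross7(v, w)
    return [a * b - dot(v, w)] + [a * wi + b * vi + ci for wi, vi, ci in zip(w, v, c)]


if __name__ == "__main__":
    random.seed(1)
    v = [random.randint(-5, 5) for _ in range(7)]
    w = [random.randint(-5, 5) for _ in range(7)]
    c = cross7(v, w)
    print("v x w =", c)
    print("perpendicular:", dot(c, v) == 0 and dot(c, w) == 0)
    print("area identity:", dot(c, c) == dot(v, v) * dot(w, w) - dot(v, w) ** 2)
```

Integer vectors are used so that every identity is tested exactly rather than up to floating-point tolerance.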


REMARK. The multiplication given by formula (1) has evidently been known for a long time. In
1942 B. Eckmann referred to it as "einer bekannten, elementaren Konstruktion" (see [3, p. 338]).

In our next theorem we show that we can significantly weaken conditions (a) and (c) of 
Theorem I without altering the conclusion. 


THEOREM II. Assume $n \ge 3$ and that a cross product is defined which assigns to any two vectors
$v, w \in R^n$ a vector $v \times w$ such that the following three properties hold:


(a) $v \times w$ is a continuous function of the ordered pair $(v, w)$.
(b) The vector $v \times w$ is perpendicular to both $v$ and $w$, i.e., $(v \times w)\cdot v = (v \times w)\cdot w = 0$.
(c) If $v$ and $w$ are linearly independent, then $v \times w \neq 0$.


Then n = 3 or 7.


Proof. For any vectors $v, w \in R^n$, let

$A(v, w) = [\,|v|^2 |w|^2 - (v \cdot w)^2\,]^{1/2}.$

Then $A(v, w)$ is equal to the area of the parallelogram spanned by $v$ and $w$; it is obviously a
continuous function of the ordered pair $(v, w)$. Using this area function, we define a function

$f: R^n \times R^n \to R^n$

by the formula

$f(v, w) = \begin{cases} \dfrac{A(v, w)}{|v \times w|}\,(v \times w) & \text{if } v \times w \neq 0, \\ 0 & \text{if } v \times w = 0. \end{cases}$

We assert that the function $f$ thus defined is continuous. To prove this, it suffices to prove that if
$(v_k, w_k)$ is any infinite sequence of pairs of vectors such that

$\lim_{k\to\infty} (v_k, w_k) = (v_0, w_0),$

then

$\lim_{k\to\infty} f(v_k, w_k) = f(v_0, w_0).$

There are various cases to consider, depending on the two cases in the definition of f(v, w), but 
the details are completely elementary. 
Note that the function f thus defined satisfies the following two conditions: 


(3) $f(v, w)\cdot v = f(v, w)\cdot w = 0,$

(4) $|f(v, w)|^2 = |A(v, w)|^2 = |v|^2 |w|^2 - (v \cdot w)^2.$

Exactly as before, we may consider $R^{n+1}$ as the direct sum $R^1 \oplus R^n$, and define a function

$\mu: R^{n+1} \times R^{n+1} \to R^{n+1}$

by the formula

(5) $\mu[(a, v), (b, w)] = (ab - v \cdot w,\; aw + bv + f(v, w))$

(compare with (1)). Then $\mu$ is obviously continuous and $(1, 0)$ is a 2-sided unit in the sense that

$\mu[(1, 0), (a, v)] = \mu[(a, v), (1, 0)] = (a, v).$

Finally, we have the analog of formula (2):

(6) $|\mu(x, y)|^2 = |x|^2\,|y|^2$

for any $x, y \in R^{n+1}$. Now let $S^n$ denote the unit $n$-dimensional sphere:

$S^n = \{x \in R^{n+1} : |x| = 1\}.$


In view of formula (6), we see that if $x$ and $y$ belong to $S^n$, then $\mu(x, y) \in S^n$ also. Thus $\mu$ defines
a continuous multiplication with a 2-sided unit on the $n$-sphere $S^n$.

This raises the following question: For what values of $n$ does the $n$-sphere admit a continuous
multiplication with a 2-sided unit? This was a famous problem for many years in algebraic
topology. It was finally resolved by Frank Adams in 1958 (see [1]). The answer is that such a
continuous multiplication exists on $S^n$ only in the cases $n = 1$, 3, and 7. Examples of such
multiplications arise from the multiplication of complex numbers, quaternions, and the
Cayley-Graves octonions respectively, restricted to the unit sphere.

Thus we see that by referring to this theorem of Adams we can complete the proof of Theorem 
II. 

We will conclude this note by considering other possible ways to generalize the definition of
the cross product to higher dimensional Euclidean spaces. Let $v$ and $w$ be vectors in $R^3$; recall the
formula for the components of $v \times w$ in terms of the components of $v$ and $w$. According to this
formula, if

$v = (v_1, v_2, v_3)$

$w = (w_1, w_2, w_3)$

then the $k$th component of $v \times w$ is the determinant of the $2 \times 2$ submatrix of the matrix

$\begin{pmatrix} v_1 & v_2 & v_3 \\ w_1 & w_2 & w_3 \end{pmatrix}$

obtained by striking out the $k$th column. To be correct, we must multiply this determinant by
$(-1)^{k+1}$.

Analogously, in $R^n$ we can define a cross product $v_1 \times v_2 \times \cdots \times v_{n-1}$ of any ordered
$(n-1)$-tuple of vectors by a similar process. Form a matrix whose successive rows are the vectors
$v_1, v_2, \ldots, v_{n-1}$. The $k$th component of $v_1 \times v_2 \times \cdots \times v_{n-1}$ is $(-1)^{k+1}$ times the determinant of
the submatrix obtained by deleting the $k$th column. This generalized cross product enjoys many of
the familiar properties of the cross product in 3-space: It is a multilinear, skew symmetric
function. The norm of the product, $v_1 \times v_2 \times \cdots \times v_{n-1}$, is the $(n-1)$-dimensional volume (or
measure) of the parallelopiped spanned by the vectors $v_1, \ldots, v_{n-1}$. The vector $v_1 \times v_2 \times \cdots \times v_{n-1}$
is perpendicular to $v_k$ for $k = 1, 2, \ldots, n - 1$.
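
The determinant construction just described can be sketched in a few lines. This is an illustration under stated assumptions: vectors are lists of integers, the sign is taken as $(-1)^{k+1}$ for the 1-indexed component $k$ (which reproduces the usual cross product in $R^3$), and the function names `det` and `generalized_cross` are hypothetical.

```python
def det(M):
    """Determinant by Laplace expansion along the first row (exact for integers)."""
    if not M:
        return 1  # determinant of the empty 0 x 0 matrix
    total = 0
    for j, entry in enumerate(M[0]):
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def generalized_cross(vectors):
    """Cross product of n-1 vectors in R^n via signed (n-1) x (n-1) minors."""
    n = len(vectors) + 1
    assert all(len(v) == n for v in vectors), "need n-1 vectors of length n"
    components = []
    for k in range(n):
        # k is 0-indexed here, so (-1)**k equals (-1)**(k+1) for the 1-indexed component.
        sub = [list(v[:k]) + list(v[k + 1:]) for v in vectors]
        components.append((-1) ** k * det(sub))
    return components


def dot(v, w):
    return sum(a * b for a, b in zip(v, w))


if __name__ == "__main__":
    vs = [[1, 2, 3, 4], [0, 1, -1, 2], [2, 0, 1, 1]]
    c = generalized_cross(vs)
    print("product:", c)
    print("dot products with the inputs:", [dot(c, v) for v in vs])
```

Perpendicularity holds because the dot product of the result with $v_i$ is the expansion of an $n \times n$ determinant with a repeated row; the squared norm equals the Gram determinant of the inputs, which is the squared volume mentioned above.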

This raises the following question: given integers $k$ and $n$ such that $2 < k < n - 1$, can we
define a cross product of any $k$-tuple of vectors $v_1, v_2, \ldots, v_k$ in $R^n$ having similar properties? To
make the question precise, let us demand the following properties, similar to those in Theorem II:

(a) $v_1 \times v_2 \times \cdots \times v_k$ is a continuous function of the ordered $k$-tuple $(v_1, \ldots, v_k)$.

(b) $(v_1 \times v_2 \times \cdots \times v_k)\cdot v_i = 0$ for $i = 1, 2, \ldots, k$.

(c) If the vectors $v_1, v_2, \ldots, v_k$ are linearly independent, then $v_1 \times v_2 \times \cdots \times v_k \neq 0$.


The answer, strangely enough, is that such a cross product does not exist, with a single exception:
$n = 8$ and $k = 3$. For the proof of this, the reader is referred to a paper by George Whitehead [6];
for explicit formulas for a cross product in this case, see Zvengrowski [7].

Another property of the cross product of vectors in 3-space is the following: For any rotation $r$
(i.e., orthogonal transformation of determinant +1) of 3-space and vectors $v$ and $w$,


(7) $r(v \times w) = (rv) \times (rw).$


(Note that this equation is not true if $r$ is an orthogonal transformation of determinant $-1$.) One
can now prove the following:


PROPOSITION. Assume that n > 2 and a cross product is defined in $R^n$ which is bilinear
and satisfies equation (7) for any rotation of $R^n$; then n = 3.

The proof depends on a knowledge of the real representations of the special orthogonal group
$SO(n)$; we do not have space in this note to go into details.

This preservation of the cross product by rotations, expressed by equation (7), is less well 
known than the usual properties which are treated in our first theorem. It is normally only treated 
in advanced texts in theoretical physics or geometry. 


A "COUNTEREXAMPLE" FOR THE SCHWARZ-CHRISTOFFEL TRANSFORM


ELGIN JOHNSTON
Department of Mathematics, Iowa State University, Ames, IA 50011


The Schwarz-Christoffel Transform is a formula for a one-to-one analytic function that maps 
the upper half of the complex plane onto the inside of a polygon. As such the Schwarz-Christoffel 
Transform can be used to translate problems set in polygonal domains to more manageable 
problems set in the upper half plane. Such applications of the Schwarz-Christoffel Transform are 
common in problems involving two-dimensional flows, diffusion, potentials, etc. (see [1], [3], [5]). 

A statement of the Schwarz-Christoffel Transform can be found in most text books on
elementary complex variables. One such statement is [4, p. 178]: The functions $w = F(z)$ that map
the upper half plane conformally onto polygons with interior angles $\pi\alpha_k$ $(k = 1, 2, \ldots, n)$ are of the
form

(1) $F(z) = A \int_0^z \prod_{k=1}^{n} (\xi - x_k)^{-\beta_k}\,d\xi + B$

where $x_1 < x_2 < \cdots < x_n$ are points on the real axis, $\beta_k = 1 - \alpha_k$ $(k = 1, 2, \ldots, n)$ and $A$, $B$ are
complex constants. Since the sum of the exterior angles of a polygon is $2\pi$, we have:

(2) $\beta_1 + \beta_2 + \cdots + \beta_n = 2$ with $-1 \le \beta_k \le 1$.

(The polygons with one or more $\beta_k = \pm 1$ are those with some vertices at $\infty$ and/or some interior
angles of $2\pi$.)

It is well known that conditions (1) and (2) are necessary for F(z) to map the upper half plane 
conformally onto a polygon. However it is rarely stressed that these conditions are not sufficient 
for such a mapping and there appear to be few (if any) examples to this effect. In this paper we 
produce an example of a function F(z) that satisfies (1) and (2) but is not one-to-one in the upper 
half plane. We start with a third condition (involving the choice of the x,’s) that is necessary for 
F(z) to be univalent. 

The following theorem gives some guidelines for the choice of the real numbers x, mentioned 
in the statement of the Schwarz-Christoffel Transform. 


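THEOREM. Let $\{\beta_k\}_{k=1}^{n}$ be given with $-1 \le \beta_k \le 1$ and let $x_1 < x_2 < \cdots < x_n$ be real. If the
function $F(z)$ defined by (1) is univalent in the upper half plane, then

(3) $\left|\sum_{k=1}^{n} \frac{\beta_k}{z_0 - x_k}\right| \le \frac{3}{\operatorname{Im} z_0}$

whenever $\operatorname{Im} z_0 > 0$.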


The theorem is an immediate consequence of the following lemma which gives a necessary 


Etter. Benjamin/Cummings Pub, 1983, xv + 357 pp, (P). [ISBN: 0-8053-2520-4] Chapter formats are
very attractive. Each begins with the statement of an interesting, even unusual for a beginning
FORTRAN text, applied problem whose programmed solution is given within the chapter. Each chapter
contains summaries, key words and debugging aids sections, style and technique guides, sets of
problems. Two appendices: summary of FORTRAN 77 statements; and a list of FORTRAN 77 intrinsic
functions. Glossary of key words. Answers to selected problems. Index. RJA


Computer Programming, T(13-18: 1), S. Programming in Modula-2, Second, Corrected Edition. Niklaus
Wirth. Texts and Mono. in Comp. Sci. Springer-Verlag, 1983, 176 pp, $13.95. [ISBN: 0-387-12206-0]
Provides an introduction to programming in general as well as a manual for learning Modula-2. Two
appendices and an index (First Edition, TR, June-July 1983). RJA


Computer Science, S(16-18), P, L. Computing Skills and the User Interface. Ed: M.J. Coombs, J.L.
Alty. Computers & People Ser., No. 3. Academic Pr, 1981, xii + 499 pp, $46. [ISBN: 0-12-186520-7]
Addresses problems associated with communication between man and computer. Divided into three
parts: (1) needs of computer users; (2) nature and acquisition of computing skills; (3) design of
the user interface. Index. RJA


Computer Science, P. Lecture Notes in Computer Science-147: RIMS Symposia on Software Science and
Engineering. Ed: Eiichi Goto, et al. Springer-Verlag, 1983, 232 pp, $11.50 (P). [ISBN: 0-387-11980-9]
This volume contains 12 research papers presented at the annual computer science symposia
held in Kyoto, Japan during the period 1980-1982. The papers span a wide range of computer research
including programming language design, analysis of algorithms, database systems, and computer
architecture. MS


Applications (Physics), P. Theory of Phase Transitions: Rigorous Results. Ya. G. Sinai. Intern. 
Ser. in Natural Philo. Pergamon Pr, 1982, viii + 153 pp, $27. [ISBN: 0-08-026469-7] Developed from 
a series of lectures, this monograph surveys many of the rigorous results about the theory of phase 
transitions. The intended audience is "specialists in statistical physics." AO 


Reviewers 


RJA: Richard J. Allen, St. Olaf; PB: Peder Bolstad, St. Olaf; RB: Richard Brown, Carleton; JNC: 
Judith N. Cederberg, St. Olaf; CEC: Clifton E. Corzatt, St. Olaf; JD-B: John Dyer-Bennet, Carleton; 
JRG: Jennifer R. Galovich, St. Olaf; SG: Steven Galovich, Carleton; JG: Jack Goldfeather, Carleton; 
PH: Paul Humke, St. Olaf; RBK: Roger B. Kirchner, Carleton; RSK: Richard S. Kleber, St. Olaf; JK: 
Joseph Konhauser, Macalester; LCL: Loren C. Larson, St. Olaf; GHM: George H. Mills, Carleton; RM: 
Richard Molnar, Macalester; RWN: Richard W. Nau, Carleton; AO: Arnold Ostebee, St. Olaf; AWR: A. 
Wayne Roberts, Macalester; MS: Michael Schneider, Macalester; JS: John Schue, Macalester; SS: Sey- 
mour Schuster, Carleton; JAS: J. Arthur Seebach, Jr., St. Olaf; KS: Kay Smith, St. Olaf; LAS: Lynn 
Arthur Steen, St. Olaf; TAV: Theodore A. Vessey, St. Olaf; MW: Martha Wallace, St. Olaf; FLW: Frank 
L. Wolf, Carleton; PZ: Paul Zorn, St. Olaf. 
The Mathematical Association of America 
The Sixty Third Summer Meeting of the Association 
State University of New York, Center at Albany 


The Sixty Third Summer Meeting was held on the campus of the State University of New York, 
Albany Center during the period August 7-12, 1983. There were 909 registrants including 594 members 
of the Association. The Program Committee consisted of C. E. Burgess, Chairman; Donald R. Cohen, 
Vincent F. Cowling, Ronald L. Graham, William F. Hammond, and Theodore Bick. The talks for which 
abstracts were submitted consisted of the following: 


Some Connections Between Algebra and Set Theory, by Barbara Osofsky, Rutgers University. 


The talk began with the standard axioms of Zermelo-Fraenkel set theory (ZF). The axiom of 
choice (AC) was added as "Every subspace of a vector space has a complementary subspace." Other 
algebraic statements closely related to AC were discussed. A module M over R[X,Y,Z] was constructed 
with the continuum hypothesis (CH) equivalent to 2Xn = wl for n contained in w were given. The 
existence of measurable cardinals was presented in terms of a mapping property of abelian groups. 
Lastly the proposition: "If every exact sequence of abelian groups 0 -> Z -> E -> G -> 0 splits, 
then G is free," which is undecidable in ZF + AC + CH was discussed. 


Understanding and Teaching Mathematical Problem Solving, by Alan Schoenfeld, University of Rochester. 


In a recent report on "Problem Solving in the Mathematics Curriculum" (MAA Notes #1), the MAA 
Committee on the Teaching of Undergraduate Mathematics wrote: "We endorse any approach to teaching 
mathematics that...actively engages students in the process of doing mathematics. We encourage the 
use of a 'problem based approach' wherever possible in standard course offerings, including the par- 
ticipation of students in discussing, solving, and presenting their solutions to problems... . We 
recommend that a series of problem solving courses at various levels of sophistication be developed 
and made regular offerings in the curriculum." This talk focused on the results of research into 
mathematical problem solving (what accounts for the success of good problem solvers, the failures of 
poor ones?), on some practical suggestions for offering problem solving classes, and on some useful 
suggestions for implementing CTUM’s recommendations. 


SR NE SR SRE 


Ancient Greeks started the study of polyhedra, but the topic remains rather confused to this 
day. There is no precise and agreed definition as to what is a polyhedron, and different authors 
have in mind different things when they speak of polyhedra. The practice of concentrating on regu- 
lar or other very special polyhedra adds to the confusion: for them the need for workable and pre- 
cise definitions is lessened, since many different approaches lead to quite similar results. 
Several concepts of “polyhedra" were discussed. This provided a convenient setting for presenting 
various old and new results, and many unsolved questions. 


Theoretical and Experimental Contributions to the Development of a Science of Vehicular Traffic, by 
Robert Herman, University of Texas. 


A brief historical survey was given to show that traffic problems are very old. In addition, a 
bird’s eye view of developments in the science of vehicular traffic touched on single lane flow 
and stability, a kinetic theory of multilane flow, the characterization of urban traffic and energy 
consumption, and a two-fluid model of town traffic. 


Progress Report of the Commission on Precollege Education in Mathematics, Science, and Technology, 
by Katherine P. Layton, Beverly Hills High School. 


In response to the current decline in the quality and quantity of precollege mathematics and 
science education in the United States, the National Science Board established the Commission on 
Precollege Education in Mathematics, Science and Technology. The purpose of the Commission is to 
define a national agenda for improving mathematics and science education. A progress report was 
given explaining the work done by the Commission during its first fourteen months. Included was a 
discussion of tentative recommendations. 


Convexity Ideas in Geometric Function Theory, by Thomas H. MacGregor, SUNY, Center at Albany. 


Presented were ways in which the idea of convexity plays a role in complex analysis. In par- 
ticular, the use of linear methods in the area of conformal mapping was discussed. The extreme 
points and the convex hulls of various families of analytic functions were described, and these 
sets were related to the solution of extremal problems and to the set of support points. This 
involved a blend of geometric, algebraic and analytic notions. 


Primality Testing, by Carl Pomerance, University of Georgia. 


If a large number is prime, how can one quickly find a proof that it is so? With recent tests, 
average 100 digit primes take about 33 seconds on a fast computer, while 200 digit primes take about 
8 minutes. Worst cases take about twice as long. With the older tests, worst case 100 digit primes 
could have taken about 100 years of computer time and 200 digit primes about 10^9 years. 
Nevertheless, much of the new test has its roots in the older tests. These developments were dis- 
cussed, especially the similarities and differences between the new tests and the old tests. 


Galois' Version of Galois Theory, by Harold M. Edwards, NYU-Courant Institute. 


This talk was an exposition of the ideas of Galois' original memoir "On the conditions for the 
solvability of equations by radicals," including his method of defining the Galois group of an equa- 
tion and his proof that an equation is solvable by radicals if and only if its group is solvable. 


Other speakers and their titles were: Richard Lewontin, Harvard University, "Analysis of Com- 
plex Genetic Systems," and Elias M. Stein, Princeton University, "Some Ideas in the Development of 
Fourier Analysis," the Earle Raymond Hedrick Lecture for 1983. 


Special Sessions 


Mini-Courses: The Association sponsored mini-courses entitled "Pascal for Mathematicians," by 
Harley Flanders, Florida Atlantic University; "Problems from Industry for Use in the Undergraduate 
Classroom," by Jeanne L. Agnew and Marvin S. Keener, Oklahoma State University; "An Introduction to 
the Mathematical Techniques and Applications of Computer Graphics," by Joan Wyzkoski, Bradley; 
"Conduit Microcomputer Software in Mathematics Instruction," by David A. Smith, Duke University; 
"Commercial Microcomputer Software in Undergraduate Mathematics Instruction," by David A. Smith, 
Duke University; "Coloring Problems," by David M. Berman, University of New Orleans. 


Contributed Paper Sessions: The Association sponsored three contributed paper sessions. The 
interest areas, presenters, and submitted abstracts follow: 


The Use of Computers in Undergraduate Mathematics Instruction. Presider: Ronald H. Wenger, Univer- 
sity of Delaware. 


"How Effective is Computer-Assisted Instruction for Mathematics Remediation," by Henry Africk, New 
York City Technical College of the City University of New York. 


Computer-assisted instruction can improve overall student performance when used as a supplement 
to regular classroom instruction. The amount of improvement will vary depending on the topic, the 
type of student, the quality of classroom instruction as well as the instructional design of the 
computer program itself. These points were illustrated with examples from the computer-assisted 
instruction project at New York City Technical College. 


"The Use of a Software Authoring System to Individualize Instruction in a Mathematics Pre-Calculus 
Course," by Fredric Tufte, University of Wisconsin, Platteville, Wisconsin. 


This paper discussed the use of TIPS (Teaching Information Process System), a software author- 
ing system, in the design of a computer-assisted instructional package to supplement normal instruc- 
tion in a mathematics pre-calculus course. Once the package has been designed and implemented by 
the instructor, the student is able to take multiple choice tests over small units of material. The 
system diagnoses concepts with which the student has difficulty and prescribes additional work 
intended to correct the exhibited deficiencies. 


"A Computer Aided Calculus Course," by Mario Martelli, Bryn Mawr College, Bryn Mawr, Pennsylvania. 


Described was how the Computer-Calculus course was organized in the last two years. Also 
described was a set of programs designed for addressing the main topics of a first year Calculus, 
which require a minimum knowledge of computer language and which can be used for either 
illustration or investigation purposes. 


"Motivating Trigonometry: Using the Computer to Investigate Harmonic Motion,” by Martin E. Flash- 
man, Humboldt State University, Arcata, California. 


The harmonic motion of a spring can motivate the circular trigonometric functions using the 
standard second order differential equation y'' = -y with y(0) = 0 and y'(0) = 1. Proceeding by 
Euler’s Method, computations illustrate delayed feedback and the important interaction of y and y'. 
With Δt small, a computer or hand plot of the approximate solutions for y(t) and y'(t), with x(t) = 
y'(t), suggests the relations of the points (x(t),y(t)) to the unit circle, which then can be pro- 
ven. Finally the parameter t can be discovered as the arc length on the unit circle. 
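
The computation described in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the speaker's program: the function name `euler_spring`, the step size, and the stopping time are hypothetical choices made here. Writing x = y', the equation y'' = -y becomes the pair x' = -y, y' = x, and each Euler step uses the old values of both variables.

```python
import math

def euler_spring(dt, t_end):
    """Euler's Method for y'' = -y with y(0) = 0, y'(0) = 1, writing x = y'.

    Returns the list of points (x, y) visited, starting from (1, 0).
    """
    steps = int(round(t_end / dt))
    x, y = 1.0, 0.0
    points = [(x, y)]
    for _ in range(steps):
        # Both right-hand sides are evaluated with the old x and y.
        x, y = x - dt * y, y + dt * x
        points.append((x, y))
    return points

# Follow the motion for a quarter period; the last point should lie near (0, 1).
path = euler_spring(0.001, math.pi / 2)
x_end, y_end = path[-1]
print(x_end, y_end, x_end**2 + y_end**2)
```

Each step multiplies x^2 + y^2 by exactly 1 + dt^2, so the plotted points spiral slowly outward from the unit circle, and the drift shrinks as dt does.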


"Using a Surface Plotting Program in Sophomore Calculus," by Richard J. Cleary, Saint Michael’s Col- 
lege, Winooski, Vermont. 


In the spring of 1983 at Saint Michael’s College, fourth semester calculus students were 
required to complete a computer graphics project involving the use of a surface plotting program. 
The program, called BDPLOT, was developed by the National Center for Atmospheric Research. Our ver- 
sion was adapted by Middlebury College and run on Saint Michael’s DEC PDP-11 using the RSTS/E 
operating system. The primary goal of the project was to allow the students to obtain a graph of a 
function f(x,y) which they had previously analyzed to find critical points. Each student submitted 
a copy of three pictures with the analysis of the function attached. The results of the project and 
suggestions for improvement were discussed. 


"A Numerical Approximation Method for Double Integrals," by Donald R. Snow, Brigham Young Univer- 
sity, Provo, Utah. 


This numerical method arose in a calculator/computer multivariable calculus class. It is the 
natural generalization of the Trapezoidal Rule for one-variable integrals and was worked out through 
a series of homework problems and discussions with the class. The domain of integration for the 
double integral is divided into triangles and the function surface is approximated by a plane 
through the function values at the corners of these triangles. The "Triangular Rule" is then the 
sum of the volumes of each of these prisms. The formulas turn out to be quite simple and give good 
results. By experimenting with the Romberg correction formulas the error estimate for the Rule was 
determined. This is a good illustration that through the use of computational aids in the class the 
students can understand more clearly the concept under consideration and can share in the excitement 
of discovery. 
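
A minimal sketch of such a rule follows, assuming a rectangular domain cut into an n-by-n grid with every cell split along one diagonal. The abstract does not specify the triangulation, so the grid, the choice of diagonal, and the name `triangular_rule` are assumptions made for illustration. The volume under a plane over a triangle is the triangle's area times the average of the three corner values.

```python
def triangular_rule(f, n, a=0.0, b=1.0, c=0.0, d=1.0):
    """Approximate the double integral of f over [a, b] x [c, d].

    Each of the n*n grid cells is split into two triangles along the
    diagonal from its lower-left to its upper-right corner, and f is
    replaced on each triangle by the plane through its corner values.
    """
    hx = (b - a) / n
    hy = (d - c) / n
    tri_area = 0.5 * hx * hy
    total = 0.0
    for i in range(n):
        for j in range(n):
            x0, x1 = a + i * hx, a + (i + 1) * hx
            y0, y1 = c + j * hy, c + (j + 1) * hy
            f00, f10, f11, f01 = f(x0, y0), f(x1, y0), f(x1, y1), f(x0, y1)
            # Prism volume = base area * mean height at the three corners.
            total += tri_area * (f00 + f10 + f11) / 3.0
            total += tri_area * (f00 + f11 + f01) / 3.0
    return total

print(triangular_rule(lambda x, y: x + 2 * y, 4))   # a plane is integrated exactly
print(triangular_rule(lambda x, y: x * y, 10))      # close to the exact value 1/4
```

For f(x, y) = xy on the unit square with this triangulation, the rule exceeds the exact value 1/4 by 1/(12 n^2), so halving the mesh width cuts the error by a factor of four, which is the behavior a Romberg-type correction exploits.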


"Inverse Functions, Implicit Functions and Computer Graphics," by Sheldon P. Gordon, Suffolk Commun- 
ity College, Selden, New York. 


The use of sophisticated computer graphics provides the user with a powerful tool to investi- 
gate many aspects of mathematics that are inherently geometric in nature and yet cannot be dealt 
with by hand due to their complexity. The talk was focused on two such topics from elementary cal- 
culus and differential equations and demonstrated how the computer can add a valuable dimension to 
student understanding. 


"The St. Petersburg Paradox," by Allan J. Ceasar, U.S. Merchant Marine Academy, Kings Point, New 
York. 


The Strong and Weak Laws of Large Numbers were compared in this paper through the use of the St. 
Petersburg Paradox. The gamble, which was shown to have an infinite mean, created controversy 
because most participants would be willing to pay only a relatively small entrance fee to play. 
This is inconsistent with the concept of a fair game. Utility and expected utility were discussed as 
a means of avoiding this dilemma. A computer program to simulate this gamble was analyzed and a 
graphical presentation of the computer printout proved to be a valuable pedagogical tool in demon- 
strating the Law of Large Numbers. 
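
A simulation of this kind might look like the following sketch. It is not the program analyzed in the talk; the payoff convention (2^k when the first head appears on toss k), the seed, and the function names are assumptions made here for illustration.

```python
import random

def st_petersburg_payoff(rng):
    """Play one game: toss a fair coin until the first head.

    Assumed convention: the payoff is 2**k when the first head
    appears on toss k.
    """
    tosses = 1
    while rng.random() < 0.5:   # a tail, so toss again
        tosses += 1
    return 2 ** tosses

def running_means(n_games, seed=0):
    """Return the average payoff after 1, 2, ..., n_games games."""
    rng = random.Random(seed)
    total = 0
    means = []
    for k in range(1, n_games + 1):
        total += st_petersburg_payoff(rng)
        means.append(total / k)
    return means

means = running_means(10000, seed=1)
print(means[99], means[999], means[9999])
```

Under this convention the outcome "first head on toss j" contributes 2^j * 2^(-j) = 1 to the expected payoff for every j, so the mean is infinite; a plot of the running means shows occasional large jumps rather than settling to a limit.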


"Using Computer Simulation in Teaching Statistics," by Robin H. Lock, St. Lawrence University, Can- 
ton, New York. 


The computer may be effectively used in most statistics courses as a tool for generating random 
data. It allows the instructor to control the experimental situation completely by specifying both 
the "true" underlying model and the nature of the random noise which is introduced into the sam- 
pling scheme. Furthermore, the student may easily obtain samples, even for a complicated experimen- 
tal design, in a convenient form. Two examples of this sort of computer simulation were discussed. 
The first is a rather simple program called SAMPLER which generates a sample from a randomly chosen 
distribution. The student’s task is to identify the distribution and estimate its parameters. The 
second program, GOLFBALLS, simulates both the distance and deviation of golf shots. The student may 
control variables such as the color, compression, and construction of the balls to investigate their 
effect on the shots. 


“An Undergraduate Course in Time Series Analysis Made Possible with Minitab," by Constance M. Elson, 
Ithaca College, Ithaca, New York. 


Formal time series analysis involves graduate-level mathematics, but by using Minitab to provide 
"hands-on experience," it proved possible to offer a one-semester course in the subject to mathemat- 
ically unsophisticated students. Typically, these students had had 1-2 semesters of calculus and a 
1-semester non-calculus statistics course. Minitab, a widely-used software package for statistical 
analysis of data, is quickly mastered and easy to use in an interactive mode; it proved valuable in 
teaching some basic concepts of stochastic modeling which would not have been accessible to these 
students if stated in abstract terms. By using Minitab to do their own "random sampling" of a ran- 
dom variable with a given probability distribution, the difference between complete theoretical 
knowledge and inferences based on partial observations became explicit and vivid. Building on this, 
it was possible to convey a correct intuitive understanding of the difference between a stochastic 
process (a sequence of random variables) and a single realization of the process, the "observed time 
series" (a single sample path of the process). Once the students had grasped these ideas, concepts 
like stationarity, ergodicity, and conditional expectation became meaningful to them. 


Classroom Notes. Presider: J. Arthur Seebach, St. Olaf College. 


"Using Discovery and Invention to Learn About Computers," by Marjory Baruch, Hamilton College, Clin- 
ton, New York. 


"Simpson’s Rule as a Weighted Average," by Tom Tucker, Colgate University, Hamilton, New York. 


“An Heretical Approach to Solving Polynomial Equations," by Sylvan Burgstahler, University of 
Minnesota, Duluth, Minnesota. 


"The Predator-Prey Problem Revisited," by Roland Lamberson, Humboldt State University, Arcata, Cali- 
fornia. 


"Numerical Linear Algebra: A Brief Survey," by Steven Leon, Southeastern Massachusetts University. 


“An Elementary Proof That Certain Numbers are Irrational," by Harvey Hindin, Hi-Tech Editorial, 
Inc., Dix Hills, New York. 


The Undergraduate Mathematics Curriculum. Presider: Jerome A. Goldstein, Tulane University. 


"A Multiple Linear Regression Model for Placement Using the MAA Placement Exam," by E.J. Manfred and 
C.F. McCarthy, U.S. Coast Guard Academy. 


This was a continuation of a study designed to investigate methods by which avail- 
able data on incoming cadets at the United States Coast Guard Academy can be used as a guide for 
placement in the first year mathematics course. A multiple linear regression model that predicts 
grades in the first year mathematics course has been used for the past several years. A new model 
was developed that included as one of the independent variables the score on the MAA Calculus 
Readiness Placement Exam. Intercorrelations between the Calculus Readiness Exam and other 
independent predictor variables were examined. 


"Mathematics Remediation with Dignity," by L. Clark Lay, Aurora College, Aurora, Illinois. 


Optimum college mathematics remediation requires more than just a review of concepts met before 
but not mastered. Fresh content is essential. This must be new, simple, and significant. 


"The Undergraduate Teaching Assistant in Development and College Mathematics Courses,” by Marguerite 
Graves, Pennsylvania State University, Allentown, Pennsylvania. 


The peer tutor, i.e., PT, the undergraduate teaching assistant fairly close to the student in 
academic development, is one of our precious resources, often preferable to the graduate TA even 
when the latter is available. 


In various forms of individualized instruction or tutorial set-ups, the PT has a well defined 
function (grading unit tests, answering questions, etc.) and facilitates flexibility in the 
curriculum. In the "group" method, the role is more subtle. A structure is created when the better 
student helps the members of the group--and thus him or herself. Various group projects might 
include take home tests and grading of take home tests for the other groups, chapter outlines, etc. 


"Variables in the Description of Developmental Mathematics Programs," by Miriam Hecht, Hunter Col- 
lege, CUNY, New York. 


Over the past decade, developmental mathematics programs have proliferated on American cam- 
puses. These programs, many of which have been described in the literature, often embody important 
educational innovations. However, the reader may be at a loss to evaluate, replicate, compare, or 
even understand such programs because authors vary in their selection and discussion of salient 
variables. Thus, one account may describe the role of tutors and computers while failing to mention 
the size of the program, the content, or the completion rate. Another may provide details of an 
independent study program without considering the number, age, and preparation of the students or 
the vehicle of instruction. 


In this paper, the author identified and described a number of variables entering into the 
design and implementation of developmental programs. Such an outline provided a framework for 
description and analysis. It is particularly useful in connection with developmental courses 
because these courses usually address the needs of nontraditional students and incorporate an 
impressive variety of resources. However, it is applicable to the description of any college pro- 
gram, in whatever field. 


"Large Section Teaching of Developmental Mathematics,” by John L. Tilley, Mississippi State Univer- 
sity, Mississippi State. 


The fiscal, logistical, and philosophical reasons for using our format were touched on. As 
a mandated program by the Board of Trustees of the Institutions of Higher Learning for the State of 
Mississippi, the "tracking" of all students meeting the Board's cut-off requirements has produced a 
great amount of statistical data about these students. From this mass of data, information about 
students' grades in the course and subsequent grades in credit mathematics courses is available. 
Focus on subsequent courses is primarily on College Algebra with a semester prior to the introduc- 
tion of Developmental Mathematics. 


"Convergence Isn't Sufficient," by Joanne Growney, Bloomsburg State University, Bloomsburg, Pennsyl- 
vania. 


Mathematics instruction primarily focuses on teaching students to find solutions to carefully 
posed problems. This process of "convergent thinking" is necessary and effective, but it is not 
sufficient in the large number of day-to-day situations that are permeated with unidentified 
problems or questionable solution proposals. Such situations need individuals skilled in divergent 
thinking (the ability to produce a variety of ideas that lead in different directions) and 
evaluation (the ability to reach decisions about the quality of proposed solutions to a problem). 
Traditionally, mathematics courses have been seen as a "natural setting" for teaching thinking. 
However, learning to think is not an automatic by-product of learning mathematics; moreover, 
divergent thinking and evaluation are not automatic by-products of convergent thinking skills. 
Instead, it is likely that we teach thinking skills only when it is our goal to do so. Thus, it is 
important that we design our teaching strategies and curricular materials to develop specific 
thinking skills. Few teaching materials that emphasize divergent thinking or evaluation were found. 
The presentation provided examples of such materials and urged interested mathematicians and 
teachers to develop instructional materials to help to achieve these teaching goals. 


"A Mathematics--Art Course," by Frederick Solomon, SUNY College at Purchase, Purchase, New York. 


The course taught at State University of New York, College at Purchase has both an artist and a 
mathematician as instructors. The course is aimed at freshmen and sophomores and combines mathemat- 
ical ideas with visual representation. Students are required to complete ten projects visually 
illustrating various mathematical concepts. The talk included syllabus, annotated bibliography and, 
in general, focused on techniques for topic presentation and rationale for inclusion of topics in 
the course. 


"The Sloan Foundation Discrete Mathematics Initiative," by Steve Maurer, Swarthmore College, Swarth- 
more, Pennsylvania. 


The Sloan Foundation recently announced grants of $40,000 each to 6 institutions to develop 
mainstream mathematics curricula for the first two undergraduate years in which discrete mathematics 
receives as much attention as calculus. What programs have these 6 institutions proposed? Should 
this initiative succeed? Will it? 


"Constraints on the Changing Curriculum," by Richard Anderson, Louisiana State University, Baton 
Rouge, Louisiana. 


"Mathematics in China--Summer, 1983," by Donald Bushaw, Washington State University. 


The Association also presented the following Film Program: 


Points of view: Perspective and projection 
Dragon fold...and other ways to fill space 
Caroms 
Linear programming 
Turning a sphere inside out 
Hypothesis testing, inferential statistics, Part II 
Journey to the center of a triangle 
Symmetry and tessellations 


Board and Business Meeting 


The Board of Governors met at 9:00 a.m. on Sunday, August 7, 1983. The major items of business 
transacted by mail will be announced to the membership in FOCUS. 


The Business Meeting of the Association was held at 4:30 p.m. on Tuesday, August 9, 1983 and 
the Carl B. Allendoerfer and George Polya Awards for expository articles in Mathematics Magazine and 
the Two-Year College Mathematics Journal were presented. 


Election of Members 


At its meeting on August 7, 1983, the Board elected to membership 794 applicants for individual 
membership and 10 applicants for academic membership. The latter follow: 


Juniata College, Huntingdon, Pennsylvania 
Norwalk State Technical College, Norwalk, Connecticut 
Pace University, Pleasantville, New York 
Chattahoochee Valley Community College, Phenix City, Alabama 
Ryerson Polytechnical Institute, Toronto, Ontario, Canada 
University of Puerto Rico, Mayaguez, Puerto Rico 
University of Southern Colorado, Pueblo, Colorado 
Delaware County Community College, Media, Pennsylvania 
Mercy College of Detroit, Detroit, Michigan 
University of West Indies, Kingston, West Indies 


Respectfully submitted, 


David P. Roselle, Secretary 
[6, pg. 145] that for such g(w), | - Ot < 18. Based on this inequality, one could derive 
conditions concerning the first three derivatives of F analogous to the condition in the Theorem. 
Using such a result, one might be able to construct an example showing that (1), (2) and (3) are 
not enough to guarantee the univalence of F. 
A CLASSROOM NOTE ON THE SAMPLE VARIANCE AND THE SECOND MOMENT 


J. J. BIRCH AND TIM ROBERTSON 
Department of Statistics and Actuarial Science, The University of Iowa, Iowa City, IA 52242 


In teaching statistics, one uses a number of fundamental criteria to select estimators and judge 
their quality. Apparent paradoxes occasionally arise when two or more of the principles are 
applied to the same problem. It is the purpose of this note to point out one such conflict that 
appears not to have been noticed by many statisticians. 

The two criteria we have in mind are the principles of maximum likelihood and mean squared 
error. We consider the concepts in the selection of an estimator for the variance of a normal 
population. The normal distribution is a two parameter distribution and if we are interested in its 
variance, $\sigma^2$, then the mean, $\mu$, is an auxiliary parameter. More specifically, the estimation of $\sigma^2$ 
depends upon whether or not $\mu$ is known; if $\mu$ is unknown, its estimation seems to take precedence 
over the estimation of $\sigma^2$. When $\mu$ is known, the maximum likelihood estimator of $\sigma^2$ is the 
average of the squared deviations from $\mu$, i.e., the second sample moment about the mean 

$$S^2(\mu) = \frac{1}{n}\sum_{i=1}^{n}(X_i - \mu)^2.$$

This estimator, when $\mu$ is known, has certain desirable properties such as: (1) it results from the 
method of moments; (2) it is unbiased and, moreover, (3) it is the minimum variance unbiased 
estimator. 

The criterion of sufficiency is often used in choosing estimators. A statistic is sufficient if, 
roughly speaking, all pertinent sample information about the distribution is contained in the value 
of the statistic. Thus, if an estimator is a function of a sufficient statistic, then the full expression 
of the extent to which the data give information about the parameter is obtained from such an 
estimator. M. G. Kendall (1946) apparently had this criterion in mind when he stated in his classic 
text (page 11) that “A sufficient estimator is best for any sample size since it gives all the 
information about $\theta$ that a sample can give; and is most efficient for large samples.” Continuing, 
he points out that when sampling from a normal population with known mean 

$$S^2(\bar X) = \frac{1}{n}\sum_{i=1}^{n}(X_i - \bar X)^2$$

is not sufficient for $\sigma^2$. Thus, he says “If we know the real mean $\mu$, there is little point in 
preferring the sample variance $S^2(\bar X)$ to the second moment $S^2(\mu)$ as an estimator of the parent 
variance.” 

Thus, $S^2(\mu)$ seems to be a very desirable estimator. However, before we can claim that this 
estimator is “better” than others, we should, to begin with, consider what losses will be sustained 
if errors are made. A popular method for comparing estimators, based on errors, is to compare 
their mean squared error (MSE). 

The sample variance, $S^2(\bar X)$, can be used as an estimator of $\sigma^2$ when $\mu$ is known although, as 
noted above, we might not expect it to compete very favorably with $S^2(\mu)$, the second sample 
moment, since it seemingly ignores our knowledge of $\mu$. However, when we compare the MSE’s of 
these two estimators, we find that 

$$\text{MSE(of the sample variance)} = \frac{2n-1}{n^2}\,\sigma^4 < \frac{2\sigma^4}{n} = \text{MSE(of the second sample moment)}.$$

These values are obtained from the first two moments of the chi-squared distribution and the facts 
that $nS^2(\bar X)/\sigma^2$ and $nS^2(\mu)/\sigma^2$ have chi-squared distributions with $n - 1$ and $n$ degrees of 
freedom, respectively. This is the apparent paradox and it seems to imply that when $\mu$ is known, 
we should ignore this information and use the estimator, $\bar X$, for $\mu$. 
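
The two values can be checked with exact rational arithmetic. The sketch below is an illustration added here, not part of the authors' argument; the function names are hypothetical. It uses only the chi-squared facts just stated: if $Q/\sigma^2$ is chi-squared with $k$ degrees of freedom, then $cQ$ has mean $ck\sigma^2$ and variance $2kc^2\sigma^4$, and the mean squared error is variance plus squared bias.

```python
from fractions import Fraction

def mse_scaled_chi2(c, k):
    """MSE, in units of sigma^4, of c*Q as an estimator of sigma^2,
    where Q / sigma^2 is chi-squared with k degrees of freedom."""
    c = Fraction(c)
    variance = 2 * k * c ** 2
    bias = k * c - 1
    return variance + bias ** 2

def mse_sample_variance(n):
    # n * S^2(X-bar) / sigma^2 is chi-squared with n - 1 degrees of freedom.
    return mse_scaled_chi2(Fraction(1, n), n - 1)

def mse_second_moment(n):
    # n * S^2(mu) / sigma^2 is chi-squared with n degrees of freedom.
    return mse_scaled_chi2(Fraction(1, n), n)

for n in (2, 5, 10):
    print(n, mse_sample_variance(n), mse_second_moment(n))
```

For every n the first value equals (2n - 1)/n^2 and the second equals 2/n, in units of $\sigma^4$, which reproduces the inequality displayed above.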

In connection with some recent research (Randles (1982) and Pierce (1982)), we note that the 
second sample moment, $S^2(\mu)$, is a U-statistic so that the sample variance $S^2(\bar X)$ is a U-statistic 
with an estimated parameter. A natural question would be to ask how much efficiency is lost when 
using a U-statistic with an estimated parameter over the corresponding U-statistic. Of course the 
U-statistic is unbiased, by definition, but the corresponding U-statistic with an estimated parame- 
ter may be biased. Thus, efficiency cannot be measured in terms of variance. One way of 
measuring it would be in terms of mean squared error. This example illustrates that with regard to 
squared error loss the U-statistic with an estimated parameter may, in fact, have a smaller mean 
squared error than the corresponding U-statistic. 

Another comment we make is that the mean squared errors of the sample variance and the 
second moment can be computed in general (cf. Cramér (1946)). It turns out that as long as the 
population kurtosis is at least as large as 2.5, the MSE of the sample variance is smaller than 
the MSE of the second sample moment. One might infer from this fact that the sample variance is 
more “robust” than the second sample moment. 

When $\mu$ is unknown in a normal population, we usually compare $S^2(\bar X)$, the sample variance, 
with the “best” unbiased estimator of $\sigma^2$. In this case, it is well known that the unbiased estimator 

$$\frac{n}{n-1}S^2(\bar X) = \frac{1}{n-1}\sum_{i=1}^{n}(X_i - \bar X)^2$$

is the minimum variance unbiased estimator of $\sigma^2$ (it is the recommended estimator in many 
books on statistics). An easy calculation yields 

$$\mathrm{MSE}\left(\frac{n}{n-1}S^2(\bar X)\right) = \frac{2\sigma^4}{n-1}$$

and 

$$\mathrm{MSE}\left(\frac{n}{n-1}S^2(\bar X)\right) > \mathrm{MSE}\left(S^2(\mu)\right),$$

which agrees with our intuitive principle. Nevertheless, we still feel a little shaky about using 
$S^2(\mu)$ as an estimator of $\sigma^2$ when $\mu$ is known, because $S^2(\bar X)$ has an even smaller MSE than 
$S^2(\mu)$. 

One explanation is that none of these estimators has the correct divisor if we are to use mean 
squared error as a measure of quality. It is well known and easy to verify that 

$$\mathrm{MSE}\left(\frac{n}{n+1}S^2(\bar X)\right) \le \mathrm{MSE}(a\,nS^2(\bar X))$$

and 

$$\mathrm{MSE}\left(\frac{n}{n+2}S^2(\mu)\right) \le \mathrm{MSE}(a\,nS^2(\mu))$$

for every $a > 0$. Thus if mean squared error is our criterion, we should use 

$$\frac{n}{n+2}S^2(\mu) = \frac{1}{n+2}\sum_{i=1}^{n}(x_i - \mu)^2$$

when $\mu$ is known (an admissible estimator) and 

$$\frac{n}{n+1}S^2(\bar X) = \frac{1}{n+1}\sum_{i=1}^{n}(x_i - \bar X)^2$$

when $\mu$ is unknown (an inadmissible estimator; cf. Stein (1964)). Moreover, our intuition is then 
confirmed as 

$$\mathrm{MSE}\left(\frac{n}{n+2}S^2(\mu)\right) = \frac{2\sigma^4}{n+2} < \frac{2\sigma^4}{n+1} = \mathrm{MSE}\left(\frac{n}{n+1}S^2(\bar X)\right).$$

Thus the statistic using the known mean $\mu$ is better, in the sense of mean squared error, than the 
one ignoring this information. It should be noted, however, that these two estimators of $\sigma^2$ are 
almost never considered in basic textbooks on statistics. 
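
The divisor comparison for a normal population can be reproduced with exact arithmetic. The sketch below is illustrative and rests on two standard facts that the text does not spell out: sum (x_i - mu)^2 / sigma^2 is chi-square with n degrees of freedom, sum (x_i - Xbar)^2 / sigma^2 is chi-square with n - 1 degrees of freedom, and a chi-square variable with v degrees of freedom has mean v and variance 2v. The function names are hypothetical.

```python
from fractions import Fraction

def mse_scaled_chi2(a, dof):
    """MSE (in units of sigma^4) of a * sigma^2 * chi2(dof) as an estimator of sigma^2."""
    a = Fraction(a)
    # E[(a*W - 1)^2] with E[W] = dof and E[W^2] = 2*dof + dof^2.
    return a * a * (2 * dof + dof * dof) - 2 * a * dof + 1

def mse_known_mean(n, divisor):
    """MSE of sum (x_i - mu)^2 / divisor, normal sample of size n."""
    return mse_scaled_chi2(Fraction(1, divisor), n)

def mse_sample_mean(n, divisor):
    """MSE of sum (x_i - xbar)^2 / divisor, normal sample of size n."""
    return mse_scaled_chi2(Fraction(1, divisor), n - 1)

if __name__ == "__main__":
    n = 10
    for d in (n - 1, n, n + 1):
        print("sample mean, divisor", d, mse_sample_mean(n, d))
    for d in (n, n + 1, n + 2):
        print("known mean,  divisor", d, mse_known_mean(n, d))
```

For n = 10 the printed values should show the divisor n + 1 minimising the MSE when the sample mean is used and the divisor n + 2 minimising it when the mean is known.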

On the other hand, one might not be so much interested in $\sigma^2$ as in $\sigma$. Thus we might compare 
the mean squared errors of $S(\bar X)$ and $S(\mu)$ as estimators of $\sigma$. The distributions of these statistics 
are not so tractable as those of $S^2(\bar X)$ and $S^2(\mu)$. However, we calculated their mean squared 
errors for sample sizes of 1, 2, ..., 30, 40, 50, ..., 100, and in each of these instances the mean 
squared error of $S(\mu)$ was less than that of $S(\bar X)$. Thus it seems that for the estimators $S(\bar X)$ and 
$S(\mu)$ of the standard deviation $\sigma$ our intuition holds while, for the estimators $S^2(\bar X)$ and $S^2(\mu)$ of 
$\sigma^2$ it does not hold. 

Finally, mean squared error is not the only possible criterion for measuring the quality of an 
estimator. Among other measures of the “closeness” of an estimator to its parameter is one 
proposed by Pitman. It can be shown that 


$$P\left[\,|S^2(\bar X) - \sigma^2| > |S^2(\mu) - \sigma^2|\,\right] > \tfrac12$$


so that, in the Pitman sense, $S^2(\mu)$ is “closer” to $\sigma^2$ than $S^2(\bar X)$ is. Here again our intuition is 
upheld. 


ELEMENTARY PROBLEMS 


Solutions of these Elementary Problems should be mailed in duplicate to Professor G. L. Alexanderson, 
Department of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by April 30, 1984. Please place 
the solver’s name and mailing address on each (double-spaced) sheet. Include a self-addressed card or label (for 


E 3025. Proposed by S. W. Graham, University of Texas at Austin, and D. Hensley, Texas 
A & M University. 


Let $\lambda(n)$ be the function on the positive integers determined by the conditions that $\lambda(p) = -1$ 
for all primes $p$ and $\lambda(mn) = \lambda(m)\lambda(n)$ for all $m$ and $n$. (This is commonly referred to as 
Liouville’s lambda function.) Show that there exists a $C > 0$ such that the number of $n < x$ for 
which $\lambda(n) = \lambda(n + 1)$ exceeds $Cx$, for $x$ sufficiently large. 
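
For readers who want to experiment with E 3025, the following Python sketch tabulates Liouville's function with a smallest-prime-factor sieve and counts the integers n up to x at which two consecutive values agree. It is only an exploratory aid, not a solution; the function names are ours, and the count below includes n = x itself.

```python
from math import isqrt

def liouville_table(limit):
    """Return a list lam with lam[n] = Liouville's lambda(n) for 1 <= n <= limit."""
    spf = list(range(limit + 1))            # spf[n] = smallest prime factor of n
    for p in range(2, isqrt(limit) + 1):
        if spf[p] == p:                     # p is prime
            for multiple in range(p * p, limit + 1, p):
                if spf[multiple] == multiple:
                    spf[multiple] = p
    lam = [0] * (limit + 1)
    lam[1] = 1
    for n in range(2, limit + 1):
        # lambda is completely multiplicative and lambda(p) = -1.
        lam[n] = -lam[n // spf[n]]
    return lam

def count_equal_neighbours(x):
    """Number of n with 1 <= n <= x and lambda(n) == lambda(n + 1)."""
    lam = liouville_table(x + 1)
    return sum(1 for n in range(1, x + 1) if lam[n] == lam[n + 1])

if __name__ == "__main__":
    for x in (10, 100, 1000, 10000):
        print(x, count_equal_neighbours(x), count_equal_neighbours(x) / x)
```

Printing the ratio count/x for growing x gives an empirical feel for the constant C whose existence the problem asks one to prove.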


E 3026. Proposed by H. Kestelman, University College, London. 


Let $A$ be a skew-symmetric matrix of order $2n$ considered as a function of its $N = n(2n - 1)$ 
independent upper off-diagonal elements; these are complex numbers ordered conventionally as 
$x_1, x_2, \ldots, x_N$. Show that there exist matrices $F$ and $G$ whose elements are rational functions of 
the $x_i$’s with rational (real) coefficients, and which are such that $A = FG$ and $\det F = \det G$. 


E 3027. Proposed by Cran Minh Crung, Skogn, Norway. 


Evaluate in closed form $B(r, m, n) = \sum_{k \equiv r \pmod{m}} \binom{n}{k}$. 


E 3028. Proposed by B. Tomaszewski, University of Wisconsin. 


Let $\{z_j\}$ be a sequence of complex numbers which converges to 0. Prove that there exists a 
sequence $\varepsilon_j = \pm 1$ such that $\sum_j \varepsilon_j z_j$ converges. 


E 3029. Proposed by S. W. Golomb, University of Southern California. 


Let $\alpha$ and $\beta$ be nonzero elements of a field $F$ which satisfy $\alpha + \beta = 1$ and $\alpha^{-1} + \beta^{-1} = 1$. 
Prove that $\alpha = \beta^{-1}$ and that $\alpha^6 = \beta^6 = 1$. 




E 3030. Proposed by R. E. Shafer, Berkeley, CA. 


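Let $a_0 \ge a_1 \ge a_2 \ge \cdots \ge a_{N-1} > 0$. The polynomial $P_N(z) = \sum_{j=0}^{N-1} a_j z^j$ has zeros $|\zeta| \ge 1$. If for 
one of these zeros $|\zeta| = 1$, then there is $n \ge 2$, $n \mid N$, such that 

$$P_N(z) = \frac{z^n - 1}{z - 1}\sum_{j=0}^{N/n-1} a_{nj}z^{nj}.$$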


SOLUTIONS OF ELEMENTARY PROBLEMS 
Convergent Subsequences 


E 2919 [1981, 764]. Proposed by Elgin Johnston, Iowa State University, Ames. 


(a) Let $\{a_n\}_{n=1}^{\infty}$ be a sequence of real numbers. Suppose that, for each prime $p$, the sequence 
$\{a_{pn}\}_{n=1}^{\infty}$ is increasing and bounded above. Let $\{b_n\}$ be the sequence obtained when those elements 
with prime subscripts are deleted from $\{a_n\}$. Show $\lim_{n\to\infty} b_n$ exists. 

(b) Suppose in part (a) the phrase “is increasing and bounded above” is replaced by 
“converges.” Does the result still hold? 


Solution by J. Browkin, Warsaw University, Poland. (a) From the assumptions it follows in 
particular that the limit of $a_{2n}$ exists; call it $A$. Moreover, if composite numbers $r$, $s$, $r > s$, have a 
common prime factor, then 

(1) $a_r \ge a_s$. 

Take a composite number $m$ and let $q$ be its least prime factor. Then $m = tq$ with $t \ge [\sqrt{m}]$. 
Applying (1) three times we obtain 

$$a_{2m} \ge a_m = a_{tq} \ge a_{2t} \ge a_{2[\sqrt{m}]}.$$

From $\lim_{m\to\infty} a_{2m} = A$, $\lim_{m\to\infty} a_{2[\sqrt{m}]} = A$, we conclude that the sequence $a_m$ with composite 
indices $m$ also converges to $A$, i.e., $\lim_{n\to\infty} b_n = A$. 
(b) No. Take $a_n = 1$ if $n$ is the square of a prime; $a_n = 0$ otherwise. Evidently, for a prime $p$, 
all terms but one in the sequence $\{a_{pn}\}_{n=1}^{\infty}$ are equal to 0. Thus it converges. In the sequence $b_n$ 
infinitely many terms are equal to 0, and infinitely many to 1. Thus it diverges. 


Also solved by U. Abel (Germany), L. Boxer, D. Cantone (Italy), D. Goldberg, T. Hermann (Hungary), V. 
Hernandez (Spain), D. Hill, J. Hook (student), W. Janous (Austria), M. Josephy (Costa Rica), C.-N. Lee (student), 
G. S. Lessells (Ireland), O. P. Lossers (Netherlands), D. A. Mattson, R. E. Megginson, F. B. Miles, R. Moller, A. M. 
Nadel (student), W. A. Newcomb, E. Posti (Finland), H. Prodinger (Austria), D. A. Rawsthorne, B. Richter, J. H. 
Riley, Jr., T. Salat (Czechoslovakia), E. A. Schwenk, A. Seeger (Germany), H.-J. Seiffert (Germany), D. K. Skilton 
(Australia), A. Smuckler (Israel), R. J. Snelling, J. Suck (Germany), G. G. Thompson, University of South Alabama 
Problem Group, D. G. Weinmann, D. Wells, P.-Y. Wu, and the proposer. 


The Discriminant of $x^n - x^{n-1} - \cdots - x - 1$ 


E 2920 [1982, 63]. Proposed by J. O. Shallit, University of California, Berkeley. 


Let $f(x) = x^n - x^{n-1} - x^{n-2} - \cdots - x - 1$, $n \ge 2$. 
(a) Show that the discriminant of $f$ is given by 

$$\operatorname{disc}(f) = (-1)^{n(n+1)/2}\,\frac{(n+1)^{n+1} - 2^{n+1}n^n}{(n-1)^2}.$$

(b) Show that the fraction in (a) is an integer directly. 

Solution to part (a) by D. C. Kurtz (Zomba, Malawi) and P.-Y. Wu (Hsinchu, Taiwan). The 
formula is the special case $a = -1$ of problem 840, Problems in Higher Algebra, by D. K. 
Faddeev and I. S. Sominskii (W. H. Freeman, 1965). 
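
The closed form in part (a) can be spot-checked for small n. The Python sketch below is a verification aid under stated assumptions, not the solvers' method: it computes the discriminant of a monic polynomial as (-1)^{n(n-1)/2} times the resultant of f and f', with the resultant taken as the determinant of the Sylvester matrix, and compares it with (-1)^{n(n+1)/2} ((n+1)^{n+1} - 2^{n+1} n^n)/(n-1)^2. All function names are ours.

```python
from fractions import Fraction

def det(matrix):
    """Determinant by exact Gaussian elimination over the rationals."""
    a = [[Fraction(v) for v in row] for row in matrix]
    n = len(a)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return result

def discriminant(coeffs):
    """Discriminant of a polynomial given by coefficients, highest degree first."""
    n = len(coeffs) - 1
    deriv = [c * (n - i) for i, c in enumerate(coeffs[:-1])]
    m = n - 1
    size = n + m
    rows = []
    for i in range(m):                      # m shifted copies of f
        rows.append([0] * i + list(coeffs) + [0] * (size - n - 1 - i))
    for i in range(n):                      # n shifted copies of f'
        rows.append([0] * i + deriv + [0] * (size - m - 1 - i))
    sign = -1 if (n * (n - 1) // 2) % 2 else 1
    return sign * det(rows) / coeffs[0]

def closed_form(n):
    """The value claimed in part (a) for f = x^n - x^(n-1) - ... - x - 1."""
    sign = -1 if (n * (n + 1) // 2) % 2 else 1
    return Fraction(sign * ((n + 1) ** (n + 1) - 2 ** (n + 1) * n ** n), (n - 1) ** 2)

if __name__ == "__main__":
    for n in range(2, 8):
        f = [1] + [-1] * n
        print(n, discriminant(f), closed_form(n))
```

For n = 2 both columns should show 5, the discriminant of x^2 - x - 1.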


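Solution to part (b) by most of the solvers listed below. Replacing $n$ by $m + 1$, we have to show 
that for $m \ge 2$ the numerator $(m+2)^{m+2} - 2^{m+2}(m+1)^{m+1}$ is $\equiv 0 \pmod{m^2}$. This is a consequence of the 
binomial theorem: modulo $m^2$ we have $(m+2)^{m+2} \equiv 2^{m+2} + (m+2)m\,2^{m+1}$ and $(m+1)^{m+1} \equiv 1 + (m+1)m$, 
so the numerator is $\equiv 2^{m+1}m[(m+2) - 2(m+1)] = -2^{m+1}m^2 \equiv 0$. 


M. Josephy (Costa Rica) proved the result by noting that $p(1) = p'(1) = 0$ when $p(n) = (n+1)^{n+1} - 2^{n+1}n^n$. 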


Also solved by D. Bode (Germany), J. Brillhart, L. Carlitz, S. Gagola, H. Kappus (Switzerland), C.-N. Lee, O. P. 
Lossers (Netherlands), P. Ramankutty (Mexico), A. T. Steele, and the proposer. 


Natural Numbering of a Lower Triangular Array 


E 2926 [1982, 130]. Proposed by Jerrold W. Grossman, Oakland University. 


The cells in a lower triangular array are indexed by pairs of integers $(i, j)$, where $1 \le j \le i \le n$. 
There are two natural ways in which to order the cells: 


ROW ORDER: $(i, j) < (i', j') \Leftrightarrow i < i'$ or ($i = i'$ and $j < j'$). 
COLUMN ORDER: $(i, j) < (i', j') \Leftrightarrow j < j'$ or ($j = j'$ and $i < i'$). 
Let $S(n)$ be the number of cells whose rank is the same in both systems. 
(a) Evaluate S(n). 


(b) Find the values of n for which S(n) = 4. 
(c) Find the values of n for which S(n) is odd. 


Solution by O. P. Lossers, Eindhoven Institute of Technology, Netherlands, and independently by 
the proposer and Donald G. Malm, Oakland University (jointly). We shall assume that $n > 2$. 
(a) Clearly cell $(i, j)$ has row rank 

$$1 + 2 + \cdots + (i - 1) + j = \binom{i}{2} + j$$

and column rank 

$$n + (n-1) + \cdots + (n-j+2) + i - j + 1 = \tfrac12(j-1)(2n-j) + i.$$

Hence $S(n)$ is the number of solutions of 
(1) $i(i-1) + 2j = (j-1)(2n-j) + 2i$, $1 \le j \le i \le n$. 
Putting $x = 2i - 3$, $y = 2n - 2j - 1$, it follows that 
(2) $S(n) = |\{(x, y) \mid x^2 + y^2 = 4n^2 - 12n + 10,\ -1 \le x, y \le 2n-3,\ x + y \ge 2n - 4\}|$. 
The solutions $(\pm 1, 2n - 3)$ and $(2n - 3, \pm 1)$ lead to the cells $(i, j) = (1, 1)$, $(2, 1)$, $(n, n-1)$ and $(n, n)$. 
It follows that 
(3) $S(n) = 2 + |\{(x, y) \mid x^2 + y^2 = 4n^2 - 12n + 10,\ x \ge 1,\ y \ge 1\}|$. 
According to Satz 161 and Satz 162 of E. Landau, Elementare Zahlentheorie, Chelsea, 1950, we 
obtain 

(4) $S(n) = 2 + \sum_{d^2 \mid (4n^2 - 12n + 10)} V((4n^2 - 12n + 10)/d^2)$ 

where $V(m)$ is defined by 

(5) $V(m) = \begin{cases} 0 & \text{if 4 divides } m \text{ or if a prime } p,\ p \equiv 3 \pmod 4, \text{ divides } m, \\ 2^s & \text{if } m \not\equiv 0 \pmod 4 \text{ and no prime } p,\ p \equiv 3 \pmod 4, \text{ divides } m, \end{cases}$ 

and $s$ is the number of different odd prime divisors of $m$. 
Clearly we can replace (4) by 
(6) $S(n) = 2 + \sum_{d^2 \mid (2n^2 - 6n + 5)} V((2n^2 - 6n + 5)/d^2)$. 


(b) Since $2(2n^2 - 6n + 5) = (2n - 3)^2 + 1$ and since $-1$ is not a quadratic residue modulo a 
prime $p$ when $p \equiv 3 \pmod 4$, it follows that 

(7) $2n^2 - 6n + 5 = \prod_{i \in J} p_i^{e_i}$, $e_i \ge 1$, 

where the $p_i$’s are different primes congruent to 1 (mod 4). If $e_i \ge 2$ for some $i \in J$, then $d = 1$ 
and $d = p_i$ already lead to a value of $S(n)$ greater than 4 by (5). If $e_i = 1$ for all $i \in J$, then 
$S(n) = 2 + 2^{|J|} > 4$ if $|J| \ge 2$. It follows that $S(n) = 4$ iff $2n^2 - 6n + 5$ is a prime number. 


(c) An odd term in (6) occurs by (5) iff $2n^2 - 6n + 5 = d^2$. This condition can be rewritten 
as 

(8) $(2n - 3)^2 - 2d^2 = -1$, 

and it follows from Theorem 8-6 in W. J. LeVeque, Topics in Number Theory, vol. I, 
Addison-Wesley 1956, that $S(n)$ is odd iff for some $m \ge 1$ 

(9) $2n - 3 = 1 + \binom{2m+1}{2}2 + \binom{2m+1}{4}2^2 + \cdots + \binom{2m+1}{2m}2^m$. 

Since the right-hand side in (9) equals 

$$\tfrac12\left[(1+\sqrt2)^{2m+1} + (1-\sqrt2)^{2m+1}\right] = \left\langle \tfrac12(1+\sqrt2)^{2m+1} \right\rangle,$$

where $\langle x \rangle$ denotes the nearest integer to $x$, we have $S(n)$ is odd iff 

$$n = \left(3 + \left\langle (1+\sqrt2)^{2m+1}/2 \right\rangle\right)/2.$$
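
The three parts of E 2926 lend themselves to a brute-force check. The Python sketch below simply lists the cells in row order and in column order and counts the positions where the two lists agree; the helper names are ours, and the trial-division primality test is only meant for the small values used here.

```python
def fixed_rank_count(n):
    """S(n): number of cells (i, j), 1 <= j <= i <= n, with equal row rank and column rank."""
    cells = [(i, j) for i in range(1, n + 1) for j in range(1, i + 1)]
    row_order = sorted(cells)                                    # by i, then j
    column_order = sorted(cells, key=lambda c: (c[1], c[0]))     # by j, then i
    # A cell has the same rank in both systems exactly when it sits at the
    # same position in both sorted lists.
    return sum(1 for a, b in zip(row_order, column_order) if a == b)

def is_prime(m):
    """Trial-division primality test, adequate for small m."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

if __name__ == "__main__":
    for n in range(3, 31):
        s = fixed_rank_count(n)
        print(n, s, "prime" if is_prime(2 * n * n - 6 * n + 5) else "", "odd" if s % 2 else "")
```

In the printed table, S(n) = 4 should coincide with 2n^2 - 6n + 5 being prime, and the first odd values should appear at n = 5 and n = 22, matching 2n - 3 = 7 and 41.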


Parts (a), (b) also solved by J. Suck (Germany), and M. Vowe (Switzerland). 


ADVANCED PROBLEMS 


Solutions of these Advanced Problems should be mailed in duplicate to Professor G. L. Alexanderson, Department 
of Mathematics, University of Santa Clara, Santa Clara, CA 95053, by April 30, 1984. The solver’s full 
post-office address should be on each sheet. 


6445. Proposed by Richard Askey, University of Wisconsin. 


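Show that Pfaff’s transformation of the hypergeometric function 

$${}_2F_1(a, b; c; x) = (1 - x)^{-a}\,{}_2F_1\!\left(a, c - b; c; \frac{x}{x-1}\right)$$

implies Landen’s transformation of the dilogarithm 

$$\mathrm{Li}_2(x) := \sum_{n=1}^{\infty}\frac{x^n}{n^2} = -\tfrac12[\log(1 - x)]^2 - \mathrm{Li}_2(x/(x - 1)).$$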
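
Landen's transformation is easy to test numerically before attempting the derivation. The Python sketch below sums the defining series directly, so it is only meaningful where both x and x/(x - 1) lie inside the unit disc (for real x this means -1 < x < 1/2); the function names and the truncation at 300 terms are our choices.

```python
import math

def dilog(x, terms=300):
    """Partial sum of Li_2(x) = sum_{n>=1} x^n / n^2, valid for |x| < 1."""
    return sum(x ** n / n ** 2 for n in range(1, terms + 1))

def landen_gap(x):
    """Li_2(x) + Li_2(x/(x-1)) + (1/2) log(1-x)^2, which should vanish."""
    return dilog(x) + dilog(x / (x - 1)) + 0.5 * math.log(1 - x) ** 2

if __name__ == "__main__":
    for x in (0.3, 0.1, -0.5):
        print(x, landen_gap(x))
```

The printed gaps should be zero up to floating-point rounding.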


6446. Proposed by Paul S. Bruckman, Carmichael, CA. 
Define the generalized totient function $\phi_r$ as follows: 

$$\phi_r(n) = \sum_{\substack{k=1 \\ (k,n)=1}}^{n} k^r, \quad r = 0, 1, 2, \ldots,$$

and its (Dirichlet) generating function 

$$f(s) = \sum_{n=1}^{\infty} \phi_r(n)/n^s,$$

defined for an appropriate domain of $s$. Show that 


r+] 
f(s) = 


+] 
Ce Drern EME Brake W), sor 42, 
k=1 
where the $B_k$’s are Bernoulli numbers and $\zeta$ is the Riemann zeta function. 


6447. Proposed by P. Erdős, Hungarian Academy of Sciences, and J. L. Selfridge, Mathematical 
Reviews. 

For fixed $k \ge 2$, if $n \ge 2k$, show that there is at least one $i$, $0 \le i \le k - 1$, such that $n - i$ 
does not divide $\binom{n}{k}$. On the other hand, there is an $n_k \ge 2k$ for which $n - i$ divides $\binom{n}{k}$ for all 
but one $i$ in the range $0 \le i \le k - 1$. Estimate the smallest such $n_k$ as well as you can. 


SOLUTIONS OF ADVANCED PROBLEMS 
The Expected Value of the Largest Part of a Partition 


6386 [1982, 338]. Proposed by I. P. Goulden and L. B. Richmond, University of Waterloo. 

Assuming each $m$-part ordered partition of $n$ to have probability $1/\binom{n-1}{m-1}$, let the expected 
value of the largest part in such a partition be $E_n$. Show that, for fixed $m$, $\lim_{n\to\infty} E_n/n = S_m/m$, 
where $S_m = \sum_{i=1}^{m} 1/i$ is the $m$th partial harmonic sum. 


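Solution by Lajos Takacs, Case Western Reserve University. If $n = \delta_1 + \delta_2 + \cdots + \delta_m$ is an 
ordered random partition of $n$ into the sum of $m$ positive integers, then $\delta_1, \delta_2, \ldots, \delta_m$ are 
interchangeable random variables and 

$$P\{\delta_1 > k, \delta_2 > k, \ldots, \delta_j > k\} = \binom{n - jk - 1}{m - 1}\Big/\binom{n-1}{m-1}$$

for $0 \le k \le (n - m)/j$ and $j = 1, 2, \ldots, m$, since the number of representations of $n$ as a sum of 
$m$ positive integers in which $j$ given terms are greater than $k$ is $\binom{n - jk - 1}{m - 1}$. If $\rho_m(n) = 
\max_{1 \le i \le m} \delta_i$, then by the method of inclusion and exclusion we obtain that 

$$P\{\rho_m(n) > k\} = \sum_{1 \le j \le (n-m)/k} (-1)^{j-1}\binom{m}{j}\binom{n - jk - 1}{m - 1}\Big/\binom{n-1}{m-1}$$

for $k = 1, 2, \ldots, n$. 
For any positive integer $j$ we have 

$$\sum_{0 \le k \le (n-m)/j}\binom{n - jk - 1}{m - 1} = \frac{n^m}{j\,m!} + o(n^m)$$

as $n \to \infty$. (See [1].) Consequently, 

$$E_n = E\{\rho_m(n)\} = \sum_{k \ge 0} P\{\rho_m(n) > k\} = \frac{n}{m}\sum_{j=1}^{m}(-1)^{j-1}\binom{m}{j}\frac{1}{j} + o(n)$$

as $n \to \infty$. Since 

$$\sum_{j=1}^{m}(-1)^{j-1}\binom{m}{j}\frac{1}{j} = \int_0^1 \frac{1 - (1-x)^m}{x}\,dx = \int_0^1 (1 + y + \cdots + y^{m-1})\,dy = \sum_{j=1}^{m}\frac{1}{j},$$

it follows that 

$$\lim_{n\to\infty}\frac{E_n}{n} = \frac{S_m}{m}$$

for $m = 1, 2, \ldots$. 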
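
The limit can be watched numerically. The Python sketch below is a check under our own conventions rather than part of the solution: it counts the ordered partitions of n into m parts with every part at most k by dynamic programming, and then uses E[max] = sum over k >= 0 of P(max > k). The function names are hypothetical.

```python
from fractions import Fraction
from math import comb

def count_bounded(n, m, k):
    """Number of ordered partitions of n into m parts, each part between 1 and k."""
    ways = [1] + [0] * n                 # ways[s]: partial sums s using 0 parts
    for _ in range(m):
        new = [0] * (n + 1)
        for s in range(n + 1):
            if ways[s]:
                for part in range(1, min(k, n - s) + 1):
                    new[s + part] += ways[s]
        ways = new
    return ways[n]

def expected_largest_part(n, m):
    """Exact expected value of the largest part, all C(n-1, m-1) partitions equally likely."""
    total = comb(n - 1, m - 1)
    # P(max > k) = 1 - (number with all parts <= k) / total.
    return sum(Fraction(total - count_bounded(n, m, k), total) for k in range(n))

if __name__ == "__main__":
    m = 3
    harmonic = sum(Fraction(1, i) for i in range(1, m + 1))
    for n in (10, 20, 40, 60):
        print(n, float(expected_largest_part(n, m) / n), float(harmonic / m))
```

For m = 3 the first printed column should drift toward S_3/3 = 11/18 as n grows.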
Reference 
1. L. Takacs, A sum of binomial coefficients, Mathematics of Computation, 32 (1978) 1271-1273. 
Also solved by Victor Hernandez (Spain), L. E. Mattics, Roger S. Pinkham, Helmut Prodinger (Austria), and the 


proposers. 


The Sum of a Series 


6387 [1982, 338]. Proposed by Jeff Alden, Vector Research Inc., Ann Arbor, Michigan. 
Let $a$, $y$ satisfy $-e^{-1} < a < e^{-1}$, $y = e^{ay}$. Prove that 

$$\sum_{n=0}^{\infty}\frac{n^n a^n}{n!} = \frac{1}{1 - ay}.$$


Solution by Mark A. Pinsky, Northwestern University. The additional condition $|ay| < 1$ is 
required to ensure that the correct solution of $y = e^{ay}$ is chosen. The result then holds for complex 
$a$ with $|a| < e^{-1}$. 

We begin with the identity 

$$\sum_{n=0}^{\infty}\frac{n^n a^n}{n!} = \frac{1}{2\pi i}\oint_{|z|=1}\frac{dz}{z - ae^z}.$$

This is proved by writing 

$$\frac{n^n}{n!} = \frac{1}{2\pi i}\oint_{|z|=1}\frac{e^{nz}}{z^{n+1}}\,dz,$$

multiplying by $a^n$ and integrating the resulting uniformly convergent series term-by-term. By 
Rouché’s theorem, the function $z - ae^z$ has exactly one zero $z_0$ in the unit disc $|z| < 1$. By the 
residue theorem the integral in question is then equal to $1/(1 - ae^{z_0})$, which yields the required 
result if we make the identification $z_0 = ay$. 
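
A quick numerical comparison illustrates the identity and the role of the side condition |ay| < 1. The Python sketch below is only a sanity check with names of our choosing: it sums the series using logarithms to avoid overflow (reading 0^0 as 1), and it finds y by iterating y -> exp(a*y) from y = 1, which for real nonzero a with |a| < 1/e is assumed here to settle on the root with |ay| < 1.

```python
import math

def series_sum(a, terms=400):
    """Partial sum of sum_{n>=0} n^n a^n / n! for real nonzero a with |a| < 1/e."""
    total = 1.0                                  # n = 0 term, with 0^0 read as 1
    for n in range(1, terms + 1):
        log_magnitude = n * math.log(n) + n * math.log(abs(a)) - math.lgamma(n + 1)
        sign = -1.0 if (a < 0 and n % 2 == 1) else 1.0
        total += sign * math.exp(log_magnitude)
    return total

def small_root(a, iterations=200):
    """Fixed-point iteration for y = exp(a*y), started at y = 1."""
    y = 1.0
    for _ in range(iterations):
        y = math.exp(a * y)
    return y

if __name__ == "__main__":
    for a in (0.2, -0.3, 0.05):
        y = small_root(a)
        print(a, series_sum(a), 1.0 / (1.0 - a * y))
```

The two printed columns should agree to many decimal places for each sample value of a.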


Also solved, by the proposer and 47 others, a number of whom pointed out that the result is known and 
referenced the solution to problem 214 on page 348 in Problems and Theorems in Analysis I, by G. Polya and G. 
Szego, Springer-Verlag, 1972. 


On Rolling a Cube and a Tetrahedron 


6388 [1982, 338]. Proposed by Nicholas Wheeler and Howard Straubing, Reed College, Portland, 
Oregon. 


A regular tetrahedron R sits on a unit triangle T on a plane tiled with triangles congruent to T. 
A move consists in rotating R about an edge in contact with the plane. After several moves, R sits 
on T again. Have the vertices of R been permuted in space? What if R is a cube and the tiling is 
by squares? 


Solution by Jerrold W. Grossman, Oakland University, Rochester, Michigan. The answer is no 
for the tetrahedron and maybe for the cube. The latter is easy to see, since rotation north, then 
east, then south, then west, returns the cube to its starting position with the vertices permuted. To 
prove the former, note that the plane can be coordinatized so that the positive y-axis makes a 60° 
angle with the positive x-axis. Then the vertices of the triangular tiling are represented by pairs of 
integers (i, j). Reducing the coordinates modulo 2 we obtain a 4-coloring of the vertices of the 
partition. It is clear that if tetrahedron R, whose vertices are labelled by the same four “colors,” is 
placed on the tiling so that colors match, then the matching is preserved by a rotation of R about 
an edge in contact with the plane. 


Also solved by Miroslav D. Asic (Yugoslavia), David A. Barrington, Benny N. Cheng, the Chico Problem Group, 
Hans E. Debrunner (Switzerland), Clayton W. Dodge, Pravin Kumar (India), Jeff Loveland, John R. Silvester 
(England), F. B. Strauss, David M. Wells, and the proposers. 

A number of solvers showed that the cube can return to 12 of its 24 possible configurations. Hans E. Debrunner 
proved that a regular icosahedron in place of the tetrahedron can return to any of its 60 configurations, and a regular 
octahedron to 4 of its 24 configurations. 
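
The remark that the cube can return in 12 of its 24 configurations can be confirmed by a small search. The Python sketch below is our own model, not taken from any solver: an orientation is a 3 x 3 rotation matrix, a roll east or west is a quarter turn about the y-axis, a roll north or south is a quarter turn about the x-axis, and the search is confined to the squares with |x|, |y| <= 2, which is assumed to be enough room.

```python
from collections import deque

IDENTITY = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
ROT_X = ((1, 0, 0), (0, 0, -1), (0, 1, 0))     # quarter turn about the x-axis
ROT_Y = ((0, 0, 1), (0, 1, 0), (-1, 0, 0))     # quarter turn about the y-axis

def matmul(a, b):
    """Product of two 3x3 integer matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3))
        for i in range(3)
    )

def transpose(a):
    """Transpose, which is the inverse for a rotation matrix."""
    return tuple(zip(*a))

MOVES = {
    (1, 0): ROT_Y,
    (-1, 0): transpose(ROT_Y),
    (0, 1): ROT_X,
    (0, -1): transpose(ROT_X),
}

def orientations_at_start(radius=2):
    """Orientations the cube can have on its starting square, rolling within the given radius."""
    start = ((0, 0), IDENTITY)
    seen = {start}
    queue = deque([start])
    while queue:
        (x, y), rotation = queue.popleft()
        for (dx, dy), turn in MOVES.items():
            nx, ny = x + dx, y + dy
            if abs(nx) > radius or abs(ny) > radius:
                continue
            state = ((nx, ny), matmul(turn, rotation))
            if state not in seen:
                seen.add(state)
                queue.append(state)
    return {rotation for position, rotation in seen if position == (0, 0)}

if __name__ == "__main__":
    print(len(orientations_at_start()))
```

The count is bounded by 12 because every return to the start uses an even number of quarter turns; the search shows that the bound is attained.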


On a Rayleigh Quotient 


6389 [1982, 339]. Proposed by Alfonso Castro B., Centro Estudios Avanzados, Mexico, D. F. 


Let $\Omega$ denote a generic bounded region in real $n$-space $R^n$. Let $H^1(\Omega)$ be the Sobolev space of 
square integrable functions in $\Omega$ having generalized first-order partial derivatives in $L^2(\Omega)$. (See R. 
Adams, Sobolev spaces, Academic Press, 1975.) With integrals over $\Omega$, define 

$$J(u) = \Big[\sum_{i=1}^{n}\int (\partial u/\partial x_i)^2\Big] \Big/ \int u^2.$$

Find a bounded region $\Omega$ such that the infimum of $J$ (over the nonzero functions $u$ in $H^1(\Omega)$ with 
$\int u = 0$) is 0. 


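Solution by the proposer. Let $c = \sum_{k=2}^{\infty} 1/k^2$, $d_n = \sum_{k=2}^{n} 1/k^2$ $(n = 2, 3, \ldots)$ and $d_1 = 0$. Let 

$$\Omega = \{(x, y) \in R^2;\ -1 < x \le 0 \text{ and } 0 < y < 1,\ \text{or } d_{2n-1} < x < d_{2n} \text{ and } 0 < y < 1/(2n+1)^4\ (n = 2, 3, \ldots),$$
$$\text{or } d_{2n} \le x \le d_{2n+1} \text{ and } 0 < y < 1\ (n = 1, 2, \ldots),\ \text{or } 0 < x < 1/4 \text{ and } 0 < y < 1/81\}.$$

Let $f_n: [-1, c] \to R$ be defined by 

$$f_n(x) = \begin{cases} \varepsilon_n & \text{if } x \le d_{2n-1}, \\ (1 - \varepsilon_n)(2n)^2(x - d_{2n-1}) + \varepsilon_n & \text{if } d_{2n-1} < x < d_{2n}, \\ 1 & \text{if } x \ge d_{2n}, \end{cases}$$

where $-c < \varepsilon_n < 0$ is such that if $u_n(x, y) = f_n(x)$, then 

(1) $\int_\Omega u_n(x, y)\,dx\,dy = 0$. 

Clearly $u_n \in H^1(\Omega)$ and 

(2) $\int_\Omega u_n^2(x, y)\,dx\,dy \ge \sum_{k=n}^{\infty} 1/(2k+1)^2$. 

On the other hand 

(3) $\int_\Omega (\partial u_n/\partial x)^2 + \int_\Omega (\partial u_n/\partial y)^2 = (1 - \varepsilon_n)^2(2n)^2/(2n+1)^4$. 

From (2) we see that 

(4) $\int_\Omega u_n^2 \ge \sum_{k=n}^{2n} 1/(2k+1)^2 \ge n/(4n+1)^2$. 

Combining (3) and (4) we see that $J(u_n) \to 0$ as $n \to \infty$, which proves that 

$$0 = \inf\Big\{J(u);\ u \in H^1(\Omega),\ u \ne 0 \text{ and } \int u = 0\Big\}.$$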


Also solved by Alfonso Villani (Italy). 


A Determinant 


6390* [1982, 339]. Proposed by P. Venzi and J. Aguadé, ETH, Zürich, Switzerland. 

Let $c_1, \ldots, c_n$ be real numbers, with $c_i \ne c_j$ if $i \ne j$. Consider the $n \times n$ matrix $(a_{ij})$ defined by 
$a_{ij} = c_ic_j/(c_j - c_i)$ for $i < j$, $a_{ij} = -a_{ji}$ for $i \ne j$ and $\sum_{j=1}^{n} a_{ij} = -c_i$, $i = 1, \ldots, n$. Prove that 
$\det(a_{ij}) = (-1)^n n!\,c_1 \cdots c_n$. 

Solution by E. G. Straus, University of California at Los Angeles. We can factor out a factor $c_i$ 
from the $i$th row to get 

$$\det(a_{ij}) = c_1c_2 \cdots c_n \det(b_{ij})$$

where 

$$b_{ij} = \begin{cases} c_j/(c_j - c_i) & i \ne j, \\ -1 - \sum_{k \ne i} c_k/(c_k - c_i) & i = j. \end{cases}$$

We first prove that $\det(b_{ij})$ is a constant independent of the values of $c_1, \ldots, c_n$. Since it is 
clearly a homogeneous rational function of degree 0 in these variables, it suffices to prove that the 
denominator, which is a divisor of $\prod_{1 \le i < j \le n}(c_i - c_j)^2$, is of degree 0. 

By symmetry it suffices to prove that the factor $c_1 - c_2$ does not occur in the denominator. 
Since it occurs only in the denominators of $b_{11}, b_{12}, b_{21}, b_{22}$, we modify $\det(b_{ij})$ by first adding 
columns $2, 3, \ldots, n$ to the first column and then subtracting the first row from the second row. 
This yields a matrix $(b'_{ij})$ with 

$$b'_{11} = -1,\quad b'_{1j} = b_{1j} \text{ for } j \ge 2;$$
$$b'_{21} = 0,\quad b'_{22} = -2 + \frac{c_3}{c_2 - c_3} + \cdots + \frac{c_n}{c_2 - c_n},$$
$$b'_{2j} = \frac{c_j}{c_j - c_2} - \frac{c_j}{c_j - c_1} = \frac{c_j(c_2 - c_1)}{(c_j - c_1)(c_j - c_2)} \text{ for } j > 2;$$
$$b'_{i1} = -1 \text{ for } i > 2;\quad b'_{ij} = b_{ij} \text{ for } i > 2,\ j \ge 2.$$

Thus the only $b'_{ij}$ with denominator having $c_1 - c_2$ as a factor is $b'_{12}$, and in $\det(b'_{ij})$ the cofactor 
of $b'_{12}$ is a determinant with first row 

$$(0, b'_{23}, b'_{24}, \ldots, b'_{2n})$$

which is divisible by $c_1 - c_2$. 
Having established that $\det(b_{ij})$ is a constant, it is easy to find its value. Set $d_{ij} = c_j/(c_j - c_i)$ 
if $i < j$. Then 

$$b_{ij} = \begin{cases} -i + d_{1i} + d_{2i} + \cdots + d_{i-1,i} - d_{i,i+1} - \cdots - d_{in} & i = j, \\ d_{ij} & i < j, \\ 1 - d_{ji} & i > j. \end{cases}$$

Successively letting $c_1 \to \infty$, then $c_2 \to \infty$, etc., we find that every $d_{ij} \to 0$, and hence 

$$\det(b_{ij}) = \begin{vmatrix} -1 & 0 & \cdots & 0 \\ 1 & -2 & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 1 & 1 & \cdots & -n \end{vmatrix} = (-1)^n n!.$$
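
The determinant identity can be checked exactly for small n. The Python sketch below is an illustration with names of our choosing. It assumes the sign convention a_ij = c_i c_j / (c_j - c_i) for i different from j (this is the convention under which the case n = 2 gives 2 c_1 c_2), with each diagonal entry fixed by the row-sum condition sum_j a_ij = -c_i.

```python
from fractions import Fraction
from math import factorial, prod

def build_matrix(c):
    """Matrix (a_ij) for distinct numbers c, under the sign convention stated above."""
    n = len(c)
    a = [[Fraction(0)] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                a[i][j] = Fraction(c[i] * c[j], c[j] - c[i])
        # Diagonal entry chosen so that row i sums to -c_i.
        a[i][i] = -c[i] - sum(a[i][j] for j in range(n) if j != i)
    return a

def det(matrix):
    """Determinant by exact Gaussian elimination over the rationals."""
    a = [[Fraction(v) for v in row] for row in matrix]
    n = len(a)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for k in range(col, n):
                a[r][k] -= factor * a[col][k]
    return result

def claimed_value(c):
    """(-1)^n n! c_1 ... c_n."""
    n = len(c)
    return (-1) ** n * factorial(n) * prod(c)

if __name__ == "__main__":
    for c in ([1, 2], [1, 2, 3], [1, 2, 4, 7], [3, -1, 5, 2, 8]):
        print(c, det(build_matrix(c)), claimed_value(c))
```

Exact rational arithmetic is used so that agreement between the two printed columns is not an artefact of rounding.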


Also solved by Mihaly Bencze (Romania), C. S. Karuppan Chetty (India), Clark Givens, C. L. Mallows, and 
Y. S. Sathe & R. B. Bapat (India). 


Prawn’s Moves 


6391 [1982, 339]. Proposed by Louis W. Shapiro, Howard University, Washington, D.C. 


Let a prawn be a new chess piece that moves like a rook in one direction, and like a pawn in 
the perpendicular direction. A prawn starts in the lower left-hand corner of an n X n board. On 
each move, it can progress either any number of squares straight up (remaining on the board) or 
one square to the right. 

(a) In how many ways can the prawn get to the mth diagonal (defined by x + y = m on the 
standard lattice)? This sequence starts 1,2,5,13,.... 

(b) In how many ways can the prawn get to the opposite corner? This sequence starts 
1,2, 9, 44,.... 


Solution by O. P. Lossers, Department of Mathematics and Computing Science, Eindhoven 
University of Technology, Eindhoven, The Netherlands. (a) The answer $a_m$ equals $f_{2m-1}$, where 
$f_0, f_1, \ldots$ are the Fibonacci numbers given by $f_0 = 0$, $f_1 = 1$, $f_i = f_{i-1} + f_{i-2}$. Indeed, depending 
on whether the last move was of the pawn or of the rook type, the $m$th diagonal can be reached in 
$a_{m-1}$ or $a_1 + a_2 + \cdots + a_{m-1}$ ways respectively. Hence 

$$a_m = 2a_{m-1} + \sum_{i=1}^{m-2} a_i,$$

and consequently 

$$a_m = 3a_{m-1} - a_{m-2}.$$

Since $a_1 = f_1 = 1$, $a_2 = f_3 = 2$, it follows that $a_m$ and $f_{2m-1}$ satisfy the same recurrence relation and 
hence that $a_m = f_{2m-1}$. 
(b) To reach the opposite corner the prawn makes $n$ pawn moves and $k$ rook moves. These can 
be ordered in $\binom{n+k}{k}$ different ways. The $k$ rook moves must add up to $n$ steps upwards, and the 
number of possibilities for this is $\binom{n-1}{k-1}$, the number of partitions of $n$ into $k$ parts. It follows 
that the prawn can get to the opposite corner in 

$$B_n = \sum_{k=1}^{n}\binom{n+k}{k}\binom{n-1}{k-1}$$

different ways. From E. T. Whittaker and G. N. Watson, Modern Analysis, Cambridge, 1920, page 
311, we quote the following expression for the Legendre polynomial $P_n(x)$: 

$$P_n(x) = \sum_{k=0}^{n}\binom{n+k}{k}\binom{n}{k}\left(\frac{x-1}{2}\right)^k.$$

Differentiation yields 

$$P_n'(x) = \frac{n}{2}\sum_{k=1}^{n}\binom{n+k}{k}\binom{n-1}{k-1}\left(\frac{x-1}{2}\right)^{k-1}$$

and hence 

$$B_n = \frac{2}{n}P_n'(3).$$

Using a well-known recurrence relation for Legendre polynomials, we obtain the following 
recurrence relation for $B_n$: 

$$B_0 = 1,\quad B_1 = 2,\quad B_{n+1} = \frac{3(2n+1)}{n+1}B_n - \frac{n-1}{n}B_{n-1}.$$
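
Both sequences quoted in the problem can be reproduced by direct path counting. The Python sketch below is our own check, with hypothetical function names: it places the prawn at (0, 0), counts paths to every square by dynamic programming (one step right, or any positive number of steps up), and compares the corner counts with the binomial sum B_n = sum_k C(n+k, k) C(n-1, k-1) and with the three-term recurrence B_{n+1} = 3(2n+1)/(n+1) B_n - (n-1)/n B_{n-1}.

```python
from fractions import Fraction
from math import comb

def prawn_paths(n):
    """ways[x][y] = number of prawn paths from (0, 0) to (x, y), for 0 <= x, y <= n."""
    ways = [[0] * (n + 1) for _ in range(n + 1)]
    ways[0][0] = 1
    for x in range(n + 1):
        for y in range(n + 1):
            if x == 0 and y == 0:
                continue
            total = ways[x - 1][y] if x > 0 else 0        # last move: one square right
            total += sum(ways[x][j] for j in range(y))    # last move: straight up from (x, j)
            ways[x][y] = total
    return ways

def corner_count_by_sum(n):
    """B_n from the binomial sum, with B_0 = 1."""
    if n == 0:
        return 1
    return sum(comb(n + k, k) * comb(n - 1, k - 1) for k in range(1, n + 1))

def corner_counts_by_recurrence(count):
    """First `count` values B_0, B_1, ... from the three-term recurrence."""
    values = [Fraction(1), Fraction(2)]
    for n in range(1, count - 1):
        values.append(Fraction(3 * (2 * n + 1), n + 1) * values[n] - Fraction(n - 1, n) * values[n - 1])
    return values[:count]

if __name__ == "__main__":
    grid = prawn_paths(6)
    print([sum(grid[x][d - x] for x in range(d + 1)) for d in range(7)])   # diagonal totals
    print([prawn_paths(k)[k][k] for k in range(7)])                        # corner counts
```

The diagonal totals should begin 1, 2, 5, 13 and the corner counts 1, 2, 9, 44, as stated in the problem.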


Also solved by Anthony E. Barkauskas, J. Binz (Switzerland), Roger Cuculiere (France), Roger B. Eggleton 
(Australia), Milton P. Eisner, C. Georghiou (Greece), Ira Gessel, Jerrold W. Grossman, J. Stephen Montague, 
Robert E. Shafer, Lajos Takacs, John Ward, David M. Wells, and the proposer. 


REVIEWS 


EDITED BY ALLAN L. EDMONDS AND JOHN H. EWING 
COLLABORATING EDITOR FOR FILMS: SEYMOUR SCHUSTER, CARLETON COLLEGE 


A Number for Your Thoughts: Facts and Speculations About Numbers from Euclid to the Latest 
Computers. By Stephen P. Richards. S. P. Richards, New Providence, New Jersey, 1982. 207 pp. 
$7.95 (paperback). 


UNDERWOOD DUDLEY 
Department of Mathematics, DePauw University, Greencastle, IN 46135 


Go to a mall. Go to a bookstore—a Waldenbooks, say. Go to the science section. What will 
you find? Four three-foot shelves: astronomy gets one shelf; weather, physics, and general science 
gets one shelf; biology gets one shelf; miscellaneous science gets one shelf. John McPhee is there, 
with The Curve of Binding Energy. Lewis Thomas is there. Isaac Asimov is there. The Soul of a 
New Machine is there. Is there any mathematics? One book and one only: Hofstadter’s Gödel, 
Escher, Bach. 


Go to another mall. Go to a B. Dalton’s bookstore this time. The management is more 
ambitious than Waldenbooks’, since there are at least five times as many titles on the shelves. 
General science is well represented, with books by Carl Sagan, Isaac Asimov, Jeremy Bernstein, 
and many others. There is Nigel Calder in the Physics section. There is the Field Guide to the 
Atmosphere in Earth Sciences, along with a misfiled Catastrophe Theory. There is even a lot of 
mathematics. But what kind of mathematics? The Statistics Problem Solver, Advanced Engineering 
Mathematics, Forgotten Algebra (A Refresher Course), only one copy of Courant and Robbins’ 
What is Mathematics? and just two copies of Davis and Hersh’s The Mathematical Experience. 
Most of the books are texts—sixty-one out of one hundred and one. There are seven reference 
works, four histories, five different works on mathematics anxiety. The books are almost all 
textbooks. The space devoted to astronomy is two and one-half times as large as that devoted to 
mathematics, and hardly any of the books are texts. Computer science gets twelve times as much 
space. Nature and the biological sciences are shelved separately, and their space is many times as 
great as the space given to mathematics. 


Go to a public library. Go to the science section and see that the books on mathematics take 
up about one-twentieth of the total space for science. And what books they are! Three works on 
the slide rule, Arithmetic for the Practical Man, Introduction to Plane Trigonometry. Old books, 
undisturbed for years, the dust thick on them. I suspect that most of them were gifts to the library 
from people discarding books no longer of any use. Why would anyone want to keep a 
mathematics book? 


Go to Books in Print, the 1982-83 edition. How many of the tens of thousands of books listed 
are classified under “Mathematics—Popular Works”? One hundred, would you think? Fifty? The 
number is eight. Eight only, and 37.5% of those were published in the Union of Soviet Socialist 
Republics. One of the remaining five is Technical Shop Mechanics, popular only in a restricted 
group. You can maintain, probably rightly, that the classifiers of the R. R. Bowker Company are 
not expert in classifying mathematics, but the fact that there are so few titles classified under 
Popular Mathematics shows that such works are rare and strange. 


Why are there so few popular mathematics works? Is it that, as G. H. Hardy wrote, 
“Exposition, criticism, appreciation, is work for second-rate minds” so no one reads about 
mathematics because the books are of such low quality? I think not. Some first-rate minds have 
produced popular works in mathematics and, quality aside, many books in other sciences succeed. 
Is it because publishers are today almost all part of some conglomerate, so they no longer want to 
publish books that have no chance of selling a huge number of copies? Perhaps, but were many 
popular mathematics books being read back in the golden age of publishing, whenever that was? 
Aside from the success of a Mathematics for the Million, or a World of Mathematics, I think there 
was the same low level of readership as there is now. Is it that there is not enough advertising? Is it 
the fault of television? Air pollution? What can it be? 


The answer is simple. People do not want to read about mathematics. They will not buy books 
about mathematics, no matter how well-written they are, no matter how many are published, no 
matter how hard they are advertised. They would rather read about other things. Quarks instead 
of primes. Black holes instead of Fermat numbers. Relativity instead of pi. 


Why the difference? Of course, books on physics and astronomy can have pictures more 
attractive than portraits of Gauss and Euler, but I think it is no coincidence that the number of 
books published and read in mathematics, physics, and astronomy is inversely proportional to the 
number of people who studied each subject in school. It is our schools which have succeeded in 
making mathematics so distasteful and unattractive that the vast majority of the population wants 
to have nothing to do with the subject after school. If the study of mathematics was not inflicted 
on everyone, we might see almost as many popular works in mathematics as we do in physics. 


It is curious that society insists that every citizen be taught to find solution sets of quadratic 
equations, not to confuse a number with a numeral, and to distinguish unary minus signs from 
binary ones when there is really no need for everyone to know these things. Mathematics is useful 
for some people and necessary for society, but so are the principles of sanitary engineering. It is 
curious because there is no great shortage of mathematical talent—if the financial rewards for 
mathematicians were as large as they are for, say, stockbrokers, graduate schools of mathematics 
would be turning away thousands of qualified applicants. It is curious because it is easy to spot 
mathematical talent, the utter lack of it is almost as easy to see, and no amount of schooling will 
change one or the other. Why indeed do we persist in continuing to force millions of suffering 
children to do something for which they have neither the talent nor the inclination? 


The present situation is more than curious, even pernicious, because it creates dislike for 
mathematics and anxiety about it. Dislike creates bad feelings about mathematics and by 
extension about the entire life of the mind and anxiety creates bad feelings about oneself. How 
much better to have a happier anxiety-free population! (That mathematics anxiety is not 
misplaced is shown by the record of professional anxiety-reducers. I have read no success stories 
of people who were once intimidated by decimals who are now proving theorems. The best that 
seems to happen is that some people are able to struggle through a semester of calculus with a 
grade of C.) To reduce anxiety is a waste of resources. To create dislike is a waste of resources. 
Waste is bad. 


There is no hope that mathematics books will ever be popular, and A Number for Your 
Thoughts illustrates this. Its author loves numbers. He is crazy about them, and he wants you to be 
crazy about them too. Numbers are fascinating! They are at least as interesting as quarks, at least 
as rich as black holes, and at least as easy to understand as relativity. But the author has published 
his book himself. He had no publisher to tell him that references and an index are necessary. He 
had no editor to eliminate the embarrassing stylistic lapses and correct the occasional errors of 
fact. So his book will fail. In a more perfect world, a revision of it would have succeeded. But in 
this world, popular works in mathematics are doomed to be unpopular. 


Emmy Noether, 1882-1935. By Auguste Dick. Translated by H. I. Blocher. Birkhäuser, Boston, 
MA, 1981. xiv + 193 pp. 

Emmy Noether, A Tribute to Her Life and Work. Edited by James W. Brewer and Martha K. 
Smith. Marcel Dekker, New York, 1981. 


IRVING KAPLANSKY 
Department of Mathematics, University of Chicago, Chicago IL 60637 


Emmy Noether was born on March 23, 1882. Her numerous admirers have found much to 
cheer during the centenary year of 1982. In addition to the two books I am reviewing, Springer 
will shortly publish her collected works, edited by Jacobson. In March of 1982 a symposium in her 
honor was held at Bryn Mawr, and there will be a volume of proceedings. So, in all, we have three 
books as a 100th birthday salute, with a fourth coming before long. 

I am a member of the steadily shrinking circle of people who have a personal recollection of 
Emmy Noether (when I was a freshman at Toronto she gave a colloquium talk). But I did not 
know at first hand the German mathematical scene between the first World War and 1933. I 
picture a “golden age” of ferment, excitement, and great progress. At the center was Göttingen, 
and algebra at Göttingen meant Emmy Noether. In 1933 her stature as a superb mathematician 
was firmly established. Yet, at the age of 51, her rank was “ausserordentlicher Professor” 
(associate professor, more or less). Even this had been achieved only by a concerted battle against 
prejudice. 

And then the storm broke. On April 2, 1933, her right to teach at Göttingen was withdrawn. 
The letter from the Prussian Ministry of Science, Art, and Public Education cited as its authority 
the new statutes of the National Socialist regime (which, for a time at least, gave lip service to the 
German tradition of orderly procedure). 

“It shall not be forgotten what America did during these last two stressful years for Emmy 
Noether and for German science in general.” These are the words of Hermann Weyl. Perhaps they 
are overgenerous. At least initially, the positions found for the illustrious immigrants (as Laura 
Fermi called them) were modest. Still, it was the bottom of the depression, and action had to be 
taken quickly. I think that the North American mathematical community can be reasonably proud 
of what was achieved. The reward was ample; there is no doubt that this infusion played a major 
role in transferring mathematical leadership from Germany to the U.S.A. Today another immigration 
is taking place; it is smaller but again illustrious. Our colleagues in the Soviet Union should 
ponder carefully the lessons of the past. 

In Emmy Noether’s case the transition was from Göttingen to Bryn Mawr till her untimely 
death on April 14, 1935. (An interesting sidelight is the fact that Edward R. Murrow, then with 
the Institute of International Education, played a role in the negotiations that brought her to Bryn 
Mawr.) 

Dick’s biography was published in German in 1970. It is likely to remain the standard work. 
The translation into English incorporates some minor changes. It will make the story of her life 
more widely available and I accordingly welcome it. 

I recommend to the reader the review by B. H. Neumann in the Bulletin of the London 
Mathematical Society (vol. 14, pp. 155-6). Some of his comments have my endorsement. I cannot 
resist mentioning that the typo he spotted (a misspelling of C. C. Faith on page 98) is not the only 
one: Herbrand is misspelled in the caption facing page 75. 

Completing the book are the obituaries by van der Waerden, Weyl, and Alexandrov, a 
bibliography, and a list of her Ph.D.’s. 

The volume edited by Brewer and Smith consists almost entirely of new material. Clark 
Kimberling, who had much to do with initiating the book, begins with a much expanded version 
of his 1972 MONTHLY biography. But I urge readers nevertheless to look up the previous article 
for the fascinating final section on the recovery of letters of Cantor, Dedekind, Frobenius, and 
Weber after a 33-year sojourn in a Philadelphia law office. 

The book continues with incisive reminiscences by Saunders Mac Lane of Göttingen in 
1931-3, together with a letter he wrote to his mother on December 8, 1931. A reader might enjoy 
supplementing this with the description of the writing of his Göttingen thesis that appears in his 
Selected Papers (Springer, 1979). 

Olga Taussky had a unique opportunity to observe Emmy Noether: in Göttingen in 1931-2 
and in Bryn Mawr in 1934-5. In my opinion her 20 pages of reminiscences bring Emmy Noether 
to life more than anything else in either book. 

In the next two chapters we find the van der Waerden and Alexandrov obituaries. These 
duplicate the corresponding portions of the Dick book. But it is not complete duplication, for the 
reader can have the fun of comparing two different translations. 

Available in English for the first time is her 1932 address to the Zürich International Congress. 

Five mathematicians contribute articles analyzing various aspects of Noether’s work. In 
alphabetic order they are Frohlich, Gilmer, Lam, McShane, and Swan. These papers add up to a 
contribution to the mathematical literature of lasting importance, and I move a vote of thanks. 

I feel that McShane’s article deserves special mention. I dare say it may surprise some of my 
readers to learn that many physicists are quite familiar with the name of Emmy Noether. But to 
them it does not conjure up commutative rings with the ascending chain condition, or even the 
mathematician who taught us the power of abstract algebra. They are thinking of a theorem about 
problems in the calculus of variations for which there is a group of transformations. The relevant 
paper is entitled “Invariante Variationsprobleme” and it appeared in the 1918 Göttinger Nachrichten. 

It was not through publications alone that her influence spread. Here are some of her Ph.D.’s: 
Grell, Levitzki, Deuring, Fitting, Witt, Tsen. Many others, such as Alexandrov, Artin, Herbrand, 
Krull, and van der Waerden, were deeply influenced. The circle of students and disciples that 
surrounded her in Göttingen was jokingly referred to as the “Noether boys.” After the transplantation 
to Bryn Mawr, she worked with four “Noether girls.” Three were postdoctoral fellows: 
Olga Taussky, Marie Weiss, and Grace Shover (Quinn). Ruth Stauffer (McKee) received a Bryn 
Mawr Ph.D. under her direction. 

The spirit of Emmy Noether lives on not only in her mathematical works, but also in her many 
students and colleagues, and in their mathematical progeny as well. 
The Theory of Spinors. By Elie Cartan. Dover, New York, 1981. (Unabridged republication of the 
complete English translation first published by Hermann of Paris in 1966.) x + 157 pp. $4.00. 


ROBERT HERMANN 
APSM, 53 Jordan Road, Brookline, MA 02146 


Cartan is certainly one of the greatest and most original minds of mathematics, whose work on 
Lie groups, differential geometry, and the geometric theory of differential equations is at the 
foundation of much of what we do today. In my view, his place in mathematics is similar to that 
of the great turn-of-the-century masters in other areas of intellectual life. Just as Freud was 
influenced by the mechanistic world view of 19th century science, but used this background to 
create something new and revolutionary which has profoundly influenced 20th century thought, so 
Cartan built, on a foundation of the mathematics which was fashionable in the 1890’s in Paris, 
Berlin and Göttingen, a mathematical edifice whose implications we are still investigating. His 
work was highly intuitive and geometric, but was also based on a formidable combination of 
original methods of calculation and analysis, ranging in mathematical expertise from algebra to 
topology. For example, he completed the work of Killing and Lie on the classification of simple 
Lie algebras. As Hawkins so convincingly demonstrates [1], this required the mastery of the most 
advanced algebraic technique of the 1890’s, a task at which Killing himself (who learned his 
algebra from Weierstrass!) had despaired. In the 1920’s, when he was already in his 50’s, he 
proved that the second homotopy group of a Lie group was zero, which was one of the first great 
general theorems about topology. As one can see in his Collected Works, he was a master of 
brutal calculations, and all of his work was based on an intimate knowledge of computational 
details and examples. In short, he was comparable to such great figures of mathematics as Gauss, 
Riemann, and Poincaré. 

Thus it is unfortunate that his original work is so inaccessible to the wide spectrum of 
mathematicians and scientists who now make use of it. (Of course, it is accessible through the 
expositions by many others in the last twenty years.) This short book is a translation of one which 
was written in French in the 1930’s, and is perhaps the most readable of his works. It was first 
published in 1966, when the work of Killing and Cartan on the classification of simple Lie groups 
was beginning to be applied in elementary particle physics. In terms of contemporary Lie group 
theory, it deals with the B and D series of simple Lie algebras and the Lie groups which go along 
with them, i.e., the orthogonal matrix groups over the real and complex numbers and their simply 
connected covering groups. Cartan starts off in complex Euclidean 3-space, describing “spinors” 
(which are elements of the vector space which carries the lowest-dimensional representation of the 
Lie algebra of 3 × 3 skew-symmetric real matrices) rather concretely and computationally, in a 
form which would satisfy the most down-to-earth physicist. He then goes on to generalize to the 
n-dimensional case in an explicit but, perhaps inevitably, sketchy way. Any of the topics treated 
here would be useful reading for a physicist trying to understand contemporary Lie group theory 
or for a mathematics student trying to understand how linear algebra is applied in group theory. 
The last brief chapter sketches how the ideas can be combined with his “method of the moving 
frame” to describe how the Dirac equation of quantum mechanics can be formulated for an 
arbitrary Riemannian manifold. This material is of historical interest in physics, since it foreshadows 
the recent “gauge theory” developments. Cartan also takes a certain delight in pointing 
out in the Introduction that he discovered spinors before Dirac, in 1913, as a by-product of his 
classification of representations of semi-simple Lie groups. 

I hope I am not insulting the memory of my greatest hero to say that this book is a fraud! I 
don’t believe that Cartan thought about the subject in the form in which it is presented here. 
Clearly, he is presenting a “vulgarization” of the general theory of semi-simple Lie algebras and 
groups, which he developed almost single-handedly (with the help of Hermann Weyl!) in the 
period 1893-1930. Cartan was very much a fan of physics, and he clearly is trying to teach the 
physicists of his day some of his profound knowledge in a form which they might find more 
palatable. The recently published correspondence between Einstein and Cartan [2] is very 
illuminating about the habits of mind of these two great men, and even somewhat sad. They were 
like ships passing in the night: Cartan enthusiastically tried to communicate some of his great 
geometric ideas to Einstein, who was rather closed-minded and even condescending. 

Finally, I would like to use this occasion to convey my thanks to Dover Publications. Their 
efforts to keep in print, at modest prices, the classics of science have been a great help in my own 
work. Many times I have picked up a Dover book at my local bookstore and discovered a jewel of 
which I was previously unaware. Even more important, it is one of the last links in today’s world 
with certain precious scholarly traditions. 
A First Course in Group Theory. By Cyril F. Gardiner. Springer-Verlag, New York, 1980. ix + 227 pp. 


RONALD SOLOMON 
Department of Mathematics, Ohio State University, Columbus, OH 43210 


The label “group” attaches itself to quite a menagerie of beasts. An introductory text can easily 
attain encyclopedic proportions, looming like a Mount Everest for the intrepid. This text, 
however, is for the more timorous who prefer a pleasant summer’s stroll through a few basic ideas 
and past several interesting examples. To motivate this brief excursion an ever-so-modest goal is 
proposed—the determination of all groups of order 1 through 15. In pursuit of this goal some 
important results are established, notably the structure theorem for finitely generated abelian 
groups and the Sylow theorems. Of course, the goal is clear and easy to state; it is, however, rather 
flimsy justification for the heavy artillery mobilized to attack it! 

Does the world need such a modest text? Certainly there is nothing here which is not already 
treated in greater depth in Zassenhaus’ classic text, as well as several successors. Usually a student 
is introduced to group theory at this level in a basic algebra course for which excellent texts such 
as Herstein’s Topics in Algebra are available. What then is the audience for such a book? Perhaps 
there is a place for it in a freshman or sophomore level “short course,” designed to give students a 
taste of noncalculus mathematics. Such courses have been proposed at Ohio State, but none has 
been offered, so I cannot speak from firsthand experience. Nonetheless, I believe such a course 
could be valuable and appealing to students. Accepting this as a raison d’être for the text, we can 
examine how it serves its audience. 

The elementary approaches to the study of finite groups are of two types: the permutation 
methods and the “normal subgroup” methods. The former feature combinatorial arguments and 
tend to apply only to finite groups. The latter are often instances of much more general techniques 
of value in the study of all groups, rings, and modules. How does each of these approaches fare in 
this text? 

Permutation arguments abound, but the role of permutation representations as a unifying idea 
in group theory is never made clear. On the one hand the standard lemmas on orbits and 
stabilizers are presented and used to prove Burnside’s Counting Theorem with a (much too 
trivial!) application to counting colored beads. On the other hand cosets and double cosets are 
introduced as tools for proving Lagrange’s theorem and Sylow’s theorems. But the two hands are 
never clasped! Nowhere is it mentioned that the cosets Hg of H in G are the H-orbits of the left 
regular permutation action on G or that the (H, K) double cosets are the K-orbits on H \ G. Such 
a unification of apparently disparate ideas would have lent beauty and clarity to the presentation, 
as well as considerably shortening some proofs. 
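
The unification asked for here can be written out in a few lines. What follows is only a sketch in standard notation, not taken from the book under review: the symbols Orb and Stab and the choice of a right action for K are assumptions made for the illustration.

```latex
% H acts on G by left multiplication: h . g = hg.
\[
  \operatorname{Orb}_H(g) = \{\, hg : h \in H \,\} = Hg,
  \qquad
  \operatorname{Stab}_H(g) = \{1\},
\]
% so every orbit has |H| elements and, for finite G, |G| = |H| [G:H] (Lagrange).

% K acts on the coset space H\G by right multiplication: (Hg) . k = Hgk.
\[
  \operatorname{Orb}_K(Hg) = \{\, Hgk : k \in K \,\},
  \qquad
  \bigcup_{k \in K} Hgk = HgK,
\]
% Hgk = Hg exactly when g k g^{-1} lies in H, which gives the stabilizer:
\[
  \operatorname{Stab}_K(Hg) = K \cap g^{-1}Hg,
  \qquad
  |HgK| = \frac{|H|\,|K|}{|K \cap g^{-1}Hg|}.
\]
```

Read this way, Lagrange's theorem and the double coset counting formula are both instances of the orbit-stabilizer lemma.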

The “normal subgroup” approach fares even worse. Although quotient groups are introduced 
early, some of the basic isomorphism theorems are deferred to the “epilogue.” Automorphisms are 
defined only in an exercise, although they make a prior guest appearance in the definition of a 
semidirect product. What a pity! The study of automorphisms of cyclic groups provides a lovely 
arena for the introduction of some elementary number theory and the appreciation of its interface 
with group theory. Depending on the background of the students, some interrelations between 
group theory and linear algebra could also be explored. 

Given this slighting of automorphisms, it is not surprising that semidirect products are 
banished immediately after their introduction. This is the one unforgivable failing of the text! 
Although the subject can become confusing to novitiates, even a brief effort would mobilize a 
powerful weapon for the construction and classification of groups. With this in hand the author 
could have quickly disposed of groups of order 1 to 15 or even consigned them to the exercises 
where they belong. This would have impressed students with the force of the techniques 
developed. Lacking this, he is forced to identify groups by their multiplication tables, a tedious 
and page-consuming process, hardly designed to convert the uninitiated to the power or beauty of 
group theory. 

On the positive side, the highlight of the book is the unit on finite subgroups of O(ℝ³). The 
classification of these subgroups is a lovely elementary result of considerable interest both within 
mathematics and in the physical sciences; it is neglected by most basic texts. It is disappointing 
that this material is not given more space and prominence! Surely for an audience of beginners 
nothing could be more fun than to explore the notions of symmetry and group actions using 
models of the Platonic solids! As it is, the tetrahedral, octahedral and icosahedral groups are never 
identified as isomorphic to A₄, S₄ and A₅, respectively, although these identifications are well 
within the scope of the text. Indeed the first is trivial, the second is easy and the third may be 
inferred from a geometrical observation about the icosahedron, which the students could discover 
themselves. A second approach to the icosahedral group could use the action on pairs of antipodal 
vertices to identify it as a subgroup of S₆ of order 60. Then Sylow theory could be invoked to 
establish that any subgroup of S₆ of order 60 is isomorphic to A₅. This would serve as a nontrivial 
illustration of Sylow methods. With the concept of semidirect products in hand, another fun topic 
could be explored: Euclidean groups, tilings of ℝ² and Escher’s graphics. 

The concluding chapter or “epilogue” touches on the general structure problem for finite 
groups, subdivided into the extension problem and the finite simple group problem. Some 
measure of the difficulty of the latter problem is hinted at by reference to the length and depth of 
the Feit-Thompson theorem. The former problem, however, receives a misleading treatment, 
which gives no sense of its complexity and might indeed be construed as equating the extension 
problem (the building of a new group out of smaller ones) with the composition series problem 
(the demolition of a given group into its constituent bricks). 

One final cavil concerns the exercises which are almost all rather pedestrian, but for which the 
author insists, nonetheless, on furnishing solutions. A bit more confidence in the ingenuity of 
students could have yielded a much meatier and more challenging volume. 

I am convinced that an audience exists for such a book but there is no text at this level which I 
can recommend. In terms of style and enthusiasm to communicate, the book under review is 
exemplary. Unfortunately, its flaws, particularly the belaboring of multiplication tables and 
groups of small order, render it unacceptable as a course text. 


LETTERS TO THE EDITOR 


Material for this department should be prepared exactly the same way as submitted manuscripts (see the inside 
front cover) and sent to Professor P. R. Halmos, Department of Mathematics, Indiana University, Bloomington, 
IN 47405. 


Editor: 


I enjoyed reading the article entitled “A Technique for Integration by Parts” by Herbert E. 
Kasube (this MONTHLY, March 1983, pp. 210-211). The author gave the impression that his 
method for choosing u and dv will always work. Although this technique is usually helpful in 
choosing u and dv, I have concluded that sometimes his technique should not be used. 


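As an example, consider $\int \frac{x e^{x}\,dx}{(x+1)^{2}}$. Using Professor Kasube’s technique, we would choose 

\[ u = \frac{x}{(x+1)^{2}} \quad\text{and}\quad dv = e^{x}\,dx. \]

Thus $du = \frac{(-x+1)\,dx}{(x+1)^{3}}$ and $v = e^{x}$. Therefore 

\[ \int \frac{x e^{x}\,dx}{(x+1)^{2}}
   = \frac{x e^{x}}{(x+1)^{2}} - \int \frac{(-x+1)\,e^{x}\,dx}{(x+1)^{3}}
   = \frac{x e^{x}}{(x+1)^{2}} + \int \frac{x e^{x}\,dx}{(x+1)^{3}} - \int \frac{e^{x}\,dx}{(x+1)^{3}}. \]

If we continue using Kasube’s technique on $\int \frac{x e^{x}\,dx}{(x+1)^{3}}$ and $\int \frac{e^{x}\,dx}{(x+1)^{3}}$, we will not progress 
to a solution. 

However, the evaluation of $\int \frac{x e^{x}\,dx}{(x+1)^{2}}$ can be easily accomplished if we choose $u = x e^{x}$ and 
$dv = \frac{dx}{(x+1)^{2}}$. 
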
David H. Brown 
Department of Mathematics 
Niagara County Community College 
3111 Saunders Settlement Road 
Sanborn, NY 14132 
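
The choice of u and dv proposed in the letter can be carried through in two steps. The computation below is a sketch added for illustration, assuming the integrand is $x e^{x}/(x+1)^{2}$ (the exponent is inferred from the expression for du in the letter); the final differentiation check is likewise an addition and not part of the letter.

```latex
% Integration by parts with the choice from the letter:
%   u = x e^x,  dv = dx/(x+1)^2.
\[
  u = x e^{x}, \quad du = (x+1)\,e^{x}\,dx,
  \qquad
  dv = \frac{dx}{(x+1)^{2}}, \quad v = -\frac{1}{x+1}.
\]
% The factor (x+1) in du cancels the denominator of v:
\[
  \int \frac{x e^{x}\,dx}{(x+1)^{2}}
  = -\frac{x e^{x}}{x+1} + \int e^{x}\,dx
  = -\frac{x e^{x}}{x+1} + e^{x} + C
  = \frac{e^{x}}{x+1} + C.
\]
% Check by differentiating the result:
\[
  \frac{d}{dx}\,\frac{e^{x}}{x+1}
  = \frac{e^{x}}{x+1} - \frac{e^{x}}{(x+1)^{2}}
  = \frac{x e^{x}}{(x+1)^{2}}.
\]
```

The choice works because differentiating $x e^{x}$ produces exactly the factor $(x+1)$ needed to cancel the denominator left by $v$.
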
MISCELLANEA 
117. 


Others believe that if there is one word that expresses the spirit of the age, it is parameter, a 
mathematical term now widely misused so that nobody finds himself in the hateful position of 
having to say boundary or limit. 

—Edwin Newman, Strictly Speaking, 
Warner Books, 1974, p. 17 

ERRATA AND ADDENDA 


Referring to his joint paper with Charles Blair, “A universal entire function,” which appeared in this MONTHLY, 
5 (1983) 331-332, Lee A. Rubel writes as follows: “Robert Burckel has kindly drawn our attention to the paper by 
Gerald MacLane, ‘Sequences of derivatives and normal families,’ J. Analyse Math., 2 (1952) 72-87 [MR 14, p. 74] 
[Zbl. 49, p. 57]. Among other things, MacLane constructed ‘a ubiquitous entire function’ by the same method we 
use, to have the same ‘universal’ property. However, he also gets estimates on the rate of growth of the function, so 
that his proof is somewhat more complicated than ours.” 














ARITHMETIC 


David Novak, Simmons College 
1983 Paper 514 pages 


Instructor’s Guide with Tests / Student 
Guide to Margin Exercises / Answer 
Key / Computerized Testing Package / 
Video Tapes 


INTRODUCTORY 
ALGEBRA 

David Novak, Simmons College 
1983 Paper 586 pages 


Instructor’s Guide with Tests / Student 
Guide to Margin Exercises / Answer 
Key / Computerized Testing Package / 
Video Tapes 


Available in January! 


INTERMEDIATE ALGEBRA 
FOR COLLEGE STUDENTS 
Second Edition 


Lawrence G. Gilligan, Mattatuck 
Community College 

Robert B. Nenno, Monroe Community 
College 

1984 Paper 512 pages est. 

Instructor’s Guide with Tests / Student 

Software Package — Apple® II Plus, 

or Apple® Ile 


BRIEF CALCULUS WITH 
APPLICATIONS 


Roland E. Larson and Robert P. 
Hostetler, both of The Pennsylvania 
State University, The Behrend 
College 

1983 Cloth 621 pages 

Answer Key / Student Solutions 

Manual 


ARITHMETIC FOR 
COLLEGE STUDENTS 
Fourth Edition 

D. Franklin Wright, Cerritos College 
1983 Cloth 386 pages 

Instructor’s Guide / Student Workbook 


INTERMEDIATE 
ALGEBRA 

David Novak, Simmons College 
1983 Paper 668 pages 


Instructor's Guide with Tests / Student 
Guide to Margin Exercises / Answer 
Key / Computerized Testing Package / 
Video Tapes 


Available in December! 


CALCULUS AND 
ANALYTIC GEOMETRY 
Second Edition 
Philip Gillett, University of 

Wisconsin, Marathon County Center 
1984 Cloth 992 pages est. 
Three Volume Complete Solutions / 
Selected Solutions / Introduction to Lin- 
ear Algebra / Calculus with a Computer 


BRIEF CALCULUS WITH 

APPLICATIONS 

Alternate Edition 

Roland E. Larson and Robert P. 
Hostetler, both of The Pennsylvania 
State University, The Behrend 
College 

1983 Cloth 433 pages 

Answer Key / Student Solutions 

Manual 


CALCULUS 


Second Edition 


Roland E. Larson and Robert P. 
Hostetler, both of The Pennsylvania 
State University, The Behrend 
College 


1982 Cloth 949 pages 


Instructor’s Guide / Study and 
Solutions Guide / Three Volume 
Complete Solutions / Calculus with a 
Computer 


UNDERSTANDABLE 
STATISTICS 

Concepts and Methods 
Second Edition 


Charles Henry Brase, Regis College 
Corrinne Pellillo Brase, Arapahoe 
Community College 


1983 Cloth 489 pages 
Instructor’s Guide 


APPLICATIONS 
OF COLLEGE 
MATHEMATICS 


Management, Life, and 
Social Sciences 

Donald L. Stancl and Mildred L. 
Stancl, both of Nichols College 

1983 Cloth 720 pages 


Instructor’s Guide / Computer Problem- 
Solving Guide with TRS-80® II Diskette 


INTRODUCTION 

TO PASCAL AND 

STRUCTURED DESIGN 

Nell Dale and David Orshalick, both 
of the University of Texas at Austin 

1983 Paper 470 pages 

Instructor’s Guide / Videotapes 


New for 1984! 


Heath Computer Language Wheel: 
TRS-80® II BASIC 
Applesoft® BASIC 
COBOL 
FORTRAN 77 
us toll free: 800-225-1388 
617-862-6650, ext. 1344 


D. C. HEATH AND COMPANY 
College Division 
125 Spring Street 


Now Available from 


D. C. Heath 


A Chapter in Mathematics History... 


A CONVERGENCE OF LIVES 


Sofia Kovalevskaia: 


Scientist, Writer, Revolutionary 
by Ann Koblitz 


The first woman in modern history to receive a 
doctorate in mathematics, Sofia Kovalevskaia 
was also a focal point for many of the brilliant 
scientific, mathematical, and literary minds of 
pre-revolutionary Russia. 


A CONVERGENCE OF LIVES — a fascinating 
account of the history of mathematics — 
A must for every mathematician! 


Dec. 1983 / 400 pp. / Hardcover / $19.95 / 0-8176-3162-3 / DISCOUNT PRICE: $16.95 
PROFESSIONAL 
OPPORTUNITIES IN THE 
MATHEMATICAL SCIENCES 


Eleventh Edition - 1983 (completely revised) 
41 pp. Paperbound. $1.50 (95¢ for orders of five or 
more) 


This informative booklet describes the 
background and education necessary 
for many jobs in the mathematical sci- 
ences, as well as the salary expecta- 
tions and prospects for employment in 
those fields. 

If you are thinking about a career in 
the mathematical sciences, or you are a 
faculty advisor helping young people 
make career choices, order your copy 
NOW! Some of the areas covered are 
applied mathematics and engineering, 
computer science, operations research, 
statistics, the actuarial profession, 
teaching, and job opportunities in gov- 
ernment, business, and industry. 


Order from: The Mathematical Association of America 
1529 Eighteenth Street, N.W. 
God and Games: Can This Be Mathematics? 


The answer is an emphatic “yes.” In SUPERIOR 
BEINGS, Steven J. Brams, renowned game theorist and 
political scientist, shows how the “theory of moves,” a 
dynamic theory of games he develops from first principles, 
can be used to explore such questions as: 

• Are God’s superior powers compatible with human free 
will? 

• Can they be reconciled with the problem of evil in the 
world? 

• In what situations is God’s existence “decidable”? 


In SUPERIOR BEINGS, Brams, author of Biblical 
Games (1980), draws on many stories from the Old Testa- 
ment, making this book a highly unusual and provocative 
application of mathematics to fundamental philosophical 
and theological questions. Ideal for teaching mathematics 
to humanities and liberal arts students. 


SUPERIOR BEINGS: 

If They Exist, How Would We Know? 

1983/202 pp./32 illus./Cloth $21.95/Paper $11.95 

ISBN 0-387-91223-1 (cloth)/ISBN 0-387-90877-3 (paper) 


To order your copy write: Springer-Verlag New York Inc., 
175 Fifth Ave., New York, N.Y. 10010. 


Pre-publication reviews: 


“Professor Brams has boldly invaded an unexplored region 
where modern game theory and decision theory find 
applications to monotheistic theology. His carefully constructed 
arguments would have perplexed Maimonides, Aquinas, 
Luther, or the great Muslim thinkers. ... But it is hard to 
see how they can be ignored by contemporary theologians.” 
— Martin Gardner 


“Does game-theoristic theology exist? This book is a fresh 
partial answer, modestly phrased and interestingly written. 
Readers will enjoy it and learn from it whether or not they 
believe in either God or von Neumann.” 
— Dr. Paul R. Halmos, Professor 
of Mathematics, Indiana University 
Berlin DDR-1080 Berlin 


Johannes-Dieckmann-Straße 10 


B. N. PŠENIČNYJ and Ju. M. DANILIN 


Numerische Methoden 


für Extremalaufgaben 
Translation from Russian 


270 pages, 1465 mm x 230mm, 
imitation leather, 55,— M 

Order data: 570 971 8/Pšeničnyj, 
Numer. Methoden 


This book describes methods and algorithms for 
the numerical solution of problems arising in 
mathematical optimization, economics, optimal 
control theory, and other fields of science and 
practice in which extrema of functions and 
functionals must be found. The main attention is 
directed to algorithms with a high rate of 
convergence and to those that are practically 
suited for implementation on computers. Methods 
for minimizing functions both with and without 
constraints on the independent variables are 
discussed. 


Philosophie - Geschichte - Psychologie 


Mathematik - Physik - Chemie 


B. KLOTZEK 
Einführung in 


die Differentialgeometrie 
Mathematische Schülerbücherei, Nr. 105 and 108 


Part I: 
144 pages, 75 figures, 142 mm x 200 mm, 
paperback, 12,— M 


Part II: 

140 pages, 45 figures, 142 mm x 200 mm, 
paperback, 12,—M 

Order data: 571 050 8/Klotzek, Einführung 2 


Both Part I and Part II are intended as guides 
to the practical application of differential 
geometry. Part I covers the theory of curves and an 
introduction to the theory of the bending of surfaces. 
The chapters in Part II cover the intrinsic geometry of a 
surface and an introduction to Riemannian 
geometry. 


The editions will be published in German. 
Our publications can be bought from any international 
bookseller. Should difficulties arise, please contact our 
publishers or the firm „BUCHEXPORT“, Volkseigener 
Aussenhandelsbetrieb der Deutschen Demokratischen 
Republik, DDR-7010 Leipzig, Leninstrasse 16. 
For printed information, please write to our 
publishers. It is free of charge. 


Reston 
Has Your Next Computer Textbook 


DISCRETE MATHEMATICS FOR COMPUTER SCIENTISTS 
Joe L. Mott, Abraham Kandel, Theodore P. Baker, all of Florida State University 


Requires no specific mathematical background outside of the material ordinarily covered 
in most college algebra courses. Particularly useful for teachers who are mathematicians 
but not computer scientists by profession or training. 

R1372-3 350 pp. 1983 $24.95 


COMPUTABILITY 
George J. Tourlakis, York University 


An authoritative, comprehensive introduction to recursion theory. Offers a very good 
balance of formalism and intuition. 
R0876-4 576 pp. Sept. 1983 $27.95 


MATH COMPANION FOR COMPUTER SCIENCE 


Zamir Bavel, University of Kansas 


Widely adopted text that thoroughly discusses the math concepts and techniques most 
often encountered in computer science and presents them in an easy-to-use format. 
R4300-1 384 pp. 1982 $21.95 cloth 

R4299-5 384 pp. 1982 $16.95 paper 


PROBLEM SOLVING FOR DISCRETE STRUCTURES 
Zamir Bavel, Roxy Peck, Jan Bergandy, and Jerzy Grymala-Busse 


An invaluable supplement that furnishes numerous exercises pertinent to the study of 
discrete structures. Complements the Math Companion for Computer Science chapter 
for chapter, but can be used for other texts. 

R5618-5 256 pp. Dec. 1983 $12.95 paper 


INTRODUCTION TO THE THEORY OF AUTOMATA 


Zamir Bavel, University of Kansas 


Covers both the algebraic and the combinatorial aspects of automata. Excellent treatment 
of sequential machines, sequential tasks and processes, and finite memory programs and 
devices. 

R3271-6 650 pp. 1983 $27.95 


MATHEMATICS FOR DATA PROCESSING, 2nd EDITION 
Frank J. Clark 


This best-selling text provides a broad background for students seeking an entry level 
position in data processing. A very solid integration of modern algebra, Boolean algebra, 
and programming techniques. 

R4263-1 326 pp. 1983 $21.95 


For adoption consideration, please write for examination copies on school 
letterhead to: 

Martin Conner, Marketing Manager 

Reston Publishing Company, Inc. 

11480 Sunset Hills Road 

Reston, VA 22090 


Prices subject to change without notice. 


THE MATHEMATICAL ASSOCIATION OF AMERICA 


1529 Eighteenth Street, N.W. 
Washington, DC 20036 


